DevJobs

Senior Data Engineer

Overview
Skills
  • SQL SQL
  • Python Python
  • NoSQL NoSQL
  • AWS AWS
  • GCP GCP
  • Azure Azure
  • Data pipeline development
  • Data warehousing
  • Database management
  • ETL
  • Machine learning data preparation
  • ML applications
  • Big Data
  • Performance optimization
  • AI
  • BigQuery
  • Data modeling

Company Overview:



Cellebrite’s (Nasdaq: CLBT) mission is to enable its customers to protect and save lives, accelerate justice, and preserve privacy in communities around the world. Cellebrite is a global leader in Digital Intelligence solutions for the public and private sectors, empowering organizations to master the complexities of legally sanctioned digital investigations by streamlining intelligence processes. Trusted by thousands of leading agencies and companies in more than 140 countries, Cellebrite’s Digital Intelligence platform and solutions transform how customers collect, review, analyze and manage data in legally sanctioned investigations.






Position Overview:



We are assembling an elite, small-scale team of innovators committed to a transformative mission: advancing generative AI from conceptual breakthrough to tangible product reality. As a Senior Data Engineer, you will be the critical data backbone of our innovation engine, transforming raw data into the fuel that powers groundbreaking GenAI solutions, driving Cellebrite's digital intelligence capabilities to unprecedented heights.





Your Strategic Role



You are not just a data engineer – you are a strategic enabler of GenAI innovation. Your primary mission is to:

  • Prepare, structure, and optimize data for cutting-edge GenAI project exploration
  • Design data infrastructures that support rapid GenAI prototype development
  • Uncover unique data insights that can spark transformative AI project ideas
  • Create flexible, robust data pipelines that accelerate GenAI research and development



What Sets This Role Apart



  • Data as the Foundation of AI Innovation
  • You'll be working at the intersection of advanced data engineering and generative AI
  • Your data solutions will directly enable the team's ability to experiment with and develop novel AI concepts
  • Every data pipeline you design has the potential to unlock a breakthrough GenAI project
  • Exploration and Innovation
  • Conduct deep data exploration to identify potential GenAI application areas
  • Work closely with AI researchers to understand data requirements for cutting-edge GenAI projects





Data Engineering Expertise



  • Advanced skills in designing data architectures that support GenAI research
  • Ability to work with diverse, complex datasets across multiple domains
  • Expertise in preparing and transforming data for AI model training
  • Proficiency in creating scalable, flexible data infrastructure




Technical Capabilities



  • Deep understanding of data requirements for machine learning and generative AI
  • Expertise in cloud-based data platforms
  • Advanced skills in data integration, transformation, and pipeline development
  • Ability to develop automated data processing solutions optimized for AI research




Research and Innovation Skills



  • Proven ability to derive strategic insights from complex datasets
  • Creative approach to data preparation and feature engineering
  • Capacity to identify unique data opportunities for GenAI projects
  • Strong experimental mindset with rigorous analytical capabilities





Requirements



  • Degree in Computer Science, Data Science, or related field
  • 4+ years of progressive data engineering experience


Demonstrated expertise in:

  • Cloud platforms (AWS, Google Cloud, Azure)
  • Big Data technologies
  • Advanced SQL and NoSQL database systems
  • Data pipeline development for AI/ML applications
  • Performance optimization techniques



Technical Skill Requirements


  • Expert-level SQL and database management
  • Proficiency in Python, with strong data processing capabilities
  • Experience in data warehousing and ETL processes
  • Advanced knowledge of data modeling techniques
  • Understanding of machine learning data preparation techniques
  • Experience integrating with BigQuery – advantage

Cellebrite