Senior ML Data Engineer - AV Dataset - Mobileye - Jerusalem District

Senior ML Data Engineer - AV Dataset

Overview

Job TypeHybrid

Experience4 years

Job PositionAI/ML

UpdatedApr 05, 2026

LocationJerusalem District

SalaryN/A

Skills

Python ꞏ 4y
Pandas
Numpy
Airflow
DuckDB
Embeddings
LLM agents
Camera models
Object detection
PyArrow
3D geometry
Segmentation
VLMs
Kubeflow
LanceDB
Argo
TensorRT
Triton
vLLM

The AI Engineering group builds modern infrastructure and solutions that improve how algorithms are developed at Mobileye.

We are a small, independent team of experienced engineers with a mix of skills in algorithms, software, and infrastructure. We work in a DevOps style and build cross-team solutions that support research and development of advanced perception algorithms.

Our flagship project is a unified AV dataset used to train and evaluate next-generation models. We take large volumes of multi-camera video, object labels, HD maps, and sensor data from across the organization, and turn it into a curated, high-quality training set - at scale.

We are looking for someone who brings ML and computer-vision depth to the team - someone who can help shape the intelligence layer that decides what data is worth training on.

What will your job look like:

Work collaboratively with shared ownership. Your focus area will be the curation and ML side of our data pipeline, but you will contribute across the full stack alongside the rest of the team.
Build and improve the curation pipeline - from vision-model embeddings and scene detection, through VLM-based scene analysis, to scoring, deduplication, and sampling that produces a balanced and diverse dataset.
Run and optimize GPU inference at scale (embedding extraction, VLM inference) across thousands of driving sessions using workflow orchestration.
Develop scoring and sampling strategies that ensure rare but important scenarios (night driving, adverse weather, hazardous situations) are well-represented in the final dataset.
Work with algorithm teams to understand what data gaps hurt model performance and translate those into curation criteria.
Build validation and diagnostics that measure dataset quality - not just pipeline health, but whether the data is actually good for training.
Contribute to the core dataset SDK, converter, and 3D-geometry tooling (camera projection, calibration, coordinate transforms).

All you need is:

4+ years in ML engineering, applied CV, or a similar role combining model work with production data systems.
Hands-on experience with vision models - embeddings, VLMs, or object detection/segmentation.
Strong Python and comfort with the PyData stack (NumPy, PyArrow, Pandas, DuckDB).
Experience building data or ML pipelines that run at scale (not just notebooks).
Solid understanding of 3D geometry and camera models - or the mathematical background to ramp up quickly.
Good understanding of LLM agents and agentic workflows, with genuine interest in applying them to data and engineering problems.
Ability to work across team boundaries with algorithm and infrastructure people.

Strong advantage:

Experience with autonomous-driving datasets or perception pipelines.
Familiarity with dataset curation techniques (active learning, hard-example mining, distribution balancing).
Experience with GPU inference serving (vLLM, Triton, TensorRT).
Familiarity with vector databases or columnar analytics (LanceDB, DuckDB).
Experience with workflow orchestration (Argo, Airflow, Kubeflow)

Mobileye

Similar jobs

Data Science team lead

Ramat GanJul 01, 2026
Agentic AI Researcher

Tel Aviv-YafoMay 27, 2026
Senior Algorithm Developer – Deep Learning

Center DistrictJun 05, 2026
Senior Deep Learning Researcher

Ramat GanJun 19, 2026
Algorithm and Applied AI Scientist (DSP)

Haifa DistrictJun 15, 2026
AI/ML Applied Researcher – Insights & Recommendations

Tel Aviv-YafoJun 27, 2026
Senior Data Scientist

Petah TikvaApr 08, 2026
Research Scientist, Sensing

Tel Aviv-YafoJun 09, 2026

Your Account

Your Account