Summary
We are seeking a talented Computer Vision / ML Researcher to develop diffusion-based generative models for video restoration in new imaging systems. The role will focus on large, high-quality models that restore degraded video while preserving fidelity, detail, motion, and temporal consistency. You will be part of an interdisciplinary team developing technologies that will shape future Apple products.
Deep hands-on experience with diffusion models is required. The ideal candidate has worked on diffusion-based image or video models and has a strong research track record, ideally including publications on diffusion, generative vision, video models, restoration, or inverse problems.
Description
Deep learning development: Design, train, evaluate, and optimize deep learning models for video restoration tasks such as denoising, deblurring, artifact removal, super-resolution, detail recovery, and temporal stabilization. Work with large-scale video datasets, synthetic and real degradations, and rigorous evaluation pipelines.
Research: Develop and adapt diffusion models for video restoration, including video diffusion, DiT-style architectures, latent diffusion, conditional diffusion, diffusion for inverse problems, and efficient sampling. Analyze results, compare against leading methods, and investigate approaches for fidelity preservation, hallucination control, temporal consistency, distillation, and production-quality inference.
Collaboration: Work closely with ML researchers, engineers, and cross-functional teams to translate research ideas into robust models and practical systems.
Minimum Qualifications
- Strong foundation in computer vision, machine learning, deep learning, and video processing.
- Deep hands-on experience with diffusion models.
- Experience training diffusion-based image or video models.
- Proficiency in Python and deep learning frameworks such as PyTorch.
- Hands-on experience training deep learning models using large-scale datasets.
- Experience with model evaluation, debugging, experimental analysis, and failure analysis.
- Master’s or PhD in Computer Science, Electrical Engineering, Machine Learning, Computer Vision, or a related field, or equivalent experience.
- Strong written and verbal communication skills.
- Ability to work both independently and collaboratively.
- 5+ years of relevant experience, or a PhD with relevant research contributions.
Preferred Qualifications
- Publications on diffusion models, generative vision, video models, restoration, or inverse problems at top-tier venues such as CVPR, ICCV, ECCV, NeurIPS.
- Experience with video diffusion, DiT architectures, latent diffusion, conditional diffusion, rectified flow, consistency models, or diffusion distillation.
- Experience with video restoration, super-resolution, denoising, deblurring, artifact removal, inverse problems, or computational imaging.
- Strong understanding of temporal consistency, motion, occlusion, flicker, hallucination control, and fidelity-preserving generation.
- Experience with efficient inference, model optimization, distillation, or deployment on constrained hardware.
- Background in signal processing, physics, computational imaging, or inverse problems.
At Apple, we believe accessibility is a fundamental human right. You’ll find that idea reflected in everything here — in our culture, our benefits and our digital tools. By welcoming as many perspectives as possible, we help you build a career where you feel like you belong.
Learn about accessibility in Apple’s workplace
Role Number: 200668550-0865