DevJobs

Data Engineer

Overview
Skills
  • Python
  • SQL
  • AWS
  • GCP
  • Big Data
  • Bioinformatics
  • Genomics
  • MLOps
We’re looking for a Data Engineer to build and scale the data infrastructure that powers our machine learning research. You’ll own pipelines and databases that make large, complex datasets usable for training foundation models.

What You'll Do

  • Build and maintain scalable data pipelines for genomic and biological data
  • Design and manage the company’s core database, including defining and evolving the ERD
  • Develop and orchestrate ETL workflows for ingestion, preprocessing, and validation
  • Optimize storage, retrieval, and distributed data processing

Requirements

  • 4+ years of experience in software engineering or a related field
  • Proficiency in Python and SQL
  • Hands-on experience with cloud platforms (AWS, GCP)
  • Solid software engineering practices
  • Strong experience with distributed data processing
  • Bonus: experience with big data, MLOps workflows, or bioinformatics/genomics
Converge Bio