DevJobs

Data Engineer

Overview
Skills
  • Python ꞏ 3y
  • Spark
  • Linux
  • RESTful API
  • CI/CD
  • AWS ꞏ 2y
  • Docker
  • Kubernetes
  • JSON ꞏ 3y
  • Databases ꞏ 3y
  • Glue ꞏ 2y
  • RDS ꞏ 2y
  • Redshift ꞏ 2y
  • Step Functions ꞏ 2y
  • Athena ꞏ 2y
  • EMR ꞏ 2y
  • HDFS
  • LLM
  • Parquet
  • Prompt engineering
  • AI
  • Text files
  • Avro
  • Delta Lake
  • GenAI
abra R&D is looking for a Data Engineer!

We are looking for a Data Engineer to join our R&D team and contribute to exciting AI-related projects.

The role involves ingesting and processing large volumes of data, performing deep analysis, and collaborating closely with Data Scientists.

You will design and develop critical, large-scale, and diverse data pipelines in both cloud and on-premise environments.

Requirements
  • Minimum 3 years of experience as a Data Engineer – mandatory
  • 3 years of hands-on experience with Python, including work with JSON files and databases – mandatory
  • At least 2 years of practical experience with AWS, using services such as Athena, Glue, Step Functions, EMR, Redshift, and RDS – mandatory
  • Experience working with text files for AI or LLM-related projects – strong advantage
  • Practical experience with Spark for large-scale data processing – advantage
  • Hands-on experience integrating and processing data via REST APIs – advantage
  • Solid understanding of optimization techniques and data partitioning with storage formats and systems such as Parquet, Avro, Delta Lake, and HDFS
  • Experience working with Docker, Linux, CI/CD tools, and Kubernetes
  • Familiarity with GenAI solutions or prompt engineering – strong advantage