At CodeValue, we specialize in delivering cutting-edge software solutions and driving innovation across industries. We are now looking for a Data Engineer to join our team and play a key role in building and optimizing large-scale Big Data systems in production environments.
Key Responsibilities
- Design, implement, and maintain Big Data pipelines in production.
- Work extensively with Apache Spark (2.x and above), focusing on complex joins, shuffle optimization, and performance improvements at scale.
- Integrate Spark with relational databases, NoSQL systems, cloud storage, and streaming platforms.
- Contribute to system architecture and ensure scalability, reliability, and efficiency in data processing workflows
Qualifications
- Proven hands-on experience as a Data Engineer in production Big Data environments.
- Hands-on experience in Python development is required
- Expertise in Apache Spark, including advanced performance optimization and troubleshooting.
- Practical experience with complex joins, shuffle optimization, and large-scale performance improvements.
- Familiarity with relational and NoSQL databases, cloud data storage, and streaming platforms.
- Strong understanding of distributed computing principles and Big Data architecture patterns.