DevJobs

Data Engineering Team Lead

Overview
Skills
  • Python Python ꞏ 2y
  • SQL SQL
  • NoSQL NoSQL
  • AWS AWS
  • GCP GCP
  • Deep Learning NLP
  • Fine Tuning LLMs
  • LLMs
  • Transformer Models
Explorium is a cutting-edge data science company that has their total funding to $127 million.

Explorium offers a first of its kind data science platform powered by augmented data discovery and cutting edge feature extraction with LLMs . By automatically connecting to thousands of external data sources and leveraging machine learning to distill the most impactful signals, the Explorium platform empowers data scientists and business leaders to drive decision-making by eliminating the barrier to acquiring the right data and enabling superior predictive power.

We are looking for a talented Data Engineering Team Lead with a passion for data and complex problems.

As a Team Lead, you will join a diversified engineering group consisting of Data Engineers and Machine Learning Engineers. You will work on data pipelines varying from entity resolution systems to unstructured textual datasets to create unique datasets, classification engines, and specialized inference to implement both infrastructure and serving. You will have a key role in Explorium’s R&D Organization, responsible for collecting, integrating, and serving high quality features for machine learning models.

At Explorium we believe strongly in personal and professional development, constantly researching new technologies and methodologies.

Responsibilities:

  • Lead the research of how we integrate LLMs with our data assets
  • Work closely with business and research teams to deliver high quality results to customers and partners.
  • Design and Implement complex end-to-end data pipelines including data extraction, feature engineering, and data serving.
  • Work with Large Language Models to deliver high scale, high quality features for customers
  • Take ownership of a true research project from POC to production.
  • Contribute to a wide variety of projects using a range of technologies and tools.
  • Lead top performers in a research and development setting
  • If you are someone who thrives in a fast-paced environment where being self-directed, creative, and determined are a requirement, we would love for you to join us.

Requirements:

  • 2+ years of industry experience with building data-intensive platforms.
  • 2+ years of hands-on experience programming in Python.
  • BSc/BA in Computer Science or equivalent military background.
  • Experience with working with complex data sets.
  • Experience with Databases, SQL and NoSQL, and Data modeling.
  • Experience working with cloud compute and storage services on AWS/GCP.
  • Experience with deep learning NLP problems - advantage
  • Experience with building with LLMs or transformer models - advantage
  • Experience with fine tuning LLMs - advantage.
Explorium