DevJobs

Staff MLOps Engineer

Overview
Skills
  • Java Java
  • Python Python
  • Kafka Kafka
  • Spark Spark
  • TensorFlow TensorFlow
  • Cassandra Cassandra
  • Elasticsearch Elasticsearch
  • Git Git
  • Jenkins Jenkins
  • GCP GCP
  • Docker Docker
  • Kubernetes Kubernetes
  • Airflow Airflow
  • BigQuery
  • HDFS
Taboola, the world’s largest content recommendation platform, is hiring a Staff MLOps Engineer for the Algo platform group.

About Algo Platform

The objective of the algo platform group is to own the existing algo platform (including health, stability, productivity and enablement), to facilitate and be involved in new platform experimentation within the algo craft and lead the platformization of the parts which should graduate into production scale. This includes support of ongoing ML projects while ensuring smooth operations and infrastructure reliability, owning a full set of capabilities, design and planning, implementation and production care.

The group has deep ties with both the algo craft as well as the infra group. The group reports to the infra department and has a dotted line reporting to the algo craft leadership.

The group serves as the professional authority when it comes to ML engineering and ML ops, serves as a focal point in a multidisciplinary team of algorithm researchers, product managers, and engineers and works with the most senior talent within the algo craft in order to achieve ML excellence.

What are some of the things you do on a day-to-day basis?

  • Develop, enhance and maintain highly scalable Machine-Learning infrastructures and tools, including CI/CD, monitoring and alerting and more
  • Have end to end ownership: Design, develop, deploy, measure and maintain our machine learning platform, ensuring high availability, high scalability and efficient resource utilization
  • Identify and evaluate new technologies to improve performance, maintainability, and reliability of our machine learning systems
  • Work in tandem with the engineering-focused and algorithm-focused teams in order to improve our platform and optimize performance
  • Optimize machine learning systems to scale and utilize modern compute environments (e.g. distributed clusters, CPU and GPU) and continuously seek potential optimization opportunities.
  • Build and maintain tools for automation, deployment, monitoring, and operations.
  • Troubleshoot issues in our development, production and test environments
  • Influence directly on the way billions of people discover the internet

Our Tech Stack

Java, Python, TensorFlow, Spark, Kafka, Cassandra, HDFS, vespa.ai, ElasticSearch, AirFlow, BigQuery, Google Cloud Platform, Kubernetes, Docker, git and Jenkins

What are the skills a good Machine Learning Ops Engineer needs to have?

  • Experience developing large scale systems. Experience with filesystems, server architectures, distributed systems, SQL and No-SQL. Experience with Spark and Airflow / other orchestration platforms is a big plus.
  • Highly skilled in software engineering methods. 8+ years experience.
  • Passion for ML engineering and for creating and improving platforms
  • Knowledge in algorithms and machine learning. Good familiarity with Machine learning concepts. 3+ years experience working closely with algorithm engineering teams.
  • Experience with designing and supporting ML pipelines and models in production environment
  • Excellent coding skills – in Java & Python
  • Experience with TensorFlow – a big plus
  • Possess strong problem solving and critical thinking skills
  • MSc in Computer Science or related field.
  • Proven ability to work effectively and independently across multiple teams and beyond organizational boundaries
  • Deep understanding of strong Computer Science fundamentals: object-oriented design, data structures systems, applications programming and multi threading programming
  • Strong communication skills to be able to present insights and ideas, and excellent English, required to communicate with our global teams.

It Would Be Great If You Also Have

  • Experience in leading Algorithms projects or teams.
  • Experience in developing models using deep learning techniques and tools
  • Experience in developing software within a distributed computation framework

Why Taboola?

  • Taboola is the world’s leading recommendation platform reaching over 500 million daily active users. We’re growing rapidly, and have recently gone public on the NASDAQ.
  • Adam Singolda, Taboola Founder and CEO says; “You can copy anything from another business but you can’t copy a company’s culture.”
  • If you ask our employees what they love about Taboola they will tell you that here, they are able to discover their best professional selves, explore where they can grow to, and learn from and together with smart and talented people.
  • At Taboola, we pride ourselves in making an impact on how people consume content across the world, a culture of transparency, passion, and a diverse, inclusive and friendly work environment.
  • Learn more about #TaboolaLife on LinkedIn, Facebook, Instagram, YouTube, & the Taboola Life Blog.

Want to learn more about us, you’re welcome to watch the latest News article about Taboola

Sounds good, how do I apply?

It’s easy, submit your CV by clicking the “Apply” button below.

Taboola is an equal opportunity employer and we value diversity in all forms. We are committed to creating an inclusive environment for all employees and believe such an environment is critical for success. Employment is decided on the basis of qualifications, merit, and business need.

Taboola