Senior DevOps Engineer - AI Infra Group

Skills
  • Python
  • Go
  • Bash
  • CI/CD
  • Jenkins
  • GitHub Actions
  • AWS
  • Docker
  • Kubernetes
  • Helm
  • Istio
  • Terraform
  • Ansible
  • Grafana
  • ArgoCD
  • Kubeflow
  • MLflow
  • Ray
  • Prometheus

Dream is a pioneering AI cybersecurity company delivering revolutionary defense through artificial intelligence. Our proprietary AI platform creates a unified security system safeguarding assets against existing and emerging generative cyber threats. Dream's advanced AI automates discovery, calculates risks, performs real-time threat detection, and plans an automated response. With a core focus on the "unknowns," our AI transforms data into clear threat narratives and actionable defense strategies.

Dream's AI cybersecurity platform represents a paradigm shift in cyber defense, employing a novel, multi-layered approach across all organizational networks in real time. At the core of our solution is Dream's proprietary Cyber Language Model, a groundbreaking innovation that provides real-time, contextualized intelligence for comprehensive, actionable insights into any cyber-related query or threat scenario.

We're seeking an experienced Senior DevOps Engineer to join our AI Infra Group, responsible for building robust infrastructure and CI/CD pipelines that enable internal teams to rapidly develop and deploy AI-driven products and tools. You'll be a key enabler for our development teams, providing them with streamlined workflows, reliable environments, and modern DevOps practices across hybrid cloud and on-premises infrastructure.

Responsibilities:

  • Build and maintain CI/CD pipelines and Kubernetes clusters (cloud + on-prem).
  • Implement Infrastructure as Code (Terraform, Ansible, Helm) and GitOps practices.
  • Automate deployment, monitoring, and scaling workflows.
  • Collaborate with engineers to improve developer experience and system reliability.
  • Ensure observability, security, and compliance across infrastructure.

Skills:

  • 5+ years of DevOps experience.
  • Strong Kubernetes and CI/CD expertise (Jenkins, ArgoCD, GitHub Actions).
  • Skilled in Terraform, Ansible, Helm, and scripting (Python, Bash, or Go).
  • Experience with AWS or hybrid environments.
  • Excellent problem-solving and collaboration skills.

Nice to Have

  • Experience with GPU infrastructure or ML platforms (Ray, Kubeflow, MLflow).
  • Familiarity with observability tools (Prometheus, Grafana) and service meshes (Istio).

Tech Stack

  • AWS, Kubernetes, Terraform, Ansible, Helm, Jenkins, ArgoCD, GitHub Actions, Docker, Python, Bash.
Dream Security