DevJobs

Site Reliability Engineering Manager

Overview
Skills
  • Python Python
  • TypeScript TypeScript
  • Node.js Node.js
  • CI/CD CI/CD
  • Azure Azure
  • Kubernetes Kubernetes
  • Grafana Grafana
  • Prometheus Prometheus
  • Infrastructure as Code
  • OpenTelemetry

Founded in 2002, AU10TIX is the global leader in AI-driven identity verification and management, protecting the world’s largest brands against advanced fraud. The company’s future-proof product portfolio helps businesses provide frictionless customer onboarding and verification in 4–8 seconds while staying ahead of emerging threats and evolving regulatory requirements.

We are seeking an SRE (Site Reliability Engineering) Team Leader | SaaS | Cloud-Native | Observability-Driven to build a lean, high-impact Site Reliability Engineering (SRE) function at the core of our SaaS platform’s production reliability and long-term quality strategy. This is a hands-on leadership role driving excellence, scalability, and innovation across our production environments. This role is perfect for someone who wants to define and own a strategic reliability function from day one.


What You’ll Do

• Lead and scale a small SRE team (2–3 engineers) with end-to-end ownership of observability and diagnostics across production.

• Design and implement a central observability platform supporting Engineering, Support, and NOC teams.

• Write production-grade code and automation to enhance system reliability, tooling, and platform resilience.

• Drive operational excellence through effective incident response, alerting, monitoring, and continuous reliability improvements.


Requirements:


• Deep experience in SRE or Production Engineering, ideally in cloud-native SaaS environments.

• Strong coding skills in Python, Node.js, or TypeScript - you’re expected to build, not just configure.

• Mastery of monitoring, logging, and distributed tracing (e.g., Prometheus, Grafana, OpenTelemetry).

• Solid understanding of CI/CD, Kubernetes, Infrastructure as Code, and scalable operations.

• Hands-on experience with Azure cloud infrastructure.

• A true “builder” mindset, hands-on, practical, and quality-obsessed.

AU10TIX