DevJobs

Site Reliability Engineer (SRE)

Overview
Skills
  • Bash Bash
  • Python Python
  • Ansible Ansible
  • Terraform Terraform

We’re growing and looking to hire Site Reliability Engineer (SRE) who embodies our core values: People First, Customer Obsession, Strive for Excellence, and Integrity.

We are looking for a skilled and motivated Site Reliability Engineer (SRE) to join our team and help ensure our production cloud environment's reliability, performance, and scalability. As an SRE, you will work at the intersection of software engineering and operations, taking ownership of system stability, incident response, automation, and continuous improvement of our infrastructure.

This role is ideal for engineers who thrive in dynamic environments, value reliability, and enjoy building resilient and scalable systems.

About Claroty:   

Claroty has redefined cyber-physical systems (CPS) protection with an unrivaled industry-centric platform built to secure mission-critical infrastructure. The Claroty Platform provides the deepest asset visibility and the broadest, built-for-CPS solution set in the market comprising exposure management, network protection, secure access, and threat detection – whether in the cloud with Claroty xDome or on-premise with Claroty Continuous Threat Detection (CTD). Backed by award-winning threat research and a breadth of technology alliances, The Claroty Platform enables organizations to effectively reduce CPS risk, with the fastest time-to-value and lower total cost of ownership. Our solutions are deployed by over 1,000 organizations at thousands of sites across all seven continents.

A Great Place to Work® certified company, Claroty is headquartered in New York City with employees across the Americas, Europe, Asia-Pacific, and Tel Aviv. The company is widely recognized as the industry leader in CPS protection, with backing from the world’s largest investment firms and industrial automation vendors, as well as being named a Leader in the 2025 Gartner® Magic Quadrant™ for CPS Protection Platforms, recognized by KLAS Research as Best in KLAS for Healthcare IoT Security five years in a row, and ranking on the Forbes Cloud 100 and Deloitte Technology Fast 500 multiple consecutive years. 



Responsibilities:


As an SRE, your impact will be:

  • Production Reliability: Ensure system uptime and performance by identifying and addressing potential issues before they affect end users.
  • Incident Response: Serve as part of the on-call rotation, rapidly diagnosing and resolving incidents, and conducting root cause analysis and postmortems.
  • Monitoring and Alerting: Build and maintain monitoring dashboards and alerting systems to detect and respond to anomalies in real time.
  • Automation and Tooling: Develop and maintain automation tools for deployments, scaling, and operational efficiency using Terraform, Ansible, Bash, or Python.
  • Infrastructure Maintenance: Perform regular maintenance and upgrades of production infrastructure to ensure security, stability, and performance.
  • Release Engineering: Support and optimize the rollout of new features and updates, minimizing risk and impact on production environments.
  • Staging Environment Management: Ensure staging environments accurately reflect production for robust testing and validation of changes.
Claroty