About Astelia
Astelia helps security teams dramatically reduce the volume of incidents generated by vulnerability management tools. We do this by building a deep, contextual understanding of our customers' networks - turning noise into signal and enabling security teams to focus on what truly matters. We're a small, fast-moving startup with great people who care about impact, craftsmanship, and each other.
About The Role
We're looking for a DevOps Lead to own and shape our infrastructure from the ground up. This is a hands-on leadership role — one moment you're writing Terraform and designing CI/CD pipelines, the next you're mentoring engineers and driving architectural decisions that will define how we operate at scale.
You'll build the platform our engineering and security teams depend on every day, and grow the team around it as we scale. If you love working end-to-end — from architecture to operations — and want outsized impact in a fast-moving, early-stage team, we'd love to meet you.
Responsibilities
- Lead and grow the DevOps team — hire, mentor, and develop engineers while building a culture of ownership, collaboration, and continuous improvement.
- Design, build, and maintain cloud infrastructure on AWS, making architectural decisions that balance reliability, security, cost-efficiency, and velocity.
- Own CI/CD pipelines end-to-end using GitHub Actions, with automated testing, security scanning, and robust deployment strategies.
- Drive infrastructure-as-code practices with Terraform, ensuring reproducible, auditable, and version-controlled environments across dev, staging, and production.
- Manage and scale Kubernetes (EKS) clusters, networking, and container orchestration for production workloads.
- Establish observability, monitoring, and incident response practices that give the team real-time visibility into system health and enable fast recovery.
- Partner closely with Engineering, Product, and Security teams to align infrastructure capabilities with product needs and security requirements.
- Build paved-road developer tooling — shared templates, reusable IaC modules, CI/CD workflows, and runbooks — that help the entire engineering org move faster and more safely.
- Define and enforce security best practices across infrastructure, including secrets management, network security, access controls, and compliance automation.
Requirements:
- Experience: 5+ years of hands-on experience in DevOps, SRE, or infrastructure engineering.
- Cloud Infrastructure: Deep hands-on experience with AWS services (EC2, EKS, S3, IAM, VPC, Lambda, CloudWatch, Neptune) and designing production-grade cloud architectures.
- Container Orchestration: Strong expertise in Kubernetes (EKS) including Helm, networking, autoscaling and docker.
- Infrastructure-as-Code: Proven experience with Terraform, Helm charts, including module design, state management, and GitOps workflows (ArgoCD).
- CI/CD: Solid experience with GitHub Actions, including artifact management, automated security scanning, and deployment strategies.
- Scripting & Automation: Proficiency in Python and Bash.
- FinOps: Familiarity with cost optimization strategies in AWS at scale.
- Security Practices: Strong understanding of secrets management (Vault, AWS Secrets Manager), network security, IAM, and compliance.
- Production Ownership: Comfortable owning systems in production — observability, debugging, performance, and reliability.
- Team Mindset: Strong communication skills, high ownership, and the ability to thrive in a fast-paced, “build-first” startup environment.
Advantage
- Experience with at least 2 years in a team lead or technical leadership role.
- Experience in the cybersecurity domain — vulnerability management, exposure management, or security product engineering.
- Experience with air-gapped, on-prem, or hybrid cloud environments.
- Experience in software engineering as a developer.
- Familiarity with high-scale data ingestion and processing pipelines.
- Experience with graph-based or network-aware security systems.
- Familiarity with cost optimization strategies in AWS at scale.