At
aiOla, we’re redefining how people interact with technology through voice-driven AI. Our mission is to make everyday business operations faster, smarter, and easier — and security is at the heart of everything we build
We’re looking for a
Senior DevOps Engineer to join our growing infrastructure team.
This role is perfect for someone who thrives in a high-scale, cloud-native environment and is passionate about building robust, scalable, and secure infrastructure. You will play a key role in designing and maintaining our AWS based cloud environments.
Requirements:
- 4+ years of running production applications on AWS.
- 4+ years of hands-on experience with Kubernetes in production environments.
- Strong hands-on expertise in AWS services and EKS (Kubernetes).
- Proficiency in Infrastructure as Code (IaC) tools such as Terraform or OpenTofu.
- Experience with CI/CD & GitOps methodologies using GitHub Actions, ArgoCD or similar tools.
- Solid knowledge of monitoring and logging solutions (Prometheus, Grafana, CloudWatch, etc.).
- Strong scripting skills in Python, Bash, or similar languages.
- Excellent problem-solving abilities, with a proactive and collaborative mindset.
- Willingness to be on call from time to time as part of the team’s support rotation.
Nice to Have:
- Experience with compliance frameworks (SOC2, ISO27001, GDPR).
- Hands-on experience with SAST, DAST and vulnerabilities management tools.
- MLOps experience — securing AI/ML pipelines and data workflows.
- Prior experience in a startup or SaaS environment.
Responsibilities:
- Design, implement, and manage scalable and secure AWS and EKS environments.
- Drive CI/CD best practices to ensure smooth and efficient deployments.
- Enhance system reliability, security, and performance through monitoring and automation.
- Collaborate closely with R&D teams to optimize development workflows and system architecture.
- Establish and enforce DevOps best practices, ensuring high availability and disaster recovery strategies.
- Participate in an on-call rotation to respond to critical issues and ensure system stability.