We are looking for a DevOps Engineer to join our team!
We have a 100% Kubernetes microservices architecture running on 20+ clusters on both AWS and GCP.
We are looking for someone to help us maintain, develop and extend our:
- Kube Prometheus monitoring stack with Thanos for long term metrics
- GitOps workflows and methodologies using Argo CD
- Multi-cloud PaaS with Crossplane
Role and Responsibilities:
- Define Site Reliability Engineering strategies, review specifications and design
- Work with engineering teams on defining and implementing service monitoring to enhance reliability
- Increase visibility on the platform health, create reports and dashboards to make sure the trends are good
- Work with teams to design and implement automated code deployment solutions and remediate issues impacting the cost, health, and performance of our production systems & infrastructure
- Work with teams to diagnose and isolate issues at all layers of the stack, whether it be code or infrastructure, during development and in production
- Maintain production services by measuring and monitoring availability, latency, and overall system health
- Solve problems in mission-critical services creating solutions to prevent problem recurrence, automating remediation procedures
- Develop our data-driven culture by providing statistical analysis to increase the quality of service
- Providing operational support for day-to-day activities involving deployments of services, configurations of service interaction, etc.
Requirements:
- 3+ years of experience as a DevOps/SRE
- 2+ years of experience with public cloud
- 2+ years of experience with Kubernetes
- Experience with networking, distributed systems, SQL and NoSQL databases
- Extensive experience with the following: Shell scripting, Helm and Terraform
- Experience with Unix/Linux operating systems internals and administration
- Maintaining production services, and experience analyzing and troubleshooting systems
- Commitment to a collaborative environment infused with professionalism, integrity, passion, and accountability
- Experience with writing in high level languages such as Python or Go - advantage
- Experience with writing/maintaining Kubernetes operators - advantage
About Us:
AI21 Labs was formed by AI luminaries and veterans of the elite technology unit of Israel’s IDF (Yoav Shoham, Amnon Shashua, and Ori Goshen), with the mission of building AI systems with an unprecedented capacity to understand and generate natural language. The company’s products - Wordtune and Wordtune Read - aim to transform the way we read and write. Wordtune is the first AI-based writing companion that understands context and meaning, while Wordtune Read is an AI-based reading companion which helps people read faster, more efficiently and process information in less time. AI21 Studio is our B2B channel focused on empowering developers anywhere to build text based apps and services using our state-of-the-art language models.