DevJobs

DevOps Engineer

Overview
Skills
  • Java Java
  • Python Python
  • Elasticsearch Elasticsearch
  • Linux Linux
  • AWS AWS
  • Kubernetes Kubernetes
  • Terraform Terraform
  • Crossplane
  • Karpenter
Totango is a leading customer success platform that drives customer adoption, retention, and growth for businesses worldwide. Totango helps companies get real-time access to customer health, stay alert to risks and opportunities, and engage proactively and intelligently with their customers to increase net revenue retention and drive productivity within their organization. Our platform is used by some of the biggest enterprise and hottest SaaS companies including Google, SAP, Github, Zeplin, Walkme and more.

We are looking for a top-notch DevOps engineer to join our team!

What will you do:

As a key member of our DevOps team, you will be responsible for a wide range of critical tasks aimed at enhancing the overall efficiency, reliability, and scalability of our systems and services. The team's primary focus includes:

  • CI/CD Development: Be part of the team that develops our next-generation Continuous Integration and Continuous Deployment (CI/CD) pipelines. Leverage modern technologies such as GitOps, CueLang and Crossplane, to ensure that our software delivery processes are efficient, automated, and reliable.
  • ElasticSearch Cluster Management: Building and maintaining a set of Elasticsearch clusters on AWS EKS. This involves optimizing performance, ensuring high availability, and managing data effectively.
  • Development Acceleration: Crafting innovative tools and automation solutions to boost development velocity for our Totango development teams. Help reduce manual tasks and streamlining processes, to provide faster and more efficient software development.
  • System Design and Capacity Planning: Actively participating in system design consultations and providing expertise in platform management and capacity planning. Advise on architectural decisions to ensure optimal performance and scalability.
  • Monitoring and Alerting: Implementing and enhancing monitoring and alerting systems that provide real-time insights into the health and performance of our applications, enabling swift response to issues
  • Highly Available Systems: Building and maintaining highly available systems to ensure our services are always accessible to our customers, even in the face of failures.
  • Metrics and Performance Tuning: Collecting, analyzing, and deriving insights from metrics generated by operating systems and applications. This data will be used to fine-tune system performance and identify and resolve potential issues proactively.
  • Root Cause Analysis: Conducting root cause analysis for production errors to prevent reoccurrence and continuously improve system reliability.
  • Cloud Infrastructure Maintenance: Continually maintain our cloud infrastructure to enhance its efficiency, security, and cost-effectiveness.
  • On-Call Support: Participating in on-call rotations to prevent and address incidents, ensuring the availability and stability of our systems 24/7.

This dynamic role offers a unique opportunity to be at the forefront of technology, working on cutting-edge solutions that have a direct impact on our software development processes and infrastructure. Join our team and help shape the future of our DevOps practices.

Requirements:

  • At least 2 years of experience as a DevOps Engineer.
  • Linux: A background in Linux is crucial for managing and maintaining our systems, as many of our infrastructure components are based on Linux.
  • Amazon Cloud Expertise: You should have a strong command of Amazon Web Services (AWS) cloud infrastructure. Proficiency in configuring and managing AWS resources is fundamental to our cloud-based solutions.
  • Kubernetes: You should be familiar with setting up and managing Kubernetes Clusters, including security, monitoring, and auto-scaling.
  • High-Level Language Proficiency for Problem-Solving and Automation: A solid understanding of high-level programming languages, such as Java or Python, is highly beneficial. We expect the candidate to leverage their programming skills to tackle complex problems, automate repetitive tasks, and enhance our DevOps workflows. Your ability to work with code is instrumental in driving efficiency and innovation within our DevOps practices.
  • Proactive Problem Solver
  • Results-Oriented
  • Continuous Learner
  • Enthusiastic and Committed

Experience with the following technologies is an advantage:

  • Terraform
  • Crossplane
  • Karpenter
  • Elasticsearch
Totango