
DevOps Tech Lead

Skills
  • Bash
  • Go
  • Python
  • GitHub Actions
  • GCP
  • AWS
  • Azure
  • Docker
  • Kubernetes
  • Terraform
  • Grafana
  • Elastic APM
  • Prometheus
Pipl is looking for a passionate and experienced DevOps Tech Lead to join our Platform Engineering team and play a key role in shaping the future of our infrastructure and delivery pipelines. This role is critical in bridging the gap between development and operations, championing DevOps best practices and fostering a culture of automation, reliability, and scalability.

In this role, you’ll work closely with software development teams to streamline deployment processes, ensure high system reliability and performance, and drive infrastructure innovation. You’ll also lead efforts to improve system observability and enforce security and compliance. In addition, you’ll influence technical decision-making, guide architectural direction, and mentor engineers in a fast-paced, collaborative environment.

Key Responsibilities:

  • Team Leadership & Mentorship:
      A) Provide technical leadership and guidance to DevOps engineers through coaching, design reviews, and knowledge sharing.
      B) Foster a culture of continuous improvement, accountability, and innovation within the DevOps team.
  • Infrastructure & Automation:
      A) Implement and maintain scalable, secure, and highly available cloud infrastructure (GCP).
      B) Champion Infrastructure as Code to enable repeatable, reliable, and version-controlled infrastructure deployments.
      C) Ensure high availability and fault tolerance across all environments, from development to production.
  • CI/CD & Deployment:
      A) Design and maintain robust Continuous Integration and Continuous Deployment (CI/CD) pipelines.
      B) Optimize build and release processes to increase developer velocity and reduce downtime.
      C) Collaborate with developers and QA to enable automated testing and seamless deployments.
  • Monitoring, Logging & Incident Response:
      A) Implement observability strategies using tools such as Prometheus, Grafana, and Elastic APM.
      B) Ensure comprehensive monitoring and alerting are in place for all critical infrastructure and applications.
      C) Lead incident response, root cause analysis, and post-mortems to drive long-term improvements in system reliability.
  • Security & Compliance:
      A) Implement security best practices across infrastructure.
      B) Manage secrets, define network policies, and enforce access controls to ensure secure environments.

Requirements:

  • 7+ years of experience in DevOps, Site Reliability Engineering, or related roles.
  • At least 2 years in a lead or technical leadership position, guiding teams and driving technical projects.
  • Deep hands-on experience with cloud platforms such as Google Cloud, AWS, or Azure.
  • Strong proficiency in automation and scripting languages (e.g., Python, Bash, Go).
  • Expertise in building and maintaining CI/CD pipelines (GitHub Actions).
  • Solid understanding of containerization and orchestration (Docker, Kubernetes).
  • Practical experience with Terraform for Infrastructure as Code.
  • Strong knowledge of networking, security principles, and system administration.
  • Excellent problem-solving, organizational, and communication skills.
  • Familiarity with service mesh architectures and microservices observability.
  • Experience implementing DevSecOps pipelines and security automation.