Location
Our Senior Platform Engineer will be an integral part of our Engineering teams. This role is based remotely as a full-time employee in the UK, Ireland, Estonia, the Netherlands, Sweden and Israel. We are also open to contractors in Eastern Europe and Portugal.
Who We Are
DoiT is a global technology company that works with cloud-driven organizations to leverage the cloud to drive business growth and innovation. We combine data, technology, and human expertise to ensure our customers operate in a well-architected and scalable state - from planning to production.
Delivering DoiT Cloud Intelligence, the only solution that integrates advanced technology with human intelligence, we help our customers solve complex multicloud problems and drive efficiency.
With decades of multicloud experience, we have specializations in Kubernetes, GenAI, CloudOps, and more. An award-winning strategic partner of AWS, Google Cloud, and Microsoft Azure, we work alongside more than 4,000 customers worldwide.
The Opportunity
As a Senior Platform Engineer, you will be responsible for developing and evolving the foundational software and services that empower our product and development teams. This is an Individual Contributor role requiring strong coding skills alongside hands-on work with GCP & AWS, Kubernetes, and Terraform. You will be a key contributor to the design, implementation, and automation of our platform as a product, ensuring its scalability, reliability, and security.
Responsibilities
- Function as an individual contributor within the team: actively collaborating with peers through thorough code reviews, providing constructive support and mentorship, and contributing to a unified technical direction for the platform. This role also requires collaboration with individuals in feature teams, providing them with support and working with them to facilitate the adoption of developed platform features.
- Architect, Design, and Implement Infrastructure as Code (IaC): You will treat our infrastructure as a sophisticated software system, responsible for its comprehensive lifecycle management using Terraform. This involves applying best practices from software engineering, such as designing reusable code modules, implementing robust unit and integration testing strategies, and ensuring that our infrastructure is consistently provisioned and managed in a predictable and repeatable manner.
- Deploy, Manage, and Optimize Kubernetes Clusters on GCP (GKE) and AWS (EKS): You will take ownership of the deployment, configuration, and ongoing maintenance of our Kubernetes clusters on GCP Google Kubernetes Engine (GKE) and AWS Elastic Kubernetes Service (EKS) kit. This includes managing node groups, configuring network policies, implementing service meshes, handling cluster upgrades, and ensuring high availability and fault tolerance. You will also be responsible for monitoring cluster health, performance, and resource utilization, and proactively addressing any issues that arise.
- Develop and Refine Internal Software Delivery Systems (CI/CD: You will design, implement, and maintain robust Continuous Integration/Continuous Deployment (CI/CD) software specifically tailored for our platform components. This involves integrating tools like Argo CD or Atlantis, applying advanced programming concepts to automate build and release processes, and ensuring seamless deployment of platform updates. You will also focus on optimizing pipeline performance and reducing deployment times.
- Diagnose, Troubleshoot, and Resolve Platform-Related Issues: You will be the primary point of contact for diagnosing and resolving platform-related issues, including performance bottlenecks, scalability challenges, and security vulnerabilities. This involves utilizing advanced troubleshooting techniques, analyzing logs and metrics, and collaborating with development teams to identify and resolve root causes. You will also contribute to creating comprehensive incident response plans and post-mortem analyses.
- Drive Automation Initiatives to Streamline Operational Tasks and Enhance System Reliability: You will champion automation initiatives to eliminate manual operational tasks, reduce human error, and improve overall system reliability. This involves developing scripts, tools, and workflows to automate tasks such as infrastructure provisioning, configuration management, and monitoring. You will also proactively identify opportunities for automation and drive continuous improvement in our operational processes.
- Act as a Strategic Partner to Development Teams, Understanding and Addressing Their Infrastructure Needs: You will foster strong relationships with feature teams, treating them as your internal customers. You will actively engage with them to understand their infrastructure and developer experience requirements, provide expert guidance on platform capabilities, and ensure our platform effectively supports their development workflows. You will also translate developer needs into actionable platform product features and roadmaps.
- Drive Development of Internal Platform Product and Services: You will actively participate in the full software development lifecycle (design, code, test, and deploy) of internal tools and APIs. These services will enhance the functionality, self-service capability, and usability of our platform, directly improving developer productivity across the organization.
- Implement and Enforce Rigorous Security Best Practices and Ensure Compliance with Industry Standards: You will be responsible for implementing and enforcing robust security best practices across our platform, including access control, vulnerability management, and data encryption. You will also ensure compliance with relevant industry standards and regulations, such as SOC 2 and GDPR. You will also conduct regular security audits and penetration testing to identify and mitigate potential security risks.
Qualifications
- 6+ years of proven experience in platform engineering, DevOps engineering, or related roles, with a strong track record of building and maintaining complex cloud infrastructure.
- Strong hands-on experience with GCP/AWS, Kubernetes (GKE/EKS), and Terraform.
- Demonstrated expertise in building and maintaining scalable, reliable, and secure cloud infrastructure, with a focus on automation and efficiency.
- Strong Software Engineering fundamentals and demonstrated coding proficiency in Go or Typescript (or other relevant languages), including experience with data structures, algorithms, and defensive programming.
- Proven experience with CI/CD tools, such as Argo CD, Atlantis, or similar technologies, and a deep understanding of CI/CD principles and best practices.
- Understanding of networking concepts and protocols.
- Extensive experience with monitoring and logging tools, such as Prometheus, Grafana, and the ELK stack, and a proven ability to use these tools to diagnose and resolve performance issues.
- Knowledge of security best practices for cloud environments.
- Excellent communication skills in English, both written and verbal.
- Self-organized, goal-oriented, and self-motivated.
- Ability to work effectively in a remote and distributed team environment.
- Prior experience working specifically on platform engineering projects.
Are you a Do’er?
Be your truest self. Work on your terms. Make a difference.
We are home to a global team of incredible talent who work remotely and have the flexibility to have a schedule that balances your work and home life. We embrace and support leveling up your skills professionally and personally.
What does being a Do’er mean? We’re all about being entrepreneurial, pursuing knowledge and having fun! Click here to learn more about our core values.
Sounds too good to be true? Check out our Glassdoor Page.
We thought so too, but we’re here and happy we hit that ‘apply’ button.
Full-time employee benefits include:
- Unlimited PTO
- Flexible Working Options
- Health Insurance
- Parental Leave
- Employee Stock Option Plan
- Home Office Allowance
- Professional Development Stipend
- Peer Recognition Program
Many Do’ers, One Team
DoiT unites as
Many Do’ers, One Team, where diversity is more than a goal—it's our strength. We actively cultivate an inclusive, equitable workplace, recognizing that each unique perspective enhances our innovation. By celebrating differences, we create an environment where every individual feels valued, contributing to our collective success.