your responsibilties will include:
- Designing, building, running, and monitoring Planview production infrastructure.
- Push the team to engineering excellence by introducing methodologies, producing best-in-class production environments, documentation, testing, and monitoring.
- Manage and troubleshoot Planview application in the staging and production environments.
- Collaborate with R&D engineers to translate product requirements into technical solutions.
- Identifying and resolving technical issues and performance bottlenecks
- Responsible for the production environments, SLA and performance.
- Knowledge in security best practices and ability to run security project.
- Knowledge with CI/CD pipelines using Jenkins, Rundeck, helm charts etc in the on-prem and AWS.
- Responding to production incidents and determining how we can prevent them in the future.
- Developing and maintaining technical documentation, runbooks, and procedures
- Be a mentor for your team and help promote knowledge-sharing.
What you’ll bring to the role
- 8+ years of experience as a site reliability or platform engineer, preferably in a fast-scaling environment
- 2+ years of experience as a technical leader
- Experience with the deployment of production workloads on on-prem and public cloud infrastructure (AWS)
- Ability to look on the big picture and manage risks.
- Strong experience in security practices and network engineering.
- Experience managing CI/CD infrastructures, with a strong proficiency in platforms like bitbucket and Jenkins to streamline deployment pipelines and ensure efficient software delivery.
- Knowledge of observability tools such as LogicMonitor, New Relic, Prometheus, and Coralogix, as well as their implementation
- Strong technical knowledge in OS’s ( Linux and Windows ), virtualizations, storage systems, networking, and firewall implementations
- Excellent problem-solving skills and a detail-oriented mindset.
- Fluent English
- Strong communication and collaboration abilities to work effectively within a team.