DevJobs

Site Reliability Engineer

Overview

We build and integrate our solutions, continuously seeking improvement, focusing on the smallest of details, and having an in-depth analysis and system-oriented vision.

Our team works in a dynamic multi-tasking environment, where constant learning and adaptation are part of the job.

We deliver the best customer experience and directly affect our clients and the company’s strategy.

If you want to be a part of a dominant team and find joy in your work – join us!


What will you do?

You will be directly in charge of our production environment.

You will integrate new tools into our systems, such as monitoring, configuration, etc.

You will manage proactively and independently on-going tasks, priorities, and alerts.

You will diagnose and troubleshoot complicated technical cases.

You will handle external and internal escalations while you will constantly learn & share knowledge.

Work closely with DevOps / R&D teams / Product & Customer managers to enhance and drive service reliability.

You will work in multi-tasking mode and in a dynamic environment.



You will be great for this role if you have:

  • Root cause analysis skills and big-picture thinking.
  • Ability to document technical information.
  • Knowledge with one scripting or programming language. Basic PowerShell Skills needed.
  • Experience with managing and maintaining Windows server environment – Must.
  • Experience with Linux.
  • Experience in operation & managing production deployments – Advantage.
  • Experience in Cloud environments (AWS, Azure), Git, GitHub.
  • Networking knowledge, TCP/IP, HTTP.
  • Experience with Automated IT operations using Ansible and Terraform – Advantage.
  • BA / BSc in Computer Science or Engineering – Advantage.
  • Prior hands-on experience with software or reliability engineering – Advantage.
  • Experience with monitoring and logging platforms such as Prometheus, Grafana, Docker, Kubernetes – Advantage.
LabOS