DevJobs

Senior Infrastructure Engineer

Overview
Skills
  • Bash ꞏ 2y
  • Spark
  • Linux ꞏ 2y
  • CI/CD ꞏ 6y
  • AWS ꞏ 6y
  • Azure ꞏ 6y
  • GCP ꞏ 6y
  • Podman
  • AWS ECR
  • Terraform ꞏ 6y
  • Networking ꞏ 6y
  • Relational DB Management ꞏ 6y
  • CloudFormation ꞏ 6y
  • distributed architectures ꞏ 2y
  • infrastructure operations process ꞏ 2y
  • New Relic ꞏ 2y
  • observability platforms ꞏ 2y
  • SRE platforms ꞏ 2y
  • Datadog ꞏ 2y
  • Databricks
  • ECS
  • EKS
  • PySpark
  • Vercel
Job Overview:


Our clinical trial SaaS platform runs on AWS infrastructure spanning multiple transactional databases, data warehouses, and key services like Vercel and Databricks to deliver predictive analytics for enterprise healthcare clients. We have a distributed, single-tenant architecture that demands robust cloud infrastructure management and optimization. As QuantHealth's first dedicated Infrastructure Engineer, you'll own and optimize the cloud systems that power our clinical trial innovation platform.

Key Responsibilities:
  • Develop, maintain, optimize, and harden our single-tenant cloud infrastructure
  • Implement a secure, high-performance network topology that connects frontend services, databases, and ML processing clusters
  • Design and implement disaster recovery strategies, including backup automation, fail-over procedures, and restore drills
  • Coordinate organization-wide SRE practices, including cross-component tracing, incident management, alerting, and reliability metrics
  • Work closely with engineering teams to understand their infrastructure requirements and enable them to achieve continuous and stable deployments
  • Administer key systems (AWS, Databricks, DBs, etc.) including access controls, security hardening, monitoring, and compliance management
  • Establish and manage an infrastructure request ticketing system with self-service capabilities enabling engineers to request changes, provision resources, and receive guidance
Requirements:
  • At least 6 years of experience in each of the following: managing core services on a major cloud provider (AWS, GCP, Azure), cloud networking, IaC tools (Terraform, CloudFormation, etc.), relational database management, CI/CD pipelines
  • At least 2 years of experience in each of the following: distributed architectures, infrastructure operations processes, Linux and Bash scripting, SRE and observability platforms (New Relic, Datadog, etc.)
  • Excellent written and verbal communication skills
  • Ability to work independently and as part of a team
Advantage:
  • Experience with Databricks (metastore management, access control, asset bundles, the Databricks Terraform provider, etc.)
  • Experience developing single-tenant solutions for large enterprise clients
  • Working knowledge of PySpark, Spark, AWS ECR, and similar tools
  • Experience with container orchestration platforms (EKS, ECS, Podman, etc.)
  • Experience with Vercel


QuantHealth