NVIDIA is at the forefront of the AI revolution, delivering cutting-edge accelerated compute platforms for global impact. Our Network Insights group is seeking a talented and motivated Sr. DevOps Engineer to architect, scale, and optimize the DevOps infrastructure supporting our advanced networking simulation services. In this high-impact role, you will lay the foundations to scale a key insight product to reach 10–100 times more users, design robust CI/CD pipelines, drive automation, and ensure the reliability, scalability, and security of our cloud-based, and on-prem platforms.. If you are passionate about solving complex infrastructure challenges and enabling world-class software delivery, we want to hear from you.
What You'll Be Doing
- Architect and optimize CI/CD pipelines for large-scale, high-availability simulation services, ensuring fast, reliable, and secure deployments.
- Drive automation across infrastructure provisioning, configuration management, and monitoring to support rapid development cycles and minimize manual intervention.
- Collaborate with software engineering and product teams to design and implement scalable, cloud-native solutions that meet evolving business needs.
- Promote standard processes in infrastructure as code, containerization, and cloud security, ensuring compliance and resilience across environments.
- Monitor, troubleshoot, and resolve infrastructure and deployment issues, maximizing uptime and ensuring efficient performance for internal and external customers.
- Evaluate and integrate new tools and technologies to continually enhance the reliability, observability, and efficiency of our DevOps ecosystem.
- Participate in incident response and post-mortem processes, driving root cause analysis and systemic improvements.
What We Need To See
- BSc or above in Computer Science, Computer Engineering, or a related field, or equivalent experience.
- 5+ overall years of hands-on experience in DevOps or Site Reliability Engineering roles.
- Proven expertise in designing, building, and maintaining CI/CD pipelines (e.g., Jenkins, GitLab CI, GitHub Actions, or similar).
- Deep knowledge of cloud platforms (AWS, preferably), On-Prem deployment, container orchestration (Kubernetes, Docker), and infrastructure as code.
- Strong scripting and automation skills (Python, Bash, or similar).
- Experience with monitoring, logging, and observability tools (Prometheus, Grafana, ELK, etc.).
- Proven understanding of security standard methodologies in cloud & on-prem DevOps environments.
- Excellent communication and interpersonal skills, with a track record of multi-functional collaboration.
- Experience supporting large-scale, high-availability production systems.
Ways To Stand Out From The Crowd
- Prior background in networking or simulation environments.
- Prior experience with building a new team from the grounds up.
- Familiarity with performance tuning and cost optimization in cloud and on-prem environments.
- Experience with building CI/CD pipelines from the ground up.
NVIDIA is home to some of the most innovative and dedicated professionals in the industry, and as we continue to grow rapidly, we are looking for creative and driven engineers to join our world-class engineering teams. If you are an autonomous, visionary DevOps professional with a passion for technology and excellence, we want to hear from you.
We are committed to fostering a diverse work environment and are proud to be an equal-opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, perform essential job functions, and receive other benefits and privileges of employment. Please contact us to request accommodation.
JR2006613