For over two decades, NVIDIA has been a leader in visual computing and innovation. With the invention of the GPU, the company has expanded into areas like gaming, movies, product design, healthcare, and research. Today, NVIDIA is at the forefront of AI, high-performance computing, and advanced technologies. Our data centers and infrastructure support large-scale systems used by hundreds of engineers worldwide.
We are looking for a highly motivated DevOps Engineer to join our team. The role includes working on automation, infrastructure, CI/CD, and large-scale distributed systems. The ideal candidate will have strong experience with Linux, cloud platforms, Kubernetes, and monitoring tools.
What you'll be doing:
Build, improve, and maintain automation and self-service solutions across infrastructure, Data Center, and engineering environments.
Develop and maintain CI/CD pipelines to support faster, safer, and more reliable software and infrastructure delivery.
Work hands-on with Linux/Unix systems, infrastructure platforms, compute, storage, networking, and production environments.
Support Data Center-related workflows, including server lifecycle, hardware readiness, rack-and-stack processes, and operational improvements.
Use Infrastructure as Code and configuration management tools such as Terraform, Ansible, Puppet, or similar platforms.
Support cloud, hybrid, containerized, and Kubernetes-based environments.
Improve monitoring, logging, observability, reliability, and operational standards.
Partner closely with engineering, IT, operations, and infrastructure teams to improve scalability, efficiency, and automation.
What we need to see:
5+ years of experience as a Senior DevOps Engineer with strong hands-on expertise in infrastructure, automation, and operational excellence.
Bachelor’s degree in Computer Science, another technical certification, or relevant work experience.
Strong scripting and programming skills using Python, Bash, Go, or other automation-focused languages.
Experience building and maintaining CI/CD pipelines, release automation, testing integration, and deployment strategies.
Hands-on experience with Infrastructure as Code and configuration management tools.
Experience with Docker, Kubernetes, cloud, and hybrid environments.
Strong communication skills and the ability to work in a global, cross-functional, fast-paced environment.
Hands-on experience in Data Center, lab, or large-scale infrastructure environments.
Experience automating hardware, server provisioning, deployment, or infrastructure lifecycle processes.
Proven ability to build self-service platforms that reduce manual work and improve delivery speed.
Ways to stand out from the crowd:
Strong understanding of monitoring, observability, reliability engineering, and production support.
Experience working with engineering, IT, and operations teams to standardize processes and improve operational efficiency.
Ability to combine DevOps expertise with deep infrastructure and system-level understanding.
Experience driving automation initiatives that create measurable impact in time savings, quality, consistency, or scale.