NVIDIA is looking for a Senior DevOps Engineer to join the NSV (Network Solutions Validation) group. NSV builds high-performing software automation for NVIDIA’s Data Center environments and helps drive the data growth of the world’s biggest companies. In this role, you will lead and support our delivery platform with a strong focus on OpenShift, GitLab CI/CD, and reliable production operations. You will partner with engineering teams to build secure and scalable deployment workflows, improve CI/CD speed and reliability, and raise the operational maturity of our platform.
What You’ll Be Doing:
Operate OpenShift clusters and support production deployments (upgrades, scaling, probes, autoscaling, rollouts).
Manage platform integrations: Routes/Ingress, networking/DNS, storage, scheduling.
Build and optimize GitLab CI/CD pipelines (build→test→deploy→promote; caching, parallelism, reliability).
Operate and tune GitLab Runners for performance and peak scale.
Own Vault secrets management for CI/CD and runtime (access + rotation).
Improve observability and incident response (dashboards, alerts, runbooks).
Enable developers with deployment standards, docs, and troubleshooting support.
What We Need to See:
Bachelor's degree in Computer Science or a related field, or equivalent experience.
5+ years of experience in DevOps / SRE / Platform Engineering roles (or equivalent deep hands-on expertise).
Strong production experience with OpenShift administration and operations.
Proven experience building and maintaining GitLab CI/CD pipelines at scale across multiple services.
Real-world experience operating GitLab Runners (capacity, reliability, performance tuning).
Excellent Linux and troubleshooting skills (networking, processes, container runtime behavior).
Strong containerization expertise (Docker, image lifecycle, registries).
Hands-on experience with Vault for secrets management in pipelines and runtime environments.
Ability to work cross-functionally with developers and engineering managers to improve delivery standards and reliability.
Ways to Stand Out from the Crowd:
Experience with Helm and/or Kustomize for consistent deployments.
Exposure to GitOps workflows in general (without requiring Flux).
Experience operating in multi-environment setups (dev/staging/prod) with promotion flows.
Familiarity with performance optimization techniques across CI/CD and container platforms.
OpenShift/Kubernetes certification (e.g., Red Hat OpenShift certs, CKAD/CKA).
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you! NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
#LI-Hybrid