NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people.
Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
The Data Center System Engineering team is looking for a software engineering manager to lead a team working on system design and tuning mechanisms for large-scale AI system performance and datacenter applications. You will work with the latest accelerated computing, deep learning software and hardware platforms to craft improved workflows and develop new, leading differentiated solutions.
Be a key player to the most exciting computing hardware and software to contribute to the latest breakthroughs in artificial intelligence and GPU computing!
What you'll be doing:
Lead, mentor, and grow your engineering team.
Drive strong software engineering practices in large scale infrastructure, deliver powerful tools, methodologies, and flows to validate and improve datacenter products in parallel.
Specific responsibilities include aligning the next generation AI workloads on top of next generation datacenter designs. This involves early engagement with HW/FW/SW/platform internal and customer teams, and other groups.
Deliver engineering solutions that offer insights into performance of AI workloads over evolving environments, generating quick insights to improvements and regressions over time.
Decompose high-complexity issues into minimal reproduction cases, working towards root cause of underlying problems.
Participate in engagements with various software and firmware (BMC/SBIOS/OS/drivers) teams to develop best-in-class practices and tools, you will be analyzing, debugging and resolving critical firmware and software issues for the best AI workload performance at scale.
What we need to see:
Proven understanding of accelerated computing software stacks (CUDA).
Experience using and handling modern Cloud and container-based Enterprise computing architectures.
Strong C/C++/Python/Bash programming/scripting experience.
Deep experience with systems architecture and the impact of various components in performance
Experience with container technology and Linux based OSes.
Experience working with high performance computing or deep learning.
Strong verbal and written communication skills.
Strong teamwork and communication skills.
Ability to multitask in a dynamic environment.
Action driven with strong analytical and analytical skills.
BS in Engineering, Mathematics, Physics, or Computer Science, MS or PhD desirable (or equivalent).
8+ overall years of experience, including 3+ years or more in team management
Ways to Stand Out from the Crowd:
End-to-end performance engineering from the profiler to systems analysis
Linux systems programming and optimization experience
Exposure to virtualization techniques, cloud platform solutions.
Exposure to scheduling and resource management systems.
Experience with large scale HPC environments.
Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. For Poland: The base salary range is 345,000 PLN - 598,000 PLN for Level 3, and 405,000 PLN - 702,000 PLN for Level 4.