Are you passionate about programming languages, compiler technology, and GPU performance? Do you want to help shape the future of high-performance kernel development for AI? We are looking for outstanding engineers to buildCUTLASSDSL, a Python-native language for GPU kernel development, along with the MLIR dialects and lowering passes behind it. In this role, you willalsohelp accelerate kernel compilation while delivering performance comparable to CUTLASS C++, enabling efficient hardware-software co-design for NVIDIA's next generation of AI platforms.