🚀 AI Engineer (New York City, NY)
💼 Full-Time | 🧠 3–10+ Years Experience | 💰 Competitive salary + meaningful early equity
We’re building the next-generation AI compiler stack, enabling any AI model to run at peak efficiency on any hardware, without manual tuning or vendor-specific engineering. Backed by top investors and advised by industry pioneers from NVIDIA, Google, and Gensyn, we’re pushing the boundaries of how AI systems generate and optimize code for GPUs.
🧩 What You’ll Do
Create an LLM-based AI agent using proprietary frontier models, open-source models, fine-tuning, prompt engineering, RAG, or any combination of these and other techniques.
Develop tools and APIs that LLMs can use to generate and augment GPU kernels.
Work closely with compiler and performance engineers to improve model efficiency and runtime speed.
Ship production systems in C/C++ and Python.
⚡ What We’re Looking For
Strong programming experience in C/C++ and Python.
Hands-on experience building AI agents or autonomous coding systems powered by LLMs.
Familiarity with LLM training, fine-tuning, and deployment.
🧠 Tech Stack
Languages: C++, Python
AI/ML: OpenAI, Anthropic, vLLM, SGLang
Infra: CUDA, HIP, Triton
📍 On-site in New York City
👉 Apply now to help build the compiler layer that powers the next generation of efficient, scalable AI systems.