Tech Stack
- Python
- JAX and XLA
- Rust / C++
- Spark / Ray
Focus
- Training trillion parameter multimodal models at the web scale, as well as a variety of smaller specialized models.
- Pushing boundaries in spatial-temporal compression, world modeling, multimodal reasoning, cross-modal alignment, and emergent capabilities.
- Rapidly implementing the latest state-of-the-art methods from the deep learning literature.
- Innovating new ideas for pretraining and new scaling paradigm.
Ideal Experience
- Strong engineering skills with passion on model-hardware co-design.
- Expert in ML and large model scaling, familiar with all kinds of scaling laws.
- Familiar with distributed training, multi-GPU neural network training and experience on optimizing ML training efficiency.
- Familiar with state-of-the-art techniques for multimodal language models, especially for image/video understanding.
Location
The role is based in the Bay Area [San Francisco and Palo Alto]. Candidates are expected to be located near the Bay Area or open to relocation.
Annual Salary Range
$180,000 - $440,000 USD
Benefits
Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.