Member of Technical Staff - RL Data Scaling

xAI • Palo Alto, CA • 1w ago

About the Team

The team focuses on scaling up high-quality reinforcement learning data for training Grok reasoning models. Our team spans the end-to-end data lifecycle, including designing evaluations, curating and synthesizing training data, and advancing RL algorithms.

About the Role

In this role you might:

Design pipelines to scale up RL data generation and collection.
Develop next-paradigm RL algorithms.

Exceptional candidates may have:

Strong engineering abilities
Deep familiarity with large language model data collection
Optional: Cross-domain expertise

Location

We hire engineers in Palo Alto. Our team usually works from the office 5 days a week but allows work-from-home days when required. Candidates are expected to be located near Palo Alto or open to relocation.

Interview Process

After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15-minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews:

Coding assessment in a language of your choice.
Technical sessions (2): These sessions test your ability to formulate, design, and solve concrete real-world problems with LLMs.
Meet the Team: Present your past exceptional work and your vision with xAI to a small audience.

Our goal is to finish the main process within one week. All interviews will be conducted via Google Meet.

Annual Salary Range

$180,000 - $440,000 USD

Benefits

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short- and long-term disability insurance, life insurance, and various other discounts and perks.