Lead AI Researcher

Cerebro • Full-time • San Francisco Bay Area, US • $380k - $500k / year • 2w ago

We’re an early-stage AI company building a new infrastructure layer for AI systems: enabling models and agents to reliably interact with the real-world data that matters.

Today, most enterprise data (~80%+) is unstructured and multimodal—documents, tables, charts, images—yet existing systems (LLMs, OCR, RAG pipelines) fail to process it reliably.

We’re solving this by building state-of-the-art vision-language models and systems that transform complex, real-world inputs into structured, machine-readable representations that downstream models can actually reason over.

Our models are already deployed in production across high-stakes domains (finance, legal, healthcare) where accuracy matters, and are outperforming existing approaches on real-world benchmarks.

We believe this layer—making messy reality legible to AI systems—is one of the most important unsolved problems in the stack.

The Role

We’re hiring a Founding ML Researcher to work directly with the founders to define and build the core research agenda.

This is not a typical applied ML role. You’ll be operating at the intersection of:

Multimodal foundation models
Vision-language reasoning
Structured generation / parsing
Reliability and determinism in AI systems

You’ll have end-to-end ownership—from first-principles research → model design → training → production deployment.

What You’ll Work On

Designing novel architectures for multimodal understanding (documents, tables, layouts, graphs)
Pushing beyond standard LLM paradigms into structure-aware and layout-aware models
Improving factuality, determinism, and reliability in model outputs
Building systems that combine:
Vision models
Language models
Structured decoding / constrained generation
Developing evaluation frameworks for real-world correctness (not just benchmark scores)
Shipping research directly into production systems used by real customers

Who This Is For

We’re specifically looking for researchers currently at (or competitive with):

Frontier labs (e.g. Anthropic, OpenAI, DeepMind, Meta, etc.)
Top-tier research groups or high-end startups doing foundational ML work

You should have:

Strong background in deep learning / ML research
Experience with at least one of:
Multimodal models (VLMs, vision transformers, etc.)
LLMs / generative models
Representation learning or structured prediction
A track record of building or shipping real systems, not just papers
Taste for first-principles thinking over incremental work

What Makes This Different

Greenfield research direction: You will define major parts of the roadmap
Tight feedback loop: Your work goes into production quickly
Hard, unsolved problems:
Turning perception into structured reasoning
Bridging vision + language + symbolic structure
Making AI systems reliable in the real world
Small, elite team: You’ll work directly with highly technical founders
Massive surface area: This problem sits upstream of RAG, agents, and enterprise AI

Why Join

Most frontier labs are focused on scaling general models.

We’re focused on something orthogonal and equally critical:

Making models actually work on real-world data.

If solved, this unlocks: