Research Engineer

Harrison Clarke • Full-time • San Francisco Bay Area, US • 3w ago

AI Research Engineer (On-Device ML Focus) | Bay Area | Stealth Startup

We are working with a well-funded, early-stage AI company building something genuinely different at the intersection of agent systems + on-device intelligence.

They’re tackling a very real problem - replacing manual IT workflows (tickets, logs, reactive support) with an AI-native system that can detect, diagnose, and resolve issues autonomously on every device.

What makes this role interesting

This is not just another LLM/backend role.

A big part of the challenge is building lightweight ML systems that run directly on-device (Windows, Mac, Linux), under real-world constraints:

Limited compute + memory
Real-time inference requirements
High-volume telemetry / behavioural signals
Need for reliability in production (not sandbox experiments)

You’ll be working across

On-device models (edge ML, optimisation, inference)
Backend reasoning systems (LLMs, agents, workflows)
End-to-end pipelines (training → evaluation → deployment)

What they’re looking for

Strong preference for engineers/researchers who have:

Hands-on experience with on-device / edge ML
Built or deployed models in constrained environments (CPU, mobile, embedded, etc.)
Worked on real-world systems, not just research prototypes
Comfort operating in 0→1 environments with high ownership

Nice to have