AI Engineer
Sunnyvale, CA (2-3 Days On-Site)
Our client is a rapidly growing venture-backed software company pioneering AI innovation for next-generation technology. Their team is dedicated to transforming traditional hardware into intelligent, software-defined platforms deployed at a global scale.
About the Role
As an AI Engineer, you will work on breakthrough AI products that redefine how devices interact with software - building, iterating, and deploying solutions that reach millions of users. Embedded in a nimble, high-impact engineering team, you will help drive AI initiatives from proof-of-concept to production, collaborating directly with technical leadership and shaping the trajectory of AI.
Responsibilities
- Research, develop, and deploy innovative AI/ML models for next-generation in-device experiences and edge computing.
- Architect, fine-tune, and optimize large language models (LLMs), agentic frameworks, and retrieval-augmented generation (RAG) systems for real-world deployment.
- Deliver production-ready code in Python, leveraging frameworks such as TensorFlow and PyTorch.
- Collaborate closely with the CTO and technical team on greenfield projects - from ideation through scalable implementation.
- Benchmark, evaluate, and iterate on models for performance, efficiency, and reliability in constrained hardware environments.
- Work hands-on across the ML lifecycle, including MLOps, IoT/embedded integration, and inference optimization.
- Remain up to date on the latest research and technologies to rapidly prototype and validate new concepts.
Qualifications
- Degree in Mathematics, Physics, Computer Science, or Data Science from a top-tier university.
- 6+ years of progressive experience in AI/ML, with a successful track record shipping production AI systems.
- Expert-level Python programming skills.
- Hands-on experience with TensorFlow or PyTorch.
- Strong foundation in LLM fine-tuning, RAG systems, and agentic frameworks.
- Background in traditional machine learning or data science.
- Experience in startup environments or companies where AI/ML is a core mission.
- Familiarity with compiler technologies (e.g., ONNX, TensorFlow Lite), optimization techniques, and embedded system architectures.
- Willingness to work 2–3 days per week onsite in the Bay Area.
Preferred Skills
- Experience leading the development of visionary AI/ML proof-of-concept projects.
- Demonstrated ability to wear multiple hats (coding, MLOps, LLM benchmarking).
- Prior exposure to IoT or related industry.