About Eightpoint
Eightpoint is an internet technology company specializing in the agile development of products and content that address real-world interests, captivating users and driving significant growth for partners. With offices in the United States and the Cayman Islands, Eightpoint collaborates with partners globally on the next generation of user-centric offerings. Our growing ecosystem includes innovative products like Weather Now, a sleek app that delivers real-time forecasts with clarity and ranks among the top three most-used weather apps in the U.S.; Check Heart Rate Now, a quick and easy wellness monitor; and Wave Browser, a powerful and secure way to search the web. Every product we launch is designed to engage users, enhance daily life, and deliver real-world value. Backed by data and driven by a relentless commitment to quality, Eightpoint moves fast, thinks big, and builds digital experiences that people love.
Our innovative culture thrives on collaboration, data-driven decisions, and a startup mindset. Join our team and enjoy top-tier benefits, unlimited PTO, flexible schedules, free lunches, and the opportunity to work alongside a world-class team of professionals.
About the Role
We're looking for a self-directed AI/ML Engineer to join the Mobile Group at Eightpoint, where you'll be responsible for designing, building, and deploying AI-powered features that directly enhance our portfolio of mobile applications on iOS and Android.
In this role, you'll take ownership of complex problems from concept through production, leveraging large language models (LLMs), computer vision, recommendation systems, predictive models, and other machine learning technologies to create intelligent, personalized user experiences.
You'll collaborate closely with Product Managers, Mobile Engineers, Designers, and Data teams to identify opportunities where AI can drive engagement, retention, and monetization, while remaining responsible for the technical architecture, model selection, experimentation, evaluation, and production deployment of AI solutions.
Who You Are
- Strong Python proficiency
- Deep experience with LLM application development: Retrieval-Augmented Generation (RAG), vector databases, and embedding models
- LLM orchestration frameworks (LangChain, LangGraph, or equivalent)
- Fine-tuning experience, specifically LoRA (PEFT and quantization a plus)
- PyTorch and/or TensorFlow
- Dataset creation: sourcing, curating, and building datasets for training and evaluation
- AWS SageMaker
- Evaluation of model and retrieval quality (eval datasets, measuring retrieval performance, regression testing)
- Telemetry and logging for production systems
Preferred Qualifications
- FastAPI
- Public presence on HuggingFace (model/dataset contributions)
- MLOps experience (containerization, CI/CD, orchestration)
- Experience with managed ML platforms beyond SageMaker (Vertex AI, etc.)
- ONNX (model export/optimization for inference)