Software Engineer - Machine Learning

FocusKPI Inc. • Contract • Mountain View, California, United States • 20h ago

FocusKPI is seeking a Software Engineer - Machine Learning to join one of our clients, a high-tech SaaS company.

We are looking for an experienced Machine Learning Engineer to lead the development of prompt-injection and prompt-safety models to protect the client's downstream agentic AI systems across phones, the cloud, and XR/AR. You will design, train, and deploy classifier and guardrail models (both cloud-based and hybrid on-device) that screen agent inputs and outputs for injection attacks, unsafe content, and policy violations. A core part of the role is post-training these models using RLHF, DPO, and related optimization techniques to push detection accuracy and false-positive rates beyond what off-the-shelf solutions can achieve.

Work Location: Mountain View, CA (Onsite role, 5 days/week onsite)
Duration: 12-month contract with potential to extend the contract depending on your performance & budget
Pay Range: $95 - 110/hr

**No C2C resumes are considered**

Position Responsibilities:

Design and train prompt-injection detection models and prompt-safety classifiers that operate on both inputs to and outputs from the client's agentic AI systems.
Build hybrid deployment pipelines that split safety inference between on-device (phone, XR/AR) and cloud, optimizing for latency, privacy, and detection coverage.
Apply post-training techniques (e.g., RLHF, reward modeling, policy optimization) to optimize guardrail model performance, calibration, and robustness against adaptive adversaries.
Curate and generate adversarial training data: direct and indirect prompt injections, jailbreaks, tool-use exploits, and unsafe-output cases drawn from red-teaming and production signals.
Build evaluation harnesses that measure attack success rate, false-positive rate, latency, and on-device footprint across model iterations and threat categories.
Partner with agent, device, and platform teams to integrate safety models into mobile-use agents, XR/AR assistants, and cloud agentic workflows, and to close the loop from production incidents back into training data.
Work cross-functionally with security researchers, modeling teams, and product engineers; document methods and, where appropriate, contribute to patents and publications.

Qualifications:

M.S. or Ph.D. in Computer Science, Machine Learning, Electrical Engineering, or a related field; or B.S. with equivalent industry experience.
3+ years of industry experience in ML engineering or applied AI research, with demonstrated ownership of production ML systems post-master's degree graduation.
2+ years of industry experience in software engineering post-master's degree graduation.
Strong proficiency in Python and PyTorch (or JAX/TensorFlow), with solid software engineering fundamentals (version control, testing, and reproducible experimentation).
Hands-on experience post-training LLMs with RLHF, DPO, RLAIF, or reward modeling, including reward design, preference data curation, and training stability.
Hands-on experience training and deploying classifier or guardrail models for safety, content moderation, abuse detection, or adversarial robustness.
Familiarity with prompt injection, jailbreak, and agentic AI threat models, and with distributed training frameworks (DeepSpeed, FSDP, Accelerate).

Preferred Qualifications:

Experience building safety or moderation systems for agentic AI: tool-use guardrails, indirect prompt injection defenses, or output filtering for autonomous agents.
Experience with red-teaming, adversarial data generation, or automated attack pipelines (e.g., GCG, PAIR, generator–critic frameworks).
Experience with on-device or edge ML deployment (ExecuTorch, Core ML, TFLite, MLC-LLM, vendor NPU toolchains) and model compression (quantization, distillation, pruning) for safety models.
Experience with telemetry, logging, or user-facing data systems on mobile, XR/AR, or consumer platforms, including privacy-preserving handling of user data (e.g., anonymization, on-device processing, federated approaches).
Publications at top-tier ML/NLP/security venues (NeurIPS, ICML, ICLR, ACL, EMNLP, USENIX Security, IEEE S&P), patents, or open-source contributions in the safety, alignment, or AI security space.

Education:

M.S. in Computer Science, Machine Learning, Electrical Engineering, or a related field with 3 years of experience post graduation
Ph.D. in Computer Science, Machine Learning, Electrical Engineering, or a related field with publications in the AI/ML domain and 1 year of experience post-graduation

**No C2C resumes are considered**

Thank you!

FocusKPI Hiring Team

Founded in 2010, FocusKPI, Inc. (FocusKPI) is a data science and technology firm specializing in predictive analytics practice and methodologies. FocusKPI is a US company headquartered in Silicon Valley, California, with an East Coast office in Boston, Massachusetts.

NOTICE: Please be aware of fraudulent emails regarding job postings, job offers and fake checks. FocusKPI's recruiting team will strictly reach out via @focuskpi.com email domain. If you have received fraudulent emails now or in the past, please report it to https://reportfraud.ftc.gov/ .
The domain @focuskpijobs.com is fraudulent and not related to FocusKPI. Please do not not reply or communicate to anyone with @focuskpijobs.com.