This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a AI Researcher based in the United Kingdom.
This role sits at the frontier of agentic AI research, focused on building next-generation systems that learn from real-world interaction data at scale. You will explore how multimodal signals such as text, audio, logs, and structured workflows can be transformed into more capable reasoning and decision-making models. The environment blends deep research with production impact, where breakthroughs are expected to translate directly into deployed systems. You will work closely with engineering and product teams to ensure that research outputs improve real user-facing agents and continuously evolve through live feedback. The role requires strong scientific rigor combined with practical system-building skills. It is ideal for someone who thrives in ambiguous problem spaces and wants to shape the future of multimodal and agentic AI.
\n
Accountabilities
- Conduct advanced research on agentic AI systems trained on real-world interaction data, focusing on improving reasoning, planning, and tool use.
- Design and experiment with learning frameworks such as RAG, fine-tuning, RLHF, DPO, and GRPO to enhance large-scale model performance.
- Develop multimodal representation learning approaches, including joint embedding spaces across text, audio, logs, and structured data.
- Improve speech and audio intelligence systems, including STT, ASR, and audio-driven learning pipelines.
- Define evaluation methodologies to measure agent performance in real-world and domain-specific environments.
- Translate complex behavioral and interaction signals into structured training objectives for large-scale models.
- Collaborate with engineering and product teams to bring research into production and iterate based on live system feedback.
Requirements
- PhD in Computer Science, Machine Learning, AI, Electrical Engineering, or a related field.
- 5+ years of experience in applied AI research or ML systems with production-level impact.
- Strong expertise in large-scale machine learning, LLMs, or multimodal AI systems.
- Hands-on experience with RAG systems, LLM fine-tuning, and reinforcement learning methods such as RLHF, DPO, or GRPO.
- Strong background in representation learning, embeddings, and joint multimodal spaces.
- Experience with speech and audio modeling, including STT, ASR, or audio signal processing.
- Proficiency in Python and modern ML frameworks such as PyTorch and Hugging Face.
- Experience designing evaluation frameworks for LLMs or agentic systems.
- Strong ability to define research hypotheses from ambiguous real-world problems.
- Excellent written and verbal communication skills in English.
Benefits
- Fully remote position within a global AI research organization
- Opportunity to shape cutting-edge agentic and multimodal AI systems
- High-impact research with direct production deployment
- Collaboration with top-tier engineering and product teams
- Strong ownership over research direction and technical strategy
- Access to large-scale proprietary datasets and AI infrastructure
- Competitive compensation aligned with experience and expertise
\n
How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Why Apply Through Jobgether?
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
#LI-CL1