About the Role
As a Cross-functional Prompt Engineer at Anthropic, you'll own and shape Claude's behaviors across all of our products, ensuring users get a consistent, safe, and beneficial experience whether they're using claude.ai, Claude Code, or our other offerings. This role sits at the intersection of research and product, combining rigorous prompt engineering with strategic thinking about model behaviors.
You'll be the expert on Claude's behavioral quirks and capabilities, authoring critical system prompts for new model releases while also delivering complex meta-prompts that drive our research pipelines. When behavioral issues arise—like sycophancy concerns or safety incidents—you'll lead the response, working with product and research teams to identify, prioritize, and resolve problems. You'll also serve as a trusted resource for product teams tackling difficult prompting challenges, bringing rigor to our production prompts and scaling best practices across the organization.
This role requires someone who can balance immediate product needs with long-term behavioral goals, and who cares deeply about making Claude a healthy alternative in the AI landscape. You'll need strong technical foundations, excellent judgment about model behaviors, and the collaborative skills to work across research, product, and safety teams. This role offers a unique opportunity to directly shape how Claude behaves across all of Anthropic's products, ensuring our AI systems are safe, beneficial, and aligned with human values at scale.
Note: We are open to candidates who are less comfortable with coding, and adjusting the role scoping accordingly.
Responsibilities:
- Author and maintain behavior system prompts for each new Claude model release, ensuring consistent and aligned behaviors across products
- Deliver meta-prompts for critical research synthetic data pipelines, enabling our alignment and training efforts
- Review production prompt changes from product teams and serve as a resource for particularly challenging prompting problems involving alignment and reputational risks
- Identify, triage, and prioritize behavioral issues across Claude products, leading incident response for behavioral and policy concerns
- Develop behavioral evaluations in collaboration with product teams and alignment research to measure and track Claude's behaviors
- Define and streamline processes for rolling out prompt changes, including launch criteria and review practices
- Create model-specific prompt guides that document quirks and optimal prompting strategies for each release
- Contribute to product evaluations and prompt infrastructure improvements
- Track how Claude's behaviors compare to competitors, particularly on safety dimensions
- Scale prompting best practices and define success metrics for production behaviors
You May Be a Good Fit If You:
- Have extensive prompt engineering experience with large language models, including writing and evaluating complex multi-step prompts
- Possess deep knowledge of Claude's behaviors, capabilities, and limitations, with strong intuition for what issues are promptable versus requiring model-layer changes
- Can write Python and create behavioral evaluations from scratch
- Have excellent judgment about what model behaviors should look like in response to various inputs
- Demonstrate strong technical understanding, including comprehension of agent scaffold architectures and model training processes
- Excel at working across organizational boundaries, collaborating effectively with research, product, and safety teams
- Have core product management skills: prioritization, requirements gathering, stakeholder management, and translating user feedback into actionable specifications
- Can independently drive changes through production systems with strong execution and responsiveness
- Care deeply about AI safety and model welfare, understanding the ethical implications of model behaviors
Strong Candidates May Also Have:
- Background in philosophy, ethics, or psychology that informs thinking about model behaviors and values
- Experience with RLHF, constitutional AI, or other alignment techniques
- Track record of writing specifications or guidelines that shape complex system behaviors
- Experience responding to safety incidents or behavioral issues in production AI systems
- Formal training in ethics or moral philosophy
- Published work or demonstrated expertise in AI safety or alignment
- Experience building and maintaining evaluation frameworks for language models
- Background in data science with emphasis on data quality and verification
This role offers a unique opportunity to directly shape how Claude behaves across all of Anthropic's products, ensuring our AI systems are safe, beneficial, and aligned with human values at scale.