Cross-functional Prompt Engineer

Anthropic • San Francisco, CA • 1d ago

About the Role

As a Cross-functional Prompt Engineer at Anthropic, you'll own and shape Claude's behaviors across all of our products, ensuring users get a consistent, safe, and beneficial experience whether they're using claude.ai, Claude Code, or our other offerings. This role sits at the intersection of research and product, combining rigorous prompt engineering with strategic thinking about model behaviors.

You'll be the expert on Claude's behavioral quirks and capabilities, authoring critical system prompts for new model releases while also delivering complex meta-prompts that drive our research pipelines. When behavioral issues arise—like sycophancy concerns or safety incidents—you'll lead the response, working with product and research teams to identify, prioritize, and resolve problems. You'll also serve as a trusted resource for product teams tackling difficult prompting challenges, bringing rigor to our production prompts and scaling best practices across the organization.

This role requires someone who can balance immediate product needs with long-term behavioral goals, and who cares deeply about making Claude a healthy alternative in the AI landscape. You'll need strong technical foundations, excellent judgment about model behaviors, and the collaborative skills to work across research, product, and safety teams. This role offers a unique opportunity to directly shape how Claude behaves across all of Anthropic's products, ensuring our AI systems are safe, beneficial, and aligned with human values at scale.

Note: We are open to candidates who are less comfortable with coding, and adjusting the role scoping accordingly.

Responsibilities:

Author and maintain behavior system prompts for each new Claude model release, ensuring consistent and aligned behaviors across products
Deliver meta-prompts for critical research synthetic data pipelines, enabling our alignment and training efforts
Review production prompt changes from product teams and serve as a resource for particularly challenging prompting problems involving alignment and reputational risks
Identify, triage, and prioritize behavioral issues across Claude products, leading incident response for behavioral and policy concerns
Develop behavioral evaluations in collaboration with product teams and alignment research to measure and track Claude's behaviors
Define and streamline processes for rolling out prompt changes, including launch criteria and review practices
Create model-specific prompt guides that document quirks and optimal prompting strategies for each release
Contribute to product evaluations and prompt infrastructure improvements
Track how Claude's behaviors compare to competitors, particularly on safety dimensions
Scale prompting best practices and define success metrics for production behaviors

You May Be a Good Fit If You:

Have extensive prompt engineering experience with large language models, including writing and evaluating complex multi-step prompts
Possess deep knowledge of Claude's behaviors, capabilities, and limitations, with strong intuition for what issues are promptable versus requiring model-layer changes
Can write Python and create behavioral evaluations from scratch
Have excellent judgment about what model behaviors should look like in response to various inputs
Demonstrate strong technical understanding, including comprehension of agent scaffold architectures and model training processes
Excel at working across organizational boundaries, collaborating effectively with research, product, and safety teams
Have core product management skills: prioritization, requirements gathering, stakeholder management, and translating user feedback into actionable specifications
Can independently drive changes through production systems with strong execution and responsiveness
Care deeply about AI safety and model welfare, understanding the ethical implications of model behaviors

Strong Candidates May Also Have:

Background in philosophy, ethics, or psychology that informs thinking about model behaviors and values
Experience with RLHF, constitutional AI, or other alignment techniques
Track record of writing specifications or guidelines that shape complex system behaviors
Experience responding to safety incidents or behavioral issues in production AI systems
Formal training in ethics or moral philosophy
Published work or demonstrated expertise in AI safety or alignment
Experience building and maintaining evaluation frameworks for language models
Background in data science with emphasis on data quality and verification

This role offers a unique opportunity to directly shape how Claude behaves across all of Anthropic's products, ensuring our AI systems are safe, beneficial, and aligned with human values at scale.