All Jobs
Anthropic

AI Safety Researcher

Anthropic|San Francisco, United StatesRemote
$200k - $370kUSDVerified
Apply Now

Job Description

Research AI safety and alignment for frontier models. Focus on interpretability, red teaming, and constitutional AI.

Responsibilities

  • Conduct safety research
  • Red team Claude models
  • Develop interpretability tools
  • Write research papers

Benefits

Top-tier equityFull health coverageFlexible workResearch stipend

Category

AI Research Scientist

Posted

1/5/2025

Ready to Apply?

Applications go directly to Anthropic's career portal

Apply on Anthropic