Research Scientist, Interpretability
Posted
$350,000 USD
Tech Stack
Responsibilities
- Develop methods for understanding LLMs by reverse engineering algorithms learned in their weights.
- Design and run robust experiments, both quickly in toy scenarios and at scale in large models.
- Create and analyze new interpretability features and circuits to better understand how models work.
- Build infrastructure for running experiments and visualizing results.
- Work with colleagues to communicate results internally and publicly.
Benefits
- Equity
- Health Insurance
- Learning Budget
- Parental Leave
- Remote Work
Culture
Collaborative SpaceCross-Functional TeamsMission-DrivenTeam LeadershipTransparent Leadership
Requirements
Required: Bachelor’s degree or an equivalent combination of education, training, and/or experience
Regions: Us
Get jobs like this in your inbox
Weekly AWS, Git, Python hiring trends and salary data — free.
Join 6 engineers getting weekly insights
Get market intelligence in your inbox
Free weekly insights on tech hiring trends, salaries, and in-demand stacks.
Already a subscriber? Sign in
About Anthropic
Industry: ai
Size: small
Anthropic's mission is to create reliable, interpretable, and steerable AI systems to ensure AI is safe and beneficial for users and society. The team is a quickly growing group of researchers, engineers, policy experts, and business leaders committed to building beneficial AI systems.
View company profile →Compensation
Base salary: $350,000 USD
Equity: optional equity donation matching