Anthropic Fellows Program, Reinforcement Learning

Posted Jul 14, 2026

AnthropicLondon, UK; Ontario, CAN; Remote-Friendly, United States; San Francisco, CARemotefull-timestaff

$15,000 USD

Tech Stack

AWSGitGoNext.jsPythonRustTypeScript

Responsibilities

Work on empirical projects aligned with Anthropic's research priorities, aiming to produce a public output like a paper submission.
Build model-based tools to understand AI training data and improve its quality.
Create Reinforcement Learning (RL) environments to enhance Claude models' capabilities or for safety-related tasks.
Conduct research and implement solutions in areas such as RL algorithms.
Analyze and debug model training processes.

Benefits

Equity
Learning Budget
Parental Leave
Remote Work

Culture

Mission-DrivenImpact-OrientedCollaborative SpaceFast-PacedInclusive Hiring

Requirements

Required: Bachelor’s degree or an equivalent combination of education, training, and/or experience

Regions: Canada, Uk, Us

About Anthropic

Industry: saas

Size: medium

Anthropic is an AI safety and research company that builds reliable, interpretable, and steerable AI systems.

View company profile →

Compensation

Base salary: $15,000 USD

Similar Jobs

Anthropic Fellows Program, AI Safety

Anthropic · London, UK; Ontario, CAN; Remote-Friendly, United States; San Francisco, CA

Anthropic Fellows Program, AI Security

Anthropic · London, UK; Ontario, CAN; Remote-Friendly, United States; San Francisco, CA

Anthropic Fellows Program, The Anthropic Institute (Economics & Policy)

Anthropic · London, UK; Ontario, CAN; Remote-Friendly, United States; San Francisco, CA

Anthropic Fellows Program, ML Systems & Performance

Anthropic · London, UK; Ontario, CAN; Remote-Friendly, United States; San Francisco, CA

Anthropic Fellows Program

Anthropic · London, UK; Ontario, CAN; Remote-Friendly, United States; San Francisco, CA