Anthropic Fellows Program, Reinforcement Learning
Anthropic
London, UK; Ontario, CAN; Remote-Friendly, United States; San Francisco, CA
Remote, full-time, staff
$15,000 USD
Responsibilities
- Work on empirical projects aligned with Anthropic's research priorities, aiming to produce a public output like a paper submission.
- Build model-based tools to understand AI training data and improve its quality.
- Create Reinforcement Learning (RL) environments to enhance Claude models' capabilities or for safety-related tasks.
- Conduct research and implement solutions in areas such as RL algorithms.
- Analyze and debug model training processes.
Benefits
- Equity
- Learning Budget
- Parental Leave
- Remote Work
Culture
- Mission-Driven
- Impact-Oriented
- Collaborative Space
- Fast-Paced
- Inclusive Hiring
Requirements
Required: Bachelor’s degree or an equivalent combination of education, training, and/or experience
Regions: Canada, UK, US
About Anthropic
Industry: AI
Size: small
Anthropic's mission is to create reliable, interpretable, and steerable AI systems to ensure AI is safe and beneficial for users and society. The team is a quickly growing group of researchers, engineers, policy experts, and business leaders committed to building beneficial AI systems.
Compensation
Base salary: $15,000 USD