Research Scientist, Safety Post Training
$216,000 USD
Responsibilities
- Design and run post-training pipelines to study how training choices affect model safety, robustness, and alignment properties.
- Develop interpretability-informed evaluations that reveal how and why models produce unsafe, deceptive, or otherwise undesirable behaviors.
- Collaborate with policymakers, engineers, and other researchers to translate post-training and interpretability findings into actionable safety standards, evaluation benchmarks, and best practices.
Benefits
- 401k
- Equity
- Health Insurance
- Learning Budget
Culture
- Mission-Driven
- Cross-Functional Teams
- Inclusive Hiring
Requirements
Regions: US
About Scale
Industry: AI
Size: enterprise
Scale AI is building the data infrastructure behind the world's most capable AI systems, providing high-quality data and full-stack technologies to power leading models and help enterprises and governments deploy AI applications.
Compensation
Base salary: $216,000 USD
Equity: equity-based compensation, subject to Board of Directors approval