Research Scientist, Safety Post Training

Posted Jul 9, 2026

Scale · San Francisco, CA; New York, NY · full-time

$216,000 USD

Tech Stack

AWS
Rust
TypeScript

Responsibilities

Design and run post-training pipelines to study how training choices affect model safety, robustness, and alignment properties.
Develop interpretability-informed evaluations that reveal how and why models produce unsafe, deceptive, or otherwise undesirable behaviors.
Collaborate with policymakers, engineers, and other researchers to translate post-training and interpretability findings into actionable safety standards, evaluation benchmarks, and best practices.

Benefits

401k
Equity
Health Insurance
Learning Budget

Culture

Mission-Driven
Cross-Functional Teams
Inclusive Hiring

Requirements

Regions: US

About Scale

Industry: SaaS

Size: large

Scale AI develops reliable AI systems for the world's most important decisions, providing high-quality data and full-stack technologies to power leading AI models.

Compensation

Base salary: $216,000 USD

Equity: equity-based compensation, subject to Board of Directors approval
