Responsibilities
- Own the red-teaming and adversarial evaluation pipeline for Reflection’s models, continuously probing for failure modes across security, misuse, and alignment gaps.
- Work hand-in-hand with the Alignment team to translate safety findings into concrete guardrails, ensuring models behave reliably under stress and adhere to deployment policies.
- Validate that every release meets the lab’s risk thresholds before it ships, serving as a critical gatekeeper for our open-weight releases.
- Develop scalable, automated safety benchmarks that evolve alongside our model capabilities, moving beyond static datasets to dynamic adversarial testing.
- Research and implement state-of-the-art jailbreaking techniques and defenses to stay ahead of potential vulnerabilities in the wild.
Benefits
- Equity
- Gym Membership
- Health Insurance
- Parental Leave
Culture
- Fast-Paced
- Startup Energy
- Impact-Oriented
- Team Celebrations
Requirements
Required: Graduate degree (MS or PhD) in Computer Science, Machine Learning, or related discipline, or equivalent practical experience in AI Safety.
About Reflection AI
Industry: AI
Size: startup
Reflection AI is building open superintelligence and making it accessible to all, developing open-weight AI models for individuals, agents, enterprises, and nation states.
Compensation
Equity: Equity structured to recognize and retain the best talent globally.