Responsibilities
- Design technically grounded adversarial prompts that test whether models provide meaningful uplift toward CBRNE threats
- Evaluate model outputs for technical accuracy, assessing whether responses contain genuinely dangerous information
- Probe dual-use knowledge boundaries, testing how models handle queries that blend legitimate scientific, medical, or industrial use cases with potential weapons applications
- Test multi-step and multi-turn attack chains that simulate how a motivated actor might extract dangerous information incrementally
- Score model responses against structured harm taxonomies and severity rubrics calibrated to real-world risk
Benefits
- Equity
- Health Insurance
- Learning Budget
Culture
- Collaborative Space
Requirements
Required: Graduate-level education or equivalent professional experience in a relevant CBRNE field (chemistry, biochemistry, microbiology, virology, nuclear physics, radiochemistry, materials science, munitions/ordnance, chemical engineering, or closely related disciplines)
Regions: US
About Handshake
Industry: EdTech
Size: enterprise
Handshake was founded on the belief that everyone deserves a path to a great career. It connects 25 million job seekers with over 1 million employers and 1,600 educational institutions. Handshake AI, started in 2025, is a fast-growing AI data business that works with frontier AI labs to create evaluations, publish benchmarks, and advance AI data.