Tech Stack
Responsibilities
- Design technically grounded adversarial prompts to test models across the cyber kill chain (reconnaissance through exfiltration and impact).
- Evaluate model-generated code and technical output for functional correctness and real-world exploitability.
- Test model behavior across offensive categories like malware generation, vulnerability exploitation, and social engineering.
- Simulate attacker personas at varying skill levels to assess how model risk scales with user sophistication.
- Contribute to the development and refinement of cybersecurity-specific evaluation frameworks and threat models.
Benefits
- Learning Budget
Culture
Collaborative Space
Requirements
Regions: Us
Get jobs like this in your inbox
Weekly Git, Java, JavaScript hiring trends and salary data — free.
Join 6 engineers getting weekly insights
Get market intelligence in your inbox
Free weekly insights on tech hiring trends, salaries, and in-demand stacks.
Already a subscriber? Sign in
About handshake
Industry: edtech
Size: enterprise
Handshake was founded on the belief that everyone deserves a path to a great career, connecting 25 million job seekers with over 1 million employers and 1,600 educational institutions. Handshake AI, started in 2025, is a fast-growing AI data business working with frontier AI labs to create evaluations, publish benchmarks, and advance AI data.
View company profile →