Tech Stack
Responsibilities
- Define and lead fairness and bias-testing strategies for AI-assisted People processes, models, agents, and decision-support systems.
- Design rigorous algorithmic audits and validation studies, including adverse-impact analysis, subgroup and intersectional evaluation, error-rate analysis, calibration, measurement invariance, reliability, criterion-related validity, and sensitivity testing.
- Identify appropriate fairness criteria for each use case, evaluate tradeoffs among competing definitions of fairness, and clearly document assumptions, limitations, and residual risks.
- Evaluate end-to-end human-AI decision systems, including model outputs, user behavior, human overrides, escalation pathways, and whether AI assistance changes the quality, consistency, or equity of decisions.
- Develop evaluation approaches for generative and agentic AI, including test-set design, counterfactual testing, behavioral evaluation, human-rating studies, robustness testing, and analysis of disparate performance across populations and contexts.
Soft Skills
Algorithmic FairnessBias MeasurementResponsible AIPsychometricsApplied Statistics
Benefits
- Equity
Culture
Mission-DrivenCustomer-ObsessedInclusive Hiring
Requirements
Preferred: Advanced degree in Quantitative Psychology, Computer Science, Statistics, Economics, Data Science, Behavioral Science, or a related quantitative field; PhD preferred
Regions: Us
Get jobs like this in your inbox
Weekly Machine Learning Models, Generative AI Systems, Python hiring trends and salary data — free.
Join 6 engineers getting weekly insights
Get market intelligence in your inbox
Free weekly insights on tech hiring trends, salaries, and in-demand stacks.
Already a subscriber? Sign in
About OpenAI
Industry: ai
Size: large
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity by safely building and deploying AGI through its products.
View company profile →