Agent Post-Training, Frontier Evals and Environments Research

Posted Jun 26, 2026

OpenAI · San Francisco · Full-time · Staff

Tech Stack

Machine Learning, Software Engineering, Statistics, Large Language Models, Reinforcement Learning, RLHF, RLAIF, Synthetic Data, Model Training, Coding Agents, Tool-Using Agents, Production ML Systems


Responsibilities

Create ambitious RL environments to push models to their limits and measure frontier model capabilities, skills, and behaviors.
Develop new methodologies for automatically exploring the behavior of these models.
Dive deep into the science of measurement, including understanding scalability, reliability, and variance of evaluation methodology.
Help steer training for the largest training runs and see the future first.
Design scalable systems and processes to support continuous evaluation.

Soft Skills

Research, Engineering Execution, Cross-Functional Collaboration, Communication

Culture

Mission-Driven, Customer-Obsessed, Work-Life Balance

Requirements

Regions: US

About OpenAI

Industry: AI

Size: Large

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity by developing and safely deploying AI systems. They are committed to pushing the boundaries of AI capabilities while ensuring safety and human needs are at the core of their work.

