Agent Post-Training, Context Research

Posted Jun 26, 2026

OpenAI · San Francisco · Full-time · Mid-level

Tech Stack

Machine Learning, Software Engineering, Statistics, LLMs, Reinforcement Learning, RLHF, Synthetic Data, Data Pipelines, Evals

Responsibilities

Design and run experiments that improve scaling of compute on context.
Own end-to-end improvements to the post-training stack, including RL, data pipelines, graders, reward signals, evals, diagnostics, and model-behavior analysis.
Build evals and environments that expose the next set of model failures, then turn those failures into training data, product fixes, or new research directions.
Partner with Codex and ChatGPT product teams to understand what users need and translate product signal into model improvements.
Work on early-training and alignment interventions, including data mixtures, objectives, synthetic data, and eval loops that shape downstream agent behavior.

Soft Skills

System Design, Cross-Functional Collaboration, Problem Solving

Culture

Mission-Driven, Customer-Obsessed, Impact-Oriented, Autonomous Teams

Requirements

Regions: US

About OpenAI

Industry: AI

Size: Large

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity by developing and safely deploying AI systems. They are committed to pushing the boundaries of AI capabilities while ensuring safety and human needs are at the core of their work.


Agent Post-Training, Frontier Evals and Environments Research

OpenAI · San Francisco

Agent Post-Training, Artifacts Research

OpenAI · San Francisco

Agent Post-Training, Connectors Research

OpenAI · San Francisco

Agent Post-Training, API & Power Users

OpenAI · San Francisco