Agent Post-Training, API & Power Users

Posted Jun 26, 2026

OpenAISan Franciscofull_timemid

Tech Stack

Machine LearningSoftware EngineeringSystemsStatisticsLlmsRl/Rlhf/RlaifSynthetic DataCoding AgentsTool-Using AgentsAPI ProductsProduction ML Systems

Solid badges = required, outlined = preferred

Responsibilities

Design and run experiments to improve model behavior in API and power-user workflows, including function calling, tool use, coding, and planning.
Build evals, graders, and environments from real developer and power-user workflows, then convert observed failures into training data, hypotheses, and shipped improvements.
Partner with API and power-users to identify high-leverage behavior gaps and translate product signals into post-training interventions.
Improve how models behave when composed into systems, focusing on reliable tool use, respecting developer intent, and maintaining coherence across multi-step tasks.
Own end-to-end model behavior projects, from qualitative failure analysis through data generation, training experiments, eval design, integration, and launch readiness.

Soft Skills

Applied ResearchCross-Functional CollaborationProblem Solving

Culture

Mission-DrivenCross-Functional TeamsInclusive Hiring

Requirements

Regions: Us

About OpenAI

Industry: ai

Size: large

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity by developing and safely deploying AI systems. They are committed to pushing the boundaries of AI capabilities while ensuring safety and human needs are at the core of their work.

View company profile →

Similar Jobs

Agent Post-Training, Artifacts Research

OpenAI · San Francisco

Agent Post-Training, Computer Use Research

OpenAI · San Francisco

Agent Post-Training, Personality

OpenAI · San Francisco

Agent Post-Training, Connectors Research

OpenAI · San Francisco

Agent Post-Training, Context Research

OpenAI · San Francisco