Tech Stack
Responsibilities
- Design and build high-performance inference runtimes for large-scale AI models, focusing on efficiency, reliability, and scalability.
- Own and optimize core execution paths, including model execution, memory management, batching, and scheduling.
- Develop and improve distributed inference across multiple GPUs, including parallelism strategies, communication patterns, and runtime coordination.
- Implement and optimize inference-critical operators and kernels informed by real-world workloads.
- Partner closely with research teams to ensure new model architectures are supported accurately and efficiently in inference systems.
Culture
Cross-Functional CollaborationImpact-OrientedMission-DrivenInclusive Hiring
Get jobs like this in your inbox
Weekly AWS, Rust, TypeScript hiring trends and salary data — free.
Join 6 engineers getting weekly insights
Get market intelligence in your inbox
Free weekly insights on tech hiring trends, salaries, and in-demand stacks.
Already a subscriber? Sign in
About OpenAI
Industry: ai
Size: medium
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity by pushing the boundaries of AI capabilities and safely deploying them.
View company profile →