Responsibilities
- Evaluate and select silicon platforms (GPUs, NPUs, and specialized accelerators) for on-device and edge deployment of OpenAI models.
- Work closely with research teams to co-design model architectures that meet real-world deployment constraints such as latency, memory, power, and bandwidth.
- Analyze and model system performance, identifying tradeoffs between model design, memory hierarchy, compute throughput, and hardware capabilities.
- Partner with hardware vendors and internal infrastructure teams to bring up new accelerators and ensure efficient execution of transformer workloads.
- Build and lead a team of engineers responsible for implementing the low-level inference stack, including kernel development and runtime systems.
Benefits
- Health Insurance
Culture
- Hybrid Work
- Mission-Driven
- Cross-Functional Teams
- Inclusive Hiring
Requirements
Regions: US
About OpenAI
Industry: AI
Size: medium
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity by pushing the boundaries of AI capabilities and safely deploying them.