Tech Stack
Responsibilities
- Define and drive the end-to-end infrastructure architecture for Deepgram's AI/ML workloads across production inference and research training.
- Design multi-cloud and hybrid infrastructure strategies that balance performance, reliability, cost, and vendor flexibility.
- Architect compute orchestration systems that efficiently schedule and manage GPU and CPU workloads across heterogeneous infrastructure.
- Design storage architectures that handle massive datasets for speech and audio ML, from high-throughput training to low-latency model serving.
- Lead capacity planning across all infrastructure dimensions, modeling growth and ensuring Deepgram can scale ahead of demand.
Benefits
- 401k
- Flexible Hours
- Gym Membership
- Health Insurance
- Learning Budget
- Parental Leave
- Remote Stipend
- Unlimited PTO
Culture
- AI-First Mindset
- Fast-Paced
- Experimentation
- Adaptability
- Continuous Learning
Requirements
Regions: US
About Deepgram
Industry: AI
Size: small
Deepgram is the leading Voice AI platform, providing real-time APIs for speech-to-text, text-to-speech, and production-grade voice agents, and is trusted by over 200,000 developers and 1,300+ organizations. The company's voice-native foundation models offer unmatched accuracy, low latency, and cost efficiency, and have processed over 50,000 years of audio.