Tech Stack
Responsibilities
- Pioneer the development of Latent Space Models (LSMs) to solve fundamental data, scale, and cost challenges for voice AI.
- Build next-generation neural audio codecs for extreme compression and high-fidelity reconstruction across diverse audio corpora.
- Develop steerable generative models to synthesize diverse human speech from latent representations.
- Design embedding systems to factorize latent space into interpretable dimensions (speaker, content, style, environment, channel effects) for precise control and data amplification.
- Leverage latent recombination to generate synthetic audio data at scale, enabling multimodal speech-to-speech systems that understand and respond empathetically to any human.
Benefits
- 401k
- Flexible Hours
- Gym Membership
- Health Insurance
- Learning Budget
- Parental Leave
- Remote Stipend
- Unlimited PTO
Culture
AI-FirstFast-PacedExperimentationAdaptabilityContinuous Learning
Requirements
Regions: Worldwide
Get jobs like this in your inbox
Weekly AWS, Express, Next.js hiring trends and salary data — free.
Join 6 engineers getting weekly insights
Get market intelligence in your inbox
Free weekly insights on tech hiring trends, salaries, and in-demand stacks.
Already a subscriber? Sign in
About Deepgram
Industry: ai
Size: small
Deepgram is the leading Voice AI platform providing real-time APIs for speech-to-text, text-to-speech, and building production-grade voice agents, trusted by over 200,000 developers and 1,300+ organizations. The company's voice-native foundation models offer unmatched accuracy, low latency, and cost efficiency, having processed over 50,000 years of audio.
View company profile →