Research Staff, Voice AI Foundations

Posted Jun 24, 2026

Deepgram · USA | Remote · Remote · Full-time · Staff

Tech Stack

AWS, Express, Next.js, Rust, TypeScript

Responsibilities

Pioneer the development of Latent Space Models (LSMs) to solve fundamental data, scale, and cost challenges for voice AI.
Build next-generation neural audio codecs for extreme compression and high-fidelity reconstruction across diverse audio corpora.
Develop steerable generative models to synthesize diverse human speech from latent representations.
Design embedding systems to factorize latent space into interpretable dimensions (speaker, content, style, environment, channel effects) for precise control and data amplification.
Leverage latent recombination to generate synthetic audio data at scale, enabling multimodal speech-to-speech systems that understand and respond empathetically to any human.

Benefits

Health Insurance

Culture

AI-First, Fast-Paced, Experimentation, Adaptability, Continuous Learning, Cross-Functional Teams, Customer-Obsessed, Collaborative Space, Inclusive Hiring, ERG/Affinity Groups

Requirements

Regions: Worldwide

About Deepgram

Industry: AI

Size: small

Deepgram is the leading Voice AI platform providing real-time APIs for speech-to-text, text-to-speech, and building production-grade voice agents, trusted by over 200,000 developers and 1,300+ organizations. The company's voice-native foundation models offer unmatched accuracy, low latency, and cost efficiency, having processed over 50,000 years of audio.
