Principal Software Engineer - Compute Infrastructure
$248,000 – $391,000 USD
Tech Stack
Responsibilities
- Define Platform Architecture: Lead initiatives to architect and transform the global enterprise compute platform, defining service tiers, SLAs, and automated cluster lifecycles.
- Operationalize Frontier AI Infrastructure: Build the operational foundation for an internal AI inference platform scaling to frontier-class models, developing automated remediation pipelines, hardware watchdogs, and telemetry.
- Drive Strategic Capacity & Scale: Collect and review system data for capacity planning, developing proactive strategies including public cloud bursting, hardware dogfooding, and evaluating alternative compute architectures.
- Build the "Paved Road": Collaborate with NVIDIA engineering teams to drive cultural adoption of standard platforms, designing self-service architectures, APIs, and Terraform/OpenTofu providers.
- Lead Complex Migrations: Evaluate existing application architectures and drive the migration of massive legacy workloads into modern Kubernetes orchestration.
Soft Skills
Team Leadership
Benefits
- Equity
Culture
Mission-DrivenInnovationDiverse LeadershipWork-Life Balance
Requirements
Required: Bachelor’s degree in Engineering, Computer Science, Mathematics, or related field, or equivalent experience
Regions: Us
Get jobs like this in your inbox
Weekly Kubernetes, Kubevirt, Openshift hiring trends and salary data — free.
Join 6 engineers getting weekly insights
Get market intelligence in your inbox
Free weekly insights on tech hiring trends, salaries, and in-demand stacks.
Already a subscriber? Sign in
About NVIDIA
Industry: ai
Size: enterprise
NVIDIA is a technology company focused on AI systems, building products like the NeMo Platform for developing, evaluating, deploying, and operating AI systems at scale.
View company profile →Compensation
Base salary: $248,000 – $391,000 USD
Equity: eligible for equity