Staff Software Engineer - GenAI inference
$190,900 USD
Responsibilities
- Own and drive the architecture, design, and implementation of the inference engine for large-scale LLM inference.
- Partner closely with researchers to integrate new model architectures or features into the engine.
- Lead end-to-end optimization for latency, throughput, memory efficiency, and hardware utilization across GPUs and accelerators.
- Define and guide standards for building and maintaining instrumentation, profiling, and tracing tooling.
- Architect scalable routing, batching, scheduling, memory management, and dynamic loading mechanisms for inference workloads.
Benefits
- Equity
Culture
- Cross-Functional Teams
- Transparent Leadership
Requirements
Required: BS in Computer Science or a related field
Preferred: MS/PhD in Computer Science or a related field
Regions: US
About Databricks
Industry: SaaS
Size: large
Databricks is a data and AI company that builds and operates the world’s best data and AI infrastructure platform, enabling data teams to turn deep data insights into business impact.
Compensation
Base salary: $190,900 USD
Equity: equity
Bonus: eligible for an annual performance bonus