Software Engineer - GenAI inference
Posted
$142,200 USD
Tech Stack
Responsibilities
- Contribute to the design and implementation of the inference engine for large-scale LLM inference.
- Collaborate with researchers to integrate new model architectures and features into the engine.
- Optimize for latency, throughput, memory efficiency, and hardware utilization across GPUs and accelerators.
- Build and maintain instrumentation, profiling, and tracing tooling to identify and resolve performance bottlenecks.
- Develop and enhance scalable routing, batching, scheduling, memory management, and dynamic loading for inference workloads.
Benefits
- Equity
Culture
Cross-Functional TeamsTransparent LeadershipInclusive Hiring
Requirements
Required: BS in Computer Science or a related field
Preferred: MS/PhD in Computer Science or a related field
Regions: Us
Get jobs like this in your inbox
Weekly Express, Node.js, TypeScript hiring trends and salary data — free.
Join 6 engineers getting weekly insights
Get market intelligence in your inbox
Free weekly insights on tech hiring trends, salaries, and in-demand stacks.
Already a subscriber? Sign in
About Databricks
Industry: saas
Size: large
Databricks is a data and AI company that builds and operates the world’s best data and AI infrastructure platform, enabling data teams to turn deep data insights into business impact.
View company profile →Compensation
Base salary: $142,200 USD
Equity: equity
Bonus: eligibility for annual performance bonus