Senior GenAI Research Engineer - Optimization and Kernels
Posted
$166,000 USD
Tech Stack
Responsibilities
- Drive performance improvements through advanced optimization techniques including kernel fusion, mixed precision, memory layout optimization, tiling strategies, and tensorization for training-specific patterns.
- Design, implement, and optimize high-performance GPU kernels for training workloads (e.g., attention mechanisms, custom layers, gradient computation, activation functions) targeting NVIDIA architectures.
- Design and implement distributed training frameworks for large language models, including parallelism strategies (data, tensor, pipeline, ZeRO-based) and optimized communication patterns for gradient synchronization and collective operations.
- Profile, debug, and optimize end-to-end training workflows to identify and resolve performance bottlenecks, applying memory optimization techniques like activation checkpointing, gradient sharding, and mixed precision training.
- Advance the scientific frontier by creating new techniques that go beyond the state of the art in deep learning.
Benefits
- Equity
Culture
Cross-Functional TeamsMission-DrivenCustomer-ObsessedInclusive Hiring
Requirements
Required: BS/MS/PhD in Computer Science or related field
Get jobs like this in your inbox
Weekly Express, Go, Python hiring trends and salary data — free.
Join 6 engineers getting weekly insights
Get market intelligence in your inbox
Free weekly insights on tech hiring trends, salaries, and in-demand stacks.
Already a subscriber? Sign in
About Databricks
Industry: saas
Size: large
Databricks is a data and AI company that builds and operates the world’s best data and AI infrastructure platform, enabling data teams to turn deep data insights into business impact.
View company profile →Compensation
Base salary: $166,000 USD
Equity: equity
Bonus: eligibility for annual performance bonus