Lead Member of Technical Staff, Inference Infrastructure

Posted Apr 28, 2026

cohereSan Franciscofull-timestaff

Apply Now

Tech Stack

AWSAzureGCPGitGoKubernetesNext.jsTypeScript

Responsibilities

Lead the design and architecture of high-performance, scalable, and reliable machine learning systems for Cohere's AI platform.
Drive the strategy for deploying optimized NLP models to production in low latency, high throughput, and high availability environments.
Serve as a key point of contact for customers, leading the design of customized deployments to meet specific needs.
Mentor engineers to raise the technical bar across the Model Serving team.
Own compute/storage/network resource and cost management at an organizational level, including optimization strategies.

Benefits

401k
Gym Membership
Health Insurance
Learning Budget
Parental Leave
Remote Stipend
Remote Work

Culture

Open And Inclusive CultureWork-Life BalanceFast-PacedCross-Functional TeamsMentorship Program

Requirements

Regions: Us

About cohere

Industry: saas

Size: medium

Cohere is the leading security-first enterprise AI company, building cutting-edge foundation AI models and end-to-end products designed to solve real-world business problems.

View company profile →

cohere · New York

Senior Member of Technical Staff, Multimodal AI

cohere · San Francisco

Member of Technical Staff, MLE

cohere · San Francisco

Member of Technical Staff, Senior/Staff MLE

cohere · San Francisco

Member of Technical Staff, Search

cohere · United States