Machine Learning Scientist - Open Source Lead

Posted Dec 18, 2025

arenaBay Areafull-timelead

Apply Now

Tech Stack

ExpressGoPythonRustTypeScript

Responsibilities

Design and conduct experiments to evaluate AI model behavior across reasoning, style, robustness, and user preference dimensions.
Develop new metrics, methodologies, and evaluation protocols that go beyond traditional benchmarks.
Analyze large-scale human voting and interaction data to uncover insights into model performance and user preferences.
Communicate results with the broader research community via academic papers, educational content, and conference talks.
Collaborate with engineers to implement and scale research findings into production systems.

Benefits

Equity
Gym Membership
Health Insurance
Learning Budget

Culture

Mission-DrivenTransparencyWork-Life BalanceCross-Functional TeamsMentorship Program

Requirements

Required: PhD or equivalent research experience in Machine Learning, Natural Language Processing, Statistics, or a related field

Regions: Us

About arena

Industry: ai

Size: startup

Arena is a platform for evaluating how AI models perform in the real world, founded by researchers from UC Berkeley's SkyLab, with a mission to measure and advance the frontier of AI for real-world use. Tens of millions of people use Arena monthly to evaluate frontier systems.

View company profile →

Compensation

Equity: competitive equity

Similar Jobs

Machine Learning Scientist

arena · Bay Area

Research Scientist

openrouter · Remote (US)

Remote

VP of Research, Machine Learning

bjakcareer · United States

Research Scientist