Senior Software Engineer in Test (AI Agentic Systems)
$99,200 – $124,000 USD
Tech Stack
Responsibilities
- Build and maintain a versioned library of "Grounding Data" results to define "Ground Truth."
- Design automated "LLM-grading-LLM" workflows using custom rubrics to score factual grounding and policy compliance.
- Develop testing libraries to validate semantic equivalence and numerical accuracy in agent outputs.
- Programmatically verify that mandatory tools were invoked with correct arguments using Vertex AI traces.
- Integrate automated regression runs with CI/CD for every prompt or model update in Vertex AI Prompt Management.
Soft Skills
Architectural TestingHealthcare ClaimsHIPAA/PhiShadow Mode Monitoring
Benefits
- Health Insurance
- 401k
- Flexible PTO
- Equity
- Hybrid Work
- Internal Mobility
- Mentorship Program
- Learning Budget
Culture
Mission-DrivenCollaborative SpaceWork-Life BalanceInternal MobilityMentorship Program
Requirements
Regions: Us
Get jobs like this in your inbox
Weekly Python, Pytest, Vertex AI hiring trends and salary data — free.
Join 6 engineers getting weekly insights
Get market intelligence in your inbox
Free weekly insights on tech hiring trends, salaries, and in-demand stacks.
Already a subscriber? Sign in
About Collective Health
Industry: healthtech
Collective Health is transforming health benefits engagement for employers and their people through cutting-edge technology, compassionate service, and world-class user experience design.
View company profile →Compensation
Base salary: $99,200 – $124,000 USD
Equity: 115000 stock options