RL Deep Learning Engineer (Remote)
$400,000 USD
Tech Stack
Responsibilities
- Build and maintain the evaluation harness and RL environment infrastructure, including task runners, sandboxed environments, and scoring logic that can scale to thousands of parallel agents.
- Own the data pipeline for transforming freshly collected court filings into benchmark and RL tasks for model training.
- Integrate with partner harnesses and model APIs to ensure contamination-free evaluations.
- Collaborate with attorneys to translate legal workflows into structured, scorable task formats using the Harbor spec.
Soft Skills
System DesignSelf-ManagementCommunication
Culture
Autonomous TeamsDeep Work FocusFast-Paced
Requirements
Regions: Worldwide
Get jobs like this in your inbox
Weekly Python, TypeScript, Llm Evaluation hiring trends and salary data — free.
Join 6 engineers getting weekly insights
Get market intelligence in your inbox
Free weekly insights on tech hiring trends, salaries, and in-demand stacks.
Already a subscriber? Sign in
About midpage.ai
Industry: legal tech
Size: startup
midpage.ai is building the largest case law dataset to power its lawyer-facing AI platform and B2B data services, covering US laws and court decisions.
View company profile →Compensation
Base salary: $400,000 USD