Staff Site Reliability Engineer
Posted
$119,000 USD
Tech Stack
Responsibilities
- Own the reliability of a large-scale cloud service by partnering with Engineering and Network teams to define requirements, conduct operability reviews, and contribute code/design docs for platform resilience.
- Develop and operate end-to-end observability (metrics/logs/traces, dashboards, alerting) and incident tooling to manage SLOs/error budgets, reduce noise, and improve system detection and diagnosis.
- Participate in an on-call rotation to lead full-cycle incident response; perform deep cross-stack troubleshooting to drive permanent software fixes and codify learnings into runbooks and tests.
- Build and maintain everything-as-code for fleet and service lifecycle, driving provisioning, configuration, release automation, canary deployments, and complex rollout/rollback workflows.
- Continuously improve platform hygiene through consistent OS/app upgrades, dependency/vulnerability patching, capacity and performance tuning, and strict CI/CD validation prior to production rollouts.
Benefits
- 401k
- Equity
- Health Insurance
- Learning Budget
- Parental Leave
- Remote Work
Culture
Impact-OrientedConstructive, Honest DebateHigh-Performing TeamsCustomer-ObsessedCollaboration
Requirements
Regions: Us
Get jobs like this in your inbox
Weekly AWS, Express, Git hiring trends and salary data — free.
Join 6 engineers getting weekly insights
Get market intelligence in your inbox
Free weekly insights on tech hiring trends, salaries, and in-demand stacks.
Already a subscriber? Sign in
About Zscaler
Industry: cybersecurity
Size: enterprise
Zscaler accelerates digital transformation, leveraging an AI-forward approach and the world's largest security data lake to power its cloud-native Zero Trust Exchange platform, protecting customers from cyberattacks and data loss.
View company profile →Compensation
Base salary: $119,000 USD