Related skills: SRE, distributed systems, AI, leadership, observability
📋 Description
- Define and evolve Dropbox’s company-wide reliability strategy to support AI-enabled development.
- Set multi-year reliability goals and roadmaps for observability, incidents, and service health.
- Lead cross-team initiatives to reduce reliability risk while supporting higher delivery velocity.
- Partner with engineering leaders and platform teams to improve monitoring, alerting, and SLOs/SLAs.
- Identify AI-enabled reliability risks and design scalable guardrails.
- Provide technical leadership and mentorship to engineers to raise reliability standards.
- Drive clear alignment with senior stakeholders on reliability priorities and risks.
🎯 Requirements
- BS degree in Computer Science or related technical field.
- 12+ years of software engineering, SRE, or infrastructure experience.
- Proven ability to define and deliver multi-year reliability strategies with measurable impact.
- Deep experience with distributed systems, observability, incident response, and SLOs/SLAs.
- Ability to diagnose complex issues, debug production systems, and automate operations.
- Ability to influence engineering roadmaps across multiple teams.
- Strong communication and cross-team collaboration skills.
Preferred Qualifications
- Experience adapting reliability strategies for AI-enabled workflows.
- Experience building observability, debugging, or developer productivity platforms.
- Experience leading reliability improvements in high-velocity deployment environments.
- Experience mentoring senior engineers and setting reliability standards.
- Familiarity with AI-enabled tooling and agentic development workflows.