This job is no longer available

The job listing you are looking for has expired.
Please browse our latest remote jobs.

Staff Site Reliability Engineer, Incident and Disaster

Added: 21 days ago
Type: Full time
Salary: Not Specified

Dropbox – Staff Site Reliability Engineer, Incident and Disaster

Dropbox is seeking a Staff Site Reliability Engineer focused on Incident and Disaster to join the Engineering team. This senior IC role will own end-to-end incident response, disaster recovery planning, and reliability improvements across Dropbox's services. You will lead incident command during outages, help define SLOs, and mentor other SREs as part of a strong, blameless culture. You will partner with product, platform, and security teams to ensure high availability and fast recovery times, and you will help evolve our reliability practices at scale.

Responsibilities

  • Lead and coordinate incident response during outages; own on-call readiness and post-incident reviews.
  • Design and implement disaster recovery strategies; test disaster recovery plans and backup/recovery processes.
  • Improve monitoring, alerting, and observability; define service level indicators and objectives (SLIs/SLOs).
  • Build reliability tooling and automation to reduce toil and improve resiliency.
  • Collaborate with product, platform, and security teams to drive reliability initiatives across services.
  • Mentor junior SREs and contribute to a healthy, blameless incident culture.

Qualifications

  • 8+ years of experience in Site Reliability Engineering, DevOps, or a related field.
  • Strong experience with cloud platforms (AWS and/or GCP) and container orchestration (Kubernetes).
  • Proficiency with monitoring and observability stacks (Prometheus, Grafana, logging/trace tools).
  • Programming/scripting skills in Python, Go, or Bash.
  • Demonstrated incident management and disaster recovery expertise; ability to lead post-incident reviews and drive improvements.
  • Excellent communication, collaboration, and leadership abilities for cross-functional partnerships.

What We Offer

  • Remote-friendly work environment (Remote - US: Select locations).
  • Competitive compensation, equity, and benefits.
  • Generous time off, health benefits, and professional development opportunities.

Apply via the Dropbox listing: Dropbox Careers.
