This job is no longer available

The job listing you are looking for has expired.
Please browse our latest remote jobs.

Added: 22 days ago
Location: Amsterdam, Netherlands
Type: Full time
Salary: Not Specified

Databricks is seeking a Senior Incident Manager to lead the incident management lifecycle for our critical services. You will own incident response from detection through resolution, coordinate cross-functional teams, communicate effectively with stakeholders, and drive continuous improvements to prevent recurrence. This role is based in Amsterdam, Netherlands, and may require on-call rotations.

Overview

As a Sr. Incident Manager at Databricks, you will be responsible for orchestrating incident response, containment, and remediation across complex data platform environments. You will partner with SRE, Platform Engineering, Security, Product, and Support teams to ensure rapid recovery and long-term reliability improvements.

Responsibilities

  • Lead end-to-end incident response for critical services, including triage, containment, escalation, and resolution.
  • Manage on-call rotations and ensure timely, clear communications to stakeholders and cross-functional teams.
  • Develop, maintain, and improve runbooks, playbooks, and incident response procedures.
  • Conduct post-incident reviews and root-cause analyses; track remediation actions and verify completion.
  • Collaborate with SRE, Platform Engineering, Security, Product, and Support to restore services and implement mitigations.
  • Define and monitor incident metrics; report on service health and reliability improvements.
  • Participate in problem management and change management processes to minimize risk during deployments.
  • Contribute to capacity planning and readiness exercises to reduce future incidents.

Qualifications

  • 5+ years of incident management, operations, or site reliability engineering experience in large-scale environments.
  • Strong knowledge of ITIL or equivalent incident management frameworks.
  • Excellent written and verbal communication; ability to coordinate across multiple teams and stakeholders.
  • Experience with cloud platforms and monitoring/alerting tools; comfortable working in a fast-paced environment.
  • Strong analytical and problem-solving skills; ability to perform root-cause analysis and drive improvements.
  • Bachelor’s degree in Computer Science, Information Systems, or related field (or equivalent work experience).

Benefits

  • Competitive compensation and benefits package.
  • Opportunity to work with a leading data and AI platform.
  • Health benefits, retirement plans, and professional development opportunities.
  • Flexible work arrangements and a collaborative, inclusive culture.

Location

Amsterdam, Netherlands

How to apply

Apply via the Databricks Careers page.
