This job is no longer available

The job listing you are looking has expired.
Please browse our latest remote jobs.

See open jobs →
← Back to all jobs

Site Reliability Engineering (SRE) Manager

Fully Remote

Added
1 month ago
Type
Full-time
Salary
$190K - $920K

At Netflix, we are shaping the future of global entertainment, bringing moments of joy to 200+ million customers. Our mission is to entertain the world, and we strive to deliver the best user experience possible through innovation and a relentless focus on quality. The N-Tech Site Reliability Engineering (SRE) team at Netflix ensures the reliability, scalability, and efficiency of our workforce-focused products and services. The SRE team provides services such as Incident Response, Reliability Engineering consulting, and limited embedded SRE engagements.

We seek a highly experienced and motivated SRE Manager to lead a team of 11 high-performing Site Reliability Engineers. You will play a crucial role in maintaining the reliability and efficiency of our services, ensuring that our workforce-enabling products and services are reliable. You will have a proven track record of leading top-performing teams in complex, fast-paced environments and will excel in organizing and motivating a team amidst rapid growth and change.

This leadership role is rewarding for people who have a passion for growing talent, building a high-impact team, and leveraging engineering principles to improve reliability. You’ll be a key engineering leader in the Netflix Technology Services Organization and contribute to cross-functional initiatives supporting engineering teams across Netflix. If this excites you, we invite you to bring your unique career and life experiences to enrich the culture and diversity of our team.

RESPONSIBILITIES

  • You will lead, mentor, and develop a team of 11 SREs, fostering a culture of collaboration, innovation, and continuous improvement.
  • You will communicate effectively with stakeholders at all levels, providing updates on team performance, project status, and incident resolutions.
  • You will ensure an appropriate balance exists between incident management's reactive work and the proactive work of reducing future issues.
  • You will develop and implement strategies to improve the reliability, performance, and scalability of the products and services supported by the SRE team.
  • You will collaborate with cross-functional teams (engineering, product, and operations) to drive critical projects and initiatives.
  • You will influence and improve our incident management lifecycle to identify, mitigate, and learn from reliability risks.
  • You will oversee the design, implementation, and maintenance of monitoring, alerting, and incident response systems.
  • You will ensure the team follows best practices in infrastructure as code, continuous integration/deployment (CI/CD), and system automation.
  • You will cultivate and maintain high-trust relationships with internal and external partners.
  • You will advocate for the SRE team within the broader organization, representing their needs and concerns.
  • WE VALUE

  • Curiosity about how complex socio-technical systems successfully operate at scale when failure is inevitable
  • People who see influence as their preferred tool for cultivating relationships and helping the organization improve
  • Collaboration and continuous improvement are fundamental to growing the team’s impact over time
  • A desire to learn and readiness to mentor others both within and outside of the team
  • SKILLS AND EXPERIENCE

  • Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent practical experience)
  • Proven success in leading high-performing SRE or DevOps teams in a large-scale, fast-paced environment
  • Outstanding communication and interpersonal skills, with the ability to build strong relationships with team members and stakeholders
  • Strong technical background with hands-on experience in cloud computing, system architecture, automation, and monitoring
  • Excellent problem-solving skills with a focus on root cause analysis and proactive improvements
  • Exceptional organizational skills, with the ability to manage multiple priorities and projects simultaneously
  • Experience with tools and technologies such as AWS, Kubernetes, Terraform, Prometheus, Grafana, Jenkins, and similar.
  • Additional Information

    Our compensation structure consists solely of an annual salary; we do not have bonuses. You choose each year how much of your compensation you want in salary versus stock options. To determine your personal top of market compensation, we rely on market indicators and consider your specific job family, background, skills, and experience to determine your compensation in the market range. The range for this role is $190,000 - $920,000.

    Netflix provides comprehensive benefits including Health Plans, Mental Health support, a 401(k) Retirement Plan with employer match, Stock Option Program, Disability Programs, Health Savings and Flexible Spending Accounts, Family-forming benefits, and Life and Serious Injury Benefits. We also offer paid leave of absence programs. Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off. Full-time salaried employees are immediately entitled to flexible time off. See more detail about our Benefits here.

    Netflix is a unique culture and environment. Learn more here.

    We are an equal-opportunity employer and celebrate diversity, recognizing that diversity of thought and background builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

    Share job

    Help us maintain the quality of jobs posted on Empllo!

    Is this position not a remote job?

    Let us know!
    Similar Engineering Jobs
    See more Engineering jobs →
    Algolia logo
    On-site
    YC Company
    🇫🇷 France
    +1
    Full-Time
    💰 Salary not provided
    Circle logo
    Blockdaemon logo
    On-site
    Full-Time
    💰 Salary not provided
    Parity logo
    Fully Remote
    Full-Time
    💰 Salary not provided