Senior Site Reliability Engineer

Added
19 days ago
Type
Full time

Related skills

puppet linux grafana prometheus python

📋 Description

  • Operate Wikimedia’s public-facing infrastructure (deployment, maintenance, troubleshooting).
  • Implement and use Puppet and Kubernetes for deployment.
  • Automate installation, configuration, and maintenance of services.
  • Collaborate with product teams to design scalable services.
  • Participate in a 24/7 on-call rotation and incident response.
  • Mentor peers and work with a globally distributed team.

🎯 Requirements

  • 6+ years in SRE/Operations/DevOps.
  • Experience with shell scripting, Python, Puppet, and Ansible.
  • Experience with distributed caching systems and optimization.
  • Strong knowledge of TCP/IP, HTTP, TLS, DNS.
  • Linux packaging on Debian; strong Linux troubleshooting.
  • Incident response experience and post-incident reviews.

🎁 Benefits

  • Remote-first organization with teams in 40+ countries.
  • Open source software and community involvement.
  • Inclusive, equal-opportunity workplace.
  • Global, asynchronous collaboration across time zones.