Senior Site Reliability Engineer

Added
less than a minute ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

docker ansible terraform aws python

πŸ“‹ Description

  • Collaborate with engineering teams to design scalable, secure, and highly available systems.
  • Establish and manage SLOs and SLAs for ClickHouse Cloud.
  • Ensure monitoring and alerting across all components to detect incidents.
  • Improve incident response processes and post-mortem analysis with the support team.
  • Continuously improve reliability and performance of ClickHouse services.
  • Plan and drive Chaos initiatives across Engineering teams.

🎯 Requirements

  • Bachelor's or Master's degree in Computer Science or a related field.
  • At least 8 years of experience in Site Reliability Engineering.
  • Previous experience using ClickHouse in production.
  • Hands-on experience with Go and/or Python.
  • Cloud platforms such as AWS, Azure, or Google Cloud Platform.
  • Hands-on experience with Kubernetes or Docker Swarm.

🎁 Benefits

  • Flexible, remote-friendly work environment across 20+ countries.
  • Healthcare – employer contributions towards your healthcare.
  • Equity in the company – stock options for new team members.
  • Time off – flexible time off in the US; generous elsewhere.
  • A $500 Home office setup for remote employees.
  • Global Gatherings – opportunities for company-wide offsites.
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’