Location
Type: Full time
Related skills: Docker, Ansible, Terraform, AWS, Python
Description
- Collaborate with engineering teams to design scalable, secure, and highly available systems.
- Establish and manage SLOs and SLAs for ClickHouse Cloud.
- Ensure monitoring and alerting across all components to detect incidents.
- Improve incident response processes and post-mortem analysis with the support team.
- Continuously improve reliability and performance of ClickHouse services.
- Plan and drive chaos engineering initiatives across Engineering teams.
Requirements
- Bachelor's or Master's degree in Computer Science or a related field.
- At least 8 years of experience in Site Reliability Engineering.
- Previous experience using ClickHouse in production.
- Hands-on experience with Go and/or Python.
- Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
- Hands-on experience with Kubernetes or Docker Swarm.
Benefits
- Flexible, remote-friendly work environment across 20+ countries.
- Healthcare: employer contributions towards your healthcare.
- Equity in the company: stock options for new team members.
- Time off: flexible time off in the US; generous elsewhere.
- A $500 home office setup for remote employees.
- Global Gatherings: opportunities for company-wide offsites.