Related skills
sre aws python reliability engineering clickhouse
Description
- Continuously improve the reliability and performance of ClickHouse core.
- Improve and create metrics and alerts in production to prevent customer impact.
- Investigate root causes from customer issues and submit bug reports and improvements.
- Enhance incident response processes and post-mortem analysis for core outages with Support and Cloud teams.
- Plan, enable, and drive Chaos initiatives across Engineering.
- Manage on-call processes and coordinate escalation to minimize customer impact.
Requirements
- Bachelor's or Master's degree in Computer Science or a related field.
- At least 5 years of experience in Reliability Engineering, QA, or customer-facing engineering.
- Previous experience operating ClickHouse or other SQL databases in production.
- Excellent understanding of distributed database internals and SQL; ClickHouse is a major plus.
- Scripting experience with Shell or Python; ability to read and understand C++ code.
- Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform.
Benefits
- Flexible, remote-friendly work environment across 20+ countries.
- Healthcare: employer contributions toward healthcare.
- Equity in the company: stock options for new hires.
- Time off: flexible time off in the US, generous elsewhere.
- A $500 home office setup for remote employees.
- Global gatherings: company-wide offsites.