Related skills
AWS, SQL, Python, distributed systems, Google Cloud Platform
📋 Description
- Continuously improve the reliability and performance of ClickHouse core.
- Improve metrics and alerts to detect issues before customer impact.
- Root-cause analysis and bug fixes from customer issues.
- Enhance incident response and post-mortem processes across outages.
- Plan and drive Chaos initiatives across engineering teams.
- Manage on-call processes to coordinate escalation and resolution.
🎯 Requirements
- Bachelor’s or Master’s in Computer Science or related field.
- 5+ years in Reliability Engineering, QA or customer-facing roles.
- Experience operating ClickHouse or other SQL databases in production.
- Strong understanding of distributed database internals and SQL.
- Scripting with Shell or Python; able to read C++ code.
- Cloud experience with AWS, Azure, or Google Cloud Platform.
🎁 Benefits
- Flexible work environment; remote-friendly globally.
- Healthcare with employer contributions.
- Equity in the company; stock options.
- Time off: flexible in the US, generous elsewhere.
- A $500 home office setup for remote employees.
- Global gatherings; company-wide offsites.