Related skills
sre cloud aws postgresql pythonπ Description
- Continuously improve the reliability and performance of ClickHouse core.
- Create metrics and alerts to detect issues before customers are affected.
- Investigate root causes, submit bug fixes, and propose improvements.
- Refine incident response and post-mortem analysis for core outages.
- Plan and drive Chaos initiatives across Engineering teams.
- Manage on-call processes and escalation to minimize customer impact.
π― Requirements
- Bachelor's or Master's in Computer Science or related field
- 5+ years in Reliability Engineering, QA or customer-facing engineering
- Experience operating ClickHouse or other SQL databases in production
- Strong understanding of distributed DB internals and SQL; ClickHouse a plus
- Scripting with Shell or Python; ability to read C++ code
- Knowledge of cloud platforms such as AWS, Azure, or Google Cloud Platform
π Benefits
- Flexible work environment; remote-friendly
- Healthcare; employer contributions
- Equity in the company; stock options
- Flexible time off; US vs other countries
- $500 home office setup; remote employees
- Global gatherings; company-wide offsites
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!