Related skills
sre security rust python observability
📋 Description
- Design and operate infra powering connector sync, indexing, retrieval at scale.
- Build control plane primitives: rollout, config, permissions, policy enforcement, kill switches.
- Own reliability: SLOs, monitoring, incident response, postmortems, on-call.
- Build guardrails for safe multi-tenant execution: isolation, secrets, rate limits.
- Partner with security/compliance to meet enterprise requirements (auditability, least privilege).
- Improve developer velocity via internal tooling: local dev, canary environments, load testing.
🎯 Requirements
- 5+ years in infra/SRE/platform roles at tech/product companies.
- Strong distributed systems fundamentals: availability, latency, resilience.
- Experience building/operating services with demanding uptime and scale requirements; multi-region a plus.
- Proficient in backend languages (Python, Rust) and systems concerns (networking, storage, queues).
- Deep familiarity with observability (metrics/logs/tracing), incident management, and reliability practices.
- Comfortable navigating ambiguous problems and shipping pragmatic solutions.
- Interest in AI/ML is a plus, not required.