Senior Manager, Observability at 2K
Location: Austin, Texas, United States (onsite)
About 2K
2K is a leading video game publisher known for building immersive, high-quality gaming experiences. We seek a seasoned leader to shape the observability and reliability strategy across our live services and games.
Role Overview
The Senior Manager, Observability, will lead the observability program, drive incident response and reliability improvements, and mentor a team of engineers. You will partner with Platform, SRE, and Software Engineering teams to define instrumentation standards, implement scalable monitoring, and ensure our systems meet their reliability targets.
Responsibilities
- Define and implement the observability strategy, including metrics, tracing, logging, alerting, and incident response.
- Build and maintain scalable instrumentation and dashboards using modern tooling (e.g., Prometheus, Grafana, OpenTelemetry).
- Collaborate with SRE, Platform, and Software Engineering teams to improve system reliability and performance.
- Lead and grow a team of observability engineers; mentor and develop talent.
- Establish service level objectives and indicators (SLOs/SLIs), runbooks, and incident management processes.
Qualifications
- 7+ years of experience in observability, site reliability engineering, or a related field.
- Demonstrated leadership experience and the ability to manage and mentor teams.
- Strong knowledge of cloud platforms (AWS, GCP, Azure) and distributed systems.
- Hands-on experience with monitoring/observability stacks (Prometheus, Grafana, OpenTelemetry, Jaeger).
- Experience with containerization and orchestration (Docker, Kubernetes).
- Excellent communication and collaboration skills.
- Bachelor’s degree in Computer Science or a related field (or equivalent experience).
Benefits
- Competitive compensation and potential equity.
- Health, dental, and vision insurance.
- 401(k) with company match and financial planning resources.
- Generous paid time off and flexible work options.