This job is no longer available

The job listing you are looking has expired.
Please browse our latest remote jobs.

See open jobs →
← Back to all jobs

Principal Engineer - Observability

Added
26 days ago
Type
Full time
Salary
Not Specified

Use AI to Automatically Apply!

Let your AI Job Copilot auto-fill application questions
Auto-apply to relevant jobs from 300,000 companies

Auto-apply with JobCopilot Apply manually instead
Save job

Overview

CoreWeave is seeking a Principal Engineer - Observability to lead the design and implementation of the company’s observability platform across distributed GPU-accelerated infrastructure. This role focuses on delivering robust visibility, reliability, and performance insights for services and systems at scale.

Responsibilities

  • Design, implement, and maintain the observability architecture across production systems.
  • Instrument services with metrics, traces, and logs using industry standards (Prometheus, OpenTelemetry, Jaeger).
  • Define and manage SLIs/SLOs, and build dashboards and alerting to enable proactive reliability.
  • Collaborate with Platform, DevOps, and Engineering teams to drive improvements in performance and reliability.
  • Mentor and guide junior engineers in best practices for observability and incident response.
  • Champion scalable instrumentation and data quality across multi-region deployments.

Qualifications

  • 8+ years of software engineering or SRE experience, with a focus on observability and reliability.
  • Strong hands on experience with Prometheus, Grafana, OpenTelemetry, and distributed tracing (Jaeger or similar).
  • Deep knowledge of Kubernetes, Docker, and cloud environments (AWS, GCP, Azure).
  • Proficiency in one or more programming languages such as Go or Python.
  • Excellent problem solving, communication, and collaboration skills.

About CoreWeave

CoreWeave is a leading provider of GPU accelerated compute for AI, machine learning, and HPC workloads. We empower teams to run large scale workloads with speed and reliability.

What we offer

  • Competitive compensation and comprehensive benefits
  • Collaborative, fast paced engineering culture
  • Opportunities for professional growth and impact

Use AI to Automatically Apply!

Let your AI Job Copilot auto-fill application questions
Auto-apply to relevant jobs from 300,000 companies

Auto-apply with JobCopilot Apply manually instead
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to On site Engineering Jobs. Just set your preferences and Job Copilot will do the rest—finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs →