Site Reliability Engineer

Added: 28 days ago
Type: Full time

Related skills

Java, Grafana, Prometheus, REST, OpenTelemetry

📋 Description

  • OTel Orchestration: Implement OpenTelemetry instrumentation across Java and C# services.
  • API Reliability: Test integrations with social APIs; set up monitoring and alerts.
  • Health & Performance: Build Grafana dashboards and alerts for the Golden Signals in JVM and .NET services.
  • Infrastructure as Code: Perform root cause analysis (RCA) and implement remediations as code.
  • Distributed Tracing: Use traces to optimize cross-service data paths.
  • Full-Stack Troubleshooting: Debug from Java/C# to network and cloud resources.
  • Capacity Planning: Analyze usage to plan scaling and cloud spend.
  • Salary: $110,000 - $130,000 per year, plus 5% bonus.

🎯 Requirements

  • Java or C# proficiency.
  • OpenTelemetry experience.
  • Grafana/Prometheus/Tempo/Loki or Jaeger experience.
  • REST APIs, OAuth, rate-limiting, and webhooks.
  • Understanding how Java/C# apps interact with infrastructure.
  • Self-healing mindset in a high-velocity environment.
  • Education: B.S. in CS or equivalent.
  • Nice to have: Affiliate & Partnerships Fundamentals Certification (PXA).

🎁 Benefits

  • Extended health, vision, dental, virtual care.
  • Life insurance, disability coverage, and Health Care Spending Account.
  • Flexible PTO and work-life balance.
  • Restricted Stock Units (RSUs) with 3-year vesting.
  • Coursera access and PXA courses for growth.
  • Parental leave: 26 weeks primary, 13 weeks secondary.