Sr. Staff Software Engineer (Reliability)

Added
2 days ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

java postgresql python kubernetes go

πŸ“‹ Description

  • Lead the Service Platform Automation team for fleet lifecycle at scale.
  • Work hybrid in San Jose, CA; report to VP of Engineering.
  • Transform legacy scripts into a Temporal-based orchestration platform.
  • Scale the platform and drive AI SRE practices for self-healing.
  • Collaborate to deliver reliable, explainable fleet actions.

🎯 Requirements

  • BS or MS in Computer Science with 10+ years in hyperscale systems.
  • Mastery of backend languages: Go, Java, Python, or others.
  • Experience designing and operating complex distributed systems.
  • Expertise building automation using REST APIs and Swagger with idempotency.
  • Expertise in hybrid infrastructure across AWS/GCP/GKE and on-prem with CI/CD safety.
  • Proficiency in PostgreSQL, SQL development and schema management.
  • Experience building AI-enabled tooling for triage and automation.
  • Experience testing orchestration systems with determinism and chaos testing.

🎁 Benefits

  • Various health plans
  • Time off for vacation and sick time
  • Parental leave options
  • Retirement options
  • Education reimbursement
  • In-office perks
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’