Added
8 days ago
Type
Full time
Salary
Salary not provided

Related skills

bigquery datadog terraform prometheus python

πŸ“‹ Description

  • Ensure near-zero downtime with monitoring, alerting, and self-healing automation
  • Create automated, scalable systems using software and infrastructure principles
  • Advise clients on DevOps and SRE practices, pipelines, HA, reliability, and debt
  • Work with Google Cloud tech (GKE, Anthos, BigQuery) and tools like Prometheus, Datadog
  • Use Python and Terraform to automate tasks and deploy infrastructure
  • Collaborate with clients, your team, and Google engineers to resolve infra issues

🎯 Requirements

  • 1-2 years cloud/infrastructure experience with Linux, Windows, k8s, databases, and networking
  • 1+ years Google Cloud experience; certifications preferred but not required
  • Proficiency in Python; other languages a plus
  • Strong provisioning with Terraform
  • Experience with 24x7x365 monitoring, incident response, and on-call support
  • Experience troubleshooting across systems, networks, and code
  • Experience negotiating SLIs/SLOs/SLAs with product owners
  • Ability to work independently and in cross-team settings
  • Experience with Agile Scrum or Kanban in SDLC
  • Balancing service reliability, metrics, debt, and toil for live services at scale
  • Strong communication skills; customer-facing
  • Bachelor's degree in CS, EE, or equivalent

🎁 Benefits

  • Equal Opportunity employer; all qualified applicants considered
  • Culture that supports innovation and professional growth
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’