Related skills
bigquery datadog terraform prometheus pythonπ Description
- Ensure near-zero downtime with monitoring, alerting, and self-healing automation
- Create automated, scalable systems using software and infrastructure principles
- Advise clients on DevOps and SRE practices, pipelines, HA, reliability, and debt
- Work with Google Cloud tech (GKE, Anthos, BigQuery) and tools like Prometheus, Datadog
- Use Python and Terraform to automate tasks and deploy infrastructure
- Collaborate with clients, your team, and Google engineers to resolve infra issues
π― Requirements
- 1-2 years cloud/infrastructure experience with Linux, Windows, k8s, databases, and networking
- 1+ years Google Cloud experience; certifications preferred but not required
- Proficiency in Python; other languages a plus
- Strong provisioning with Terraform
- Experience with 24x7x365 monitoring, incident response, and on-call support
- Experience troubleshooting across systems, networks, and code
- Experience negotiating SLIs/SLOs/SLAs with product owners
- Ability to work independently and in cross-team settings
- Experience with Agile Scrum or Kanban in SDLC
- Balancing service reliability, metrics, debt, and toil for live services at scale
- Strong communication skills; customer-facing
- Bachelor's degree in CS, EE, or equivalent
π Benefits
- Equal Opportunity employer; all qualified applicants considered
- Culture that supports innovation and professional growth
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!