Linux Site Reliability Engineer

Added: 4 days ago
Type: Full time
Salary: Not provided

Related skills

Go, Ansible, Terraform, Linux, Prometheus

📋 Description

  • Install, manage, and scale Kubernetes and RKE clusters using Ansible and Terraform.
  • Collaborate with SpaceX engineers to deploy and support Kubernetes-based platforms.
  • Own processes and tools to improve reliability and reduce toil.
  • Recommend and implement changes using change control processes.
  • Work with internal teams to design solutions and resolve issues.
  • Define standards for systems design, testing and implementation.

🎯 Requirements

  • Bachelor’s degree in computer science or another STEM field and 3+ years of systems engineering experience; or 5+ years of experience in lieu of a degree.
  • Experience deploying Linux servers in physical/virtualized environments (VMware via automation).
  • Experience with the Linux shell and with configuring Linux instances (kernel modules, PKI, iptables).
  • Experience supporting and scaling containerized applications in Linux environments.
  • Experience using automation tools (Ansible, Terraform) to manage infrastructure lifecycles and Kubernetes installations.
  • Expertise in scalable architectures with high availability, monitoring and performance tuning.