Role Overview
Staff Site Reliability Engineer at Nearsure. This is a remote role within Latin America. You will design, build, and maintain scalable, highly available services and infrastructure, collaborating with global teams to ensure reliability and performance.
About Nearsure
Nearsure is hiring for a Staff/Senior level SRE to help scale and secure our platforms. This is a remote opportunity.
Responsibilities
- Design, implement, and maintain scalable, highly available services and infrastructure
- Lead incident response and post-incident reviews
- Automate reliability tasks and reduce toil through tooling and CI/CD integration
- Improve observability with metrics, logs, traces, dashboards
- Define and enforce SLOs/SLIs, participate in capacity planning
- Mentor and coach junior engineers
- Collaborate with product, platform, and security teams
Requirements
- 6+ years of Site Reliability Engineering, Platform Engineering, or equivalent
- Strong Linux knowledge and experience with Kubernetes
- Cloud experience (AWS, GCP, or Azure)
- Proficiency in scripting (Python, Bash) and IaC (Terraform, CloudFormation)
- Experience with monitoring and alerting (Prometheus, Grafana, Alertmanager)
- Excellent problem solving, communication, and collaboration skills
- Bachelor's degree in Computer Science or equivalent
Nice to have
- Experience in fintech or regulated domains
- Security practices, incident management, and disaster recovery planning
- Experience with CI/CD pipelines, cost optimization
Benefits
- Remote-friendly work environment
- Competitive compensation and benefits
- Career growth and mentoring opportunities
How to apply
Please apply through the Greenhouse posting: https://job-boards.greenhouse.io/nearsure/jobs/4716525007