Senior Site Reliability Engineer

Zipdev

Posted 3 days ago

We're hiring a Senior SRE based in Latin America to work alongside our US-based engineering team, building out observability, on-call coverage, and deployment automation for a client with strict compliance requirements. We're specifically looking for someone with deep Azure expertise (see Requirements below).

This is a full-time role with real production ownership, not a support or ticket-queue position.

What You'll Do

  • Support observability tooling implementation (Datadog and/or Azure Monitor/App Insights) and help build SLO definitions, alert rules, and synthetic checks
  • Participate in a PagerDuty on-call rotation, including escalation handling and incident documentation
  • Build and maintain operational runbooks for incident response, rollback, and recovery scenarios
  • Contribute to deployment automation work (blue/green or canary patterns) and Infrastructure as Code
  • Work across Azure SQL and Cosmos DB environments, supporting performance and cost optimization initiatives
  • Collaborate closely with US-based engineers during overlapping working hours

Requirements

  • 5+ years in SRE, DevOps, or cloud infrastructure roles
  • Strong hands-on experience with Microsoft Azure (Azure SQL, Cosmos DB, Container Apps, App Service)
  • Experience with observability tooling (Datadog, Azure Monitor, or similar) and on-call/incident response
  • Familiarity with Infrastructure as Code (Terraform preferred)
  • Strong written and spoken English; you'll be in daily communication with US-based team members and, at times, client stakeholders
  • Availability with meaningful overlap with US Eastern or Mountain time zones
  • Experience working in HIPAA-regulated environments, including handling PHI under a Business Associate Agreement (BAA) and working within least-privilege, audited access controls
  • Willingness to complete a healthcare-industry-standard background check prior to production access

On-Call Expectations

  • This role includes participation in a pager-based on-call rotation via PagerDuty, covering SEV-1/SEV-2 incidents on a shared schedule with the SRE team. This is a core, required part of the role, not an occasional ask.

Benefits

  • Work remotely
  • Vacation: 10 business days a year
  • Holidays: 5 National Holidays a year
  • Company Holidays: 5 Company Holidays a year (Christmas Eve, Christmas Day, New Year's Eve, New Year's Day, Zipdev Day)
  • Parental Leave
  • Health Care Reimbursement
  • Active Lifestyle Reimbursement
  • Quarterly Home Office Reimbursement
  • Payroll Deduction Purchase Plans
  • Longevity Bonus
  • Continuous Learning Bonus
  • Access to Training and Professional Development Platforms
  • Did we mention it's REMOTE?!!

One of our core values at Zipdev is "Be authentic." That's why we encourage you to answer the application form in your own words; we are interested in getting to know you, not a digital assistant.

Job details

Workplace

Hybrid

Location

Brazil

Experience

SE
