Orlando, FL

Site Reliability Engineer

Talener

Job Title: Site Reliability Engineer (SRE)
Location: Remote (U.S.)

Overview
A fast-growing healthcare technology organization is seeking a Site Reliability Engineer (SRE) to help scale and support a high-impact cloud platform focused on improving healthcare delivery nationwide. This role will play a critical part in strengthening platform reliability, operational efficiency, observability, and automation across production environments.
The ideal candidate is passionate about infrastructure stability, incident response, automation, and continuous improvement within modern cloud-native environments.

Key Responsibilities
  • Ensure the reliability, scalability, performance, and security of cloud-based infrastructure and applications 
  • Monitor, troubleshoot, and resolve production platform and application issues across distributed systems 
  • Lead incident response efforts, root cause analysis, and blameless post-mortems 
  • Build and maintain operational runbooks and automated remediation workflows 
  • Develop and enhance observability and telemetry solutions for proactive monitoring and alerting 
  • Collaborate closely with engineering, DevOps, QA, security, and operations teams to improve platform health and deployment processes 
  • Support infrastructure automation and configuration management initiatives 
  • Contribute to infrastructure-as-code (IaC) practices and CI/CD operational improvements 
  • Promote best practices around reliability engineering, incident management, and operational excellence 
  • Participate in an on-call rotation supporting production systems, including occasional off-hours support for West Coast operations 
Required Qualifications
  • 5+ years of experience in Site Reliability Engineering, DevOps, Cloud Infrastructure, or related disciplines 
  • Strong experience troubleshooting and supporting production environments 
  • Hands-on experience with observability and monitoring platforms such as Datadog, New Relic, or similar tools 
  • Experience working within Azure-based cloud environments and modern containerized infrastructure 
  • Knowledge of Docker, Kubernetes, and cloud-native application hosting environments 
  • Experience with infrastructure-as-code tools such as Terraform, Terragrunt, or OpenTofu 
  • Strong scripting and automation experience using PowerShell, Python, JavaScript, or similar languages 
  • Experience with source control and CI/CD tooling (Git, Azure DevOps, etc.) 
  • Understanding of cloud security principles, compliance frameworks, and operational best practices 
  • Strong collaboration and communication skills within Agile engineering environments 
Preferred Qualifications
  • Experience improving operational visibility through telemetry, dashboards, reports, and alerting systems 
  • Experience evolving incident response processes and operational tooling 
  • Passion for mentoring others and promoting operational excellence across teams 
  • Strong problem-solving mindset with a focus on continuous improvement and automation 
Additional Details
  • Opportunity to work on mission-driven technology with meaningful real-world impact 
  • Collaborative engineering culture focused on innovation, reliability, and continuous learning 
  • Flexible environment that supports work-life balance while maintaining operational excellence 
Compensation: $110k – $130k + $10k Bonus

If interested/qualified, please email Ryan Dwyer at rdwyer@talener.com

#LI-Remote
 

{“@context”:”http://schema.org”,”@type”:”JobPosting”,”baseSalary”:null,”datePosted”:”2026-05-21″,”validThrough”:”2027-05-21″,”description”:”<div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;">Job Title:&nbsp;Site Reliability Engineer (SRE)<br fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;">Location: Remote (U.S.)</div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><br fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;">Overview<br fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;">A fast-growing healthcare technology organization is seeking a Site Reliability Engineer (SRE) to help scale and support a high-impact cloud platform focused on improving healthcare delivery nationwide. This role will play a critical part in strengthening platform reliability, operational efficiency, observability, and automation across production environments.</div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;">The ideal candidate is passionate about infrastructure stability, incident response, automation, and continuous improvement within modern cloud-native environments.</div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><br fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;">Key Responsibilities</div><ul fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit; margin-top: 0px; margin-bottom: 10px;" type="disc"><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Ensure the reliability, scalability, performance, and security of cloud-based infrastructure and applications&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Monitor, troubleshoot, and resolve production platform and application issues across distributed systems&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Lead incident response efforts, root cause analysis, and blameless post-mortems&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Build and maintain operational runbooks and automated remediation workflows&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Develop and enhance observability and telemetry solutions for proactive monitoring and alerting&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Collaborate closely with engineering, DevOps, QA, security, and operations teams to improve platform health and deployment processes&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Support infrastructure automation and configuration management initiatives&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Contribute to infrastructure-as-code (IaC) practices and CI/CD operational improvements&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Promote best practices around reliability engineering, incident management, and operational excellence&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Participate in an on-call rotation supporting production systems, including occasional off-hours support for West Coast operations&nbsp;</li></ul><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;">Required Qualifications</div><ul fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit; margin-top: 0px; margin-bottom: 10px;" type="disc"><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">5+ years of experience in Site Reliability Engineering, DevOps, Cloud Infrastructure, or related disciplines&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Strong experience troubleshooting and supporting production environments&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Hands-on experience with observability and monitoring platforms such as Datadog, New Relic, or similar tools&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Experience working within Azure-based cloud environments and modern containerized infrastructure&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Knowledge of Docker, Kubernetes, and cloud-native application hosting environments&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Experience with infrastructure-as-code tools such as Terraform, Terragrunt, or OpenTofu&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Strong scripting and automation experience using PowerShell, Python, JavaScript, or similar languages&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Experience with source control and CI/CD tooling (Git, Azure DevOps, etc.)&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Understanding of cloud security principles, compliance frameworks, and operational best practices&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Strong collaboration and communication skills within Agile engineering environments&nbsp;</li></ul><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;">Preferred Qualifications</div><ul fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit; margin-top: 0px; margin-bottom: 10px;" type="disc"><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Experience improving operational visibility through telemetry, dashboards, reports, and alerting systems&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Experience evolving incident response processes and operational tooling&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Passion for mentoring others and promoting operational excellence across teams&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Strong problem-solving mindset with a focus on continuous improvement and automation&nbsp;</li></ul><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;">Additional Details</div><ul fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit; margin-top: 0px; margin-bottom: 10px;" type="disc"><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Opportunity to work on mission-driven technology with meaningful real-world impact&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Collaborative engineering culture focused on innovation, reliability, and continuous learning&nbsp;</li><li fr-original-style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px;" style="line-height: 116%; font-family: Tahoma, Geneva, sans-serif; font-size: 14px; box-sizing: border-box;">Flexible environment that supports work-life balance while maintaining operational excellence&nbsp;</li></ul><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;">Compensation: $110k – $130k + $10k Bonus</div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><br fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"></div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;">If interested/qualified, please email Ryan Dwyer at rdwyer@talener.com</div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;"><br fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;">#LI-Remote</div><div fr-original-style="" style="box-sizing: border-box; font-family: inherit; font-size: inherit;">&nbsp;</div>”,”employmentType”:”FULL_TIME”,”hiringOrganization”:{“@type”:”Organization”,”name”:”Talener”},”jobLocation”:{“@type”:”Place”,”address”:{“@type”:”PostalAddress”,”streetAddress”:null,”addressLocality”:”Orlando”,”addressRegion”:”FL”,”postalCode”:null,”addressCountry”:null}},”title”:”Site Reliability Engineer”,”url”:”https://talener.com/jobs/?cjobid=MM444738519&rpid=1628011&postid=t5ZhSXAp7bM”,”identifier”:{“@type”:”PropertyValue”,”name”:”Talener”,”value”:null}}

We take a direct path to technology staffing success.