Posted 17 May, 2026

Site Reliability Lead Engineer

APN Consulting
Hyderabad Full Time
Reference: 365_625625_26-07461

Role: Site Reliability Lead Engineer
Location: Pune & Hyderabad
Experience Required: 8+ years

Experience: 8+ years as SRE Lead (Production Support) with expertise in Splunk

Job Description:
  • Lead reliability, availability, and performance engineering for large-scale production systems.
  • Architect and automate infrastructure using IaC tools and advanced CI/CD pipelines.
  • Develop and operate highly scalable environments across AWS/Azure/GCP.
  • Implement and optimize observability stacks using Splunk, Elastic, Grafana, Honeycomb, or similar tools.
  • Establish and refine SLIs/SLOs while driving a data-driven reliability culture.
  • Own incident response, post-mortems, and long-term preventive engineering.
  • Enhance system resilience through chaos testing, auto-healing mechanisms, and advanced automation.
  • Collaborate with engineering teams to design fault-tolerant, maintainable architectures.
  • Strengthen infrastructure security, compliance, and cost management practices.
  • Mentor engineers and champion SRE best practices across teams.
