Posted 17 May, 2026
Site Reliability Lead Engineer
APN Consulting
Hyderabad
Full Time
Reference: 365_625625_26-07461
Role: Site Reliability Lead Engineering
Location: Pune & Hyderabad
Experience Required: 8+ years
Experience - 8+ (SRE Lead - Production Support) with Expertise in SPLUNK
Job Description:
Location: Pune & Hyderabad
Experience Required: 8+ years
Experience - 8+ (SRE Lead - Production Support) with Expertise in SPLUNK
Job Description:
- Lead reliability, availability, and performance engineering for large-scale production systems.
- Architect and automate infrastructure using IaC tools and advanced CI/CD pipelines.
- Develop and operate highly scalable environments across AWS/Azure/GCP.
- Implement and optimize observability stacks using Splunk, Elastic, Grafana, Honeycomb or similar tools.
- Establish and refine SLIs/SLOs while driving a data-driven reliability culture.
- Own incident response, post-mortems, and long-term preventive engineering.
- Enhance system resilience through chaos testing, auto-healing mechanisms, and advanced automation.
- Collaborate with engineering teams to design fault-tolerant, maintenance architectures.
- Strengthen infrastructure security, compliance, and cost management practices.
- Mentor engineers and champion SRE best practices across teams.