DevOps Engineer - AI, Data & Platforms
KKR's Gurugram office provides best-in-class services and solutions to our internal stakeholders and clients, drives organization-wide process efficiency and transformation, and reflects KKR's global culture and values of teamwork and innovation. The office will contain multifunctional business capabilities and will be integral in furthering the growth and transformation of KKR.
TEAM OVERVIEW
KKR's Technology team operates as a strategic partner to the business, driving the tools, platforms, and capabilities that enable KKR's investment professionals and operational teams to work with greater speed, precision, and insight. The DevOps function sits within the Technology organization and is responsible for the reliability, automation, and developer experience of the platforms that support KKR's AI, data, and application teams.
POSITION SUMMARY
KKR is looking for a DevOps Engineer to join our Technology team in Gurugram. This role is responsible for the operational backbone of KKR's AI platform - owning CI/CD, cloud infrastructure, and the day-to-day reliability of the systems that AI Engineering, AI Infrastructure, and Data teams depend on. Reporting into the DevOps and Cloud Platform leadership team and working in close partnership with AI Infrastructure and AI Engineering, this role will build the paved roads that make secure, reliable, cost-efficient delivery the default across KKR's AI estate. The role is suited to an engineer who takes pride in automation, treats observability as a first-class concern, and is energized by the operational realities of running AI workloads at enterprise scale.
We operate on a 4-day in-office, 1-day flexible work arrangement.
ROLES AND RESPONSIBILITIES
- Own CI/CD, infrastructure-as-code (Terraform), and release pipelines across KKR's AI and data services.
- Manage cloud infrastructure, networking, secrets, and access controls in line with enterprise security and compliance standards.
- Build observability, alerting, and incident response capabilities into every platform service.
- Partner with AI Infrastructure and AI Engineering teams to streamline the path from commit to production.
- Drive cost visibility and optimization across cloud and GPU spend, partnering with Finance and Technology leadership.
- Establish operational standards - runbooks, on-call practices, post-incident reviews - that scale with the growth of KKR's AI footprint.
- Continuously improve developer experience by automating toil and removing friction from common engineering workflows.
QUALIFICATIONS
- 6+ years of experience in DevOps, SRE, or cloud platform engineering, ideally supporting data or AI workloads.
- Strong Python (or Go) skills for automation and tooling.
- Solid hands-on cloud experience across AWS, GCP, or Azure - equivalent to a strong mid-level practitioner.
- Fluency with Kubernetes, Terraform, GitHub Actions or ArgoCD, and modern observability stacks (Datadog, Prometheus, Grafana).
- Comfort supporting AI/ML workloads, including GPU instances, large model artifacts, and long-running training and inference jobs.
- Calm under pressure during incidents, with a strong bias toward automation over manual toil.
- Exceptional communication skills, with the ability to partner across engineering, security, and business stakeholders.
PREFERRED QUALIFICATIONS
- Experience operating cloud platforms in financial services or other regulated environments.
- Familiarity with model serving infrastructure, GPU scheduling, or ML-specific operational patterns.
- Exposure to FinOps practices and large-scale cloud cost optimization.
- Contributions to open-source DevOps or platform tooling.
- Advanced degree in a relevant field (Computer Science, Engineering, or equivalent).