Posted 07 June, 2026
Lead Cloud Data Engineer
ExlService Holdings, Inc.
Gurugram, Haryana, India
Full Time
Reference: 218_689623_15257
We are looking for an accomplished Lead Data Engineer to drive the architectural design, development, and optimization of our enterprise-scale data ecosystem. In this senior role, you will spearhead the build-out of highperformance data pipelines, enforce rigorous data quality and validation frameworks, and ensure endtoend reliability, scalability, and integrity of data flows across the organization. You will play a pivotal role in shaping our data engineering strategy, enabling advanced analytics and missioncritical, datadriven decision-making within our insurance-focused business domains.
- 8-12 years experience on Data Engineering role working with Databricks & Azure Cloud technologies.
- Bachelor's degree in computer science, Information Technology, or related field.
- Strong proficiency in PySpark, Python, SQL.
- Strong experience in data modeling, ETL/ELT pipeline development, and automation
- Hands-on experience with performance tuning of data pipelines and workflows
- Proficient in working on Azure cloud components Azure Data Factory, Azure DataBricks, Azure Data Lake etc.
- Experience with data modeling, ETL processes, Delta Lake and data warehousing.
- Experience on Delta Live Tables, Autoloader & Unity Catalog.
- Preferred - Knowledge of the insurance industry and its data requirements.
- Strong analytical skills with the ability to collect, organize, analyze, and disseminate significant amounts of information with attention to detail and accuracy.
- Excellent communication and problem-solving skills to work effectively with diverse teams
- Excellent problem-solving skills and ability to work under tight deadlines.
- Lead the design and implementation of scalable data architectures on Azure, leveraging Databricks, Delta Lake, and related Azure data services.
- Build and optimize highvolume ETL/ELT pipelines using PySpark, SQL, Databricks Workflows, and Delta Live Tables.
- Define and enforce best practices for data engineering across notebooks, jobs, CI/CD, Unity Catalog, security, and workspace governance.
- Integrate and orchestrate data pipelines using Azure Data Factory, Azure Synapse pipelines, or Azure Databricks Workflows.
- Drive performance tuning for Spark jobs, cluster configurations, and data storage layers to balance speed and cost efficiency.
- Implement robust data quality and validation frameworks using tools like Great Expectations, DLT expectations, or custom PySpark checks.
- Ensure compliance, governance, and lineage tracking through Unity Catalog, Purview integration, and RBAC/ABAC policies.
- Architect endtoend data solutions supporting analytics, ML, actuarial models, and business-critical reporting workloads.
- Evaluate new Databricks features, MLflow enhancements, Photon execution, serverless compute, and recommend adoption strategies.
- Familiar with working on Agile methodologies - scrum, sprint planning, backlog refinement etc.