Skip to main content
Posted 06 June, 2026

Databricks COE Lead

ExlService Holdings, Inc.
Hyderabad, Telangana, India Full Time
Reference: 218_689623_14859

Key Role & Responsibilities

CoE Leadership & Strategy

  • Establish and lead the Databricks CoE: define vision, roadmap, governance, and operating model.
  • Build reusable frameworks, accelerators, and solution blueprints for scalable lakehouse adoption.
  • Drive standardisation of architecture patterns, coding practices, and delivery methodologies.
  • Partner with sales, solutioning, and delivery teams to enable Databricks-led transformations across clients.

Architecture & Solutioning

  • Lead end-to-end Databricks Lakehouse architecture (bronze/silver/gold layers, medallion architecture).
  • Translate business requirements into customised data and AI solutions (batch, streaming, ML, GenAI).
  • Define best practices for ingestion, transformation, orchestration, and consumption layers.
  • Architect enterprise-scale solutions integrating Databricks with cloud ecosystems (Azure/AWS/GCP).

Data Engineering & Platform Enablement

  • Design and implement scalable data pipelines using Spark, Delta Lake, and Databricks workflows.
  • Establish CI/CD, DevOps, and DataOps practices for Databricks platforms.
  • Define data modelling approaches (lakehouse models, marts, feature engineering layers).
  • Build accelerators for ingestion, transformation, and ML/GenAI pipelines.

AI / GenAI & Advanced Analytics Enablement

  • Drive adoption of ML, MLOps, and GenAI capabilities on Databricks (MLflow, Feature Store, Mosaic AI).
  • Enable use cases such as intelligent automation, conversational analytics, and agentic workflows.
  • Define patterns for integrating LLMs, vector databases, and knowledge retrieval architectures.

Governance, Security & Performance

  • Define governance models including data lineage, access control, and compliance frameworks.
  • Implement security best practices (RBAC, IAM, encryption, secrets management).
  • Establish performance and cost optimisation strategies across compute and storage layers.

Stakeholder Management & Delivery Governance

  • Lead client engagements: workshops, architecture reviews, solution roadmaps, and advisory.
  • Collaborate with business stakeholders to ensure solutions drive measurable outcomes.
  • Provide leadership across large delivery programmes and ensure quality and scalability.

People & Capability Building

  • Build and mentor a high-performing Databricks engineering and architecture team.
  • Conduct technical reviews, capability assessments, and skill development initiatives.
  • Drive capability building across Data Engineering, ML, and GenAI domains.

Must Have

  • 12+ years of experience with a strong background in Data Engineering, Big Data, and platform architecture.
  • 5+ years of hands-on experience with Databricks (Spark, Delta Lake, Lakehouse architecture).
  • Proven experience in leading CoE / practice / large-scale data platform initiatives.
  • Strong expertise in PySpark/Scala/SQL, distributed data processing, and large-scale pipeline design.
  • Experience with batch and streaming architectures (Kafka, Event Hubs, Structured Streaming, etc.).
  • Strong understanding of ML/AI ecosystem on Databricks (MLflow, Feature Store, MLOps).
  • Demonstrated customer-facing experience: solutioning, stakeholder discussions, and executive communication.
  • Strong leadership and mentoring capabilities.

Good to Have

  • Experience with GenAI / LLM ecosystems (OpenAI, Claude, vector DBs, RAG architectures).
  • Exposure to Databricks Mosaic AI, Unity Catalog, Delta Live Tables.
  • Cloud expertise on Azure/AWS/GCP (IAM, networking, storage, monitoring).
  • Knowledge of tools such as dbt, Airflow, ADF, Terraform.
  • Domain experience in BFSI / Insurance / Healthcare / Analytics services.

Sign up for Job Alerts