Data Architect
Job Description
Job Description Data Architect — Databricks
Data Engineering & Pipelines | Mid-Level | Full-Time
Experience
5 – 8 Years ...
Job Description
Data Architect — Databricks
Data Engineering & Pipelines | Mid-Level | Full-Time
Experience
5 – 8 Years
Level
Mid-Level
Employment Type
Full-Time
Location
Pune - Hybrid
Primary Stack
Databricks, Apache Spark, Delta Lake, SQL
Domain
Data Engineering & Pipelines
About the Role
We are looking for a hands-on Data Architect with deep expertise in Databricks to design, build, and optimise enterprise-scale data platforms. You will own the end-to-end data engineering lifecycle — from ingestion and transformation to serving — while ensuring reliability, scalability, and governance across our lakehouse architecture.
You will collaborate closely with data engineers, analytics engineers, and product teams to translate business requirements into robust, reusable data solutions on the Databricks Lakehouse Platform.
Key Responsibilities
Data Architecture & Design
• Design and maintain the organisation's lakehouse architecture using Databricks and Delta Lake.
• Define data modelling standards (dimensional, Data Vault 2.0, or medallion architecture) across Bronze, Silver, and Gold layers.
• Architect scalable ingestion frameworks using structured and unstructured data sources (Kafka, JDBC, REST APIs, cloud storage).
• Own schema evolution strategy and ensure backward-compatibility across data assets.
Pipeline Development & Optimisation
• Build and maintain production-grade ETL/ELT pipelines using PySpark, Spark SQL, and Databricks Workflows.
• Implement Delta Live Tables (DLT) for declarative, auto-scaling pipeline development.
• Optimise Spark jobs for performance — partitioning, Z-ordering, caching, and cluster right-sizing.
• Establish CI/CD practices for data pipelines using tools such as GitHub Actions, Azure DevOps, or Databricks Asset Bundles.
Data Governance & Quality
• Implement Unity Catalog for data discovery, lineage tracking, fine-grained access control, and compliance.
• Define and enforce data quality rules using Great Expectations, DLT expectations, or equivalent frameworks.
• Work with data governance teams to document metadata, business glossary, and data contracts.
Platform & Infrastructure
• Manage Databricks workspace configuration: clusters, pools, secrets, and access policies.
• Collaborate with cloud and DevOps teams on infrastructure-as-code (Terraform) for Databricks on Azure / AWS / GCP.
• Monitor platform health, SLAs, and cost using Databricks system tables and cloud-native monitoring tools.
Collaboration & Mentorship
• Partner with data consumers (analysts, data scientists, ML engineers) to define SLAs and publish clean, well-documented data products.
• Review code and provide architectural guidance to junior engineers.
• Contribute to and champion internal data engineering best practices, runbooks, and documentation.
Required Skills & Experience
Core Databricks & Spark
• 4+ years of hands-on experience with Databricks (Unified Data Analytics Platform).
• Strong proficiency in PySpark and Spark SQL for large-scale data transformation.
• Deep knowledge of Delta Lake — ACID transactions, time travel, OPTIMIZE, VACUUM.
• Experience with Databricks Workflows, Jobs, and Delta Live Tables (DLT).
• Familiarity with Unity Catalog and Databricks governance features.
Data Engineering Fundamentals
• Solid understanding of data modelling paradigms: dimensional modelling, Data Vault, or medallion architecture.
• Experience designing and operating streaming pipelines (Structured Streaming, Kafka, Event Hubs, or Kinesis).
• Proficiency in SQL; experience with dbt is a strong plus.
• Hands-on experience with cloud platforms: Azure (ADLS, ADF), AWS (S3, Glue), or GCP (BigQuery, GCS).
Software Engineering Practices
• Version control with Git; experience with branching strategies and code review workflows.
• Ability to write testable, modular pipeline code with unit and integration tests.
• Familiarity with CI/CD pipelines and infrastructure-as-code (Terraform preferred).
Nice to Have
• Databricks Certified Data Engineer Associate or Professional certification.
• Experience with data mesh or data product frameworks.
• Exposure to ML pipelines, MLflow, or Feature Store on Databricks.
• Knowledge of data cataloguing tools (Alation, Collibra, or Databricks Unity Catalog).
• Experience with Apache Iceberg or Apache Hudi as alternative table formats.
• Familiarity with real-time analytics or OLAP systems (Druid, ClickHouse, Redshift).
What We Offer
• Competitive salary with performance-linked bonus.
• Flexible / hybrid working arrangements.
• Access to Databricks training and certification budget.
• Collaborative, engineering-first data culture with modern tooling.
• Clear career progression path to Senior Data Architect or Data Platform Lead.
• Comprehensive health, wellness, and retirement benefits.
Requirements
Data Architect — Databricks Data Engineering & Pipelines | Mid-Level | Full-Time Experience 5 – 8 Years Level Mid-Level Employment Type Full-Time Location Hybrid - Pune Primary Stack Databricks, Apache Spark, Delta Lake, SQL Domain Data Engineering & Pipelines About the Role We are looking for a hands-on Data Architect with deep expertise in Databricks to design, build, and optimise enterprise-scale data platforms. You will own the end-to-end data engineering lifecycle — from ingestion and transformation to serving — while ensuring reliability, scalability, and governance across our lakehouse architecture. You will collaborate closely with data engineers, analytics engineers, and product teams to translate business requirements into robust, reusable data solutions on the Databricks Lakehouse Platform. Key Responsibilities Data Architecture & Design • Design and maintain the organisation's lakehouse architecture using Databricks and Delta Lake. • Define data modelling standards (dimensional, Data Vault 2.0, or medallion architecture) across Bronze, Silver, and Gold layers. • Architect scalable ingestion frameworks using structured and unstructured data sources (Kafka, JDBC, REST APIs, cloud storage). • Own schema evolution strategy and ensure backward-compatibility across data assets. Pipeline Development & Optimisation • Build and maintain production-grade ETL/ELT pipelines using PySpark, Spark SQL, and Databricks Workflows. • Implement Delta Live Tables (DLT) for declarative, auto-scaling pipeline development. • Optimise Spark jobs for performance — partitioning, Z-ordering, caching, and cluster right-sizing. • Establish CI/CD practices for data pipelines using tools such as GitHub Actions, Azure DevOps, or Databricks Asset Bundles. Data Governance & Quality • Implement Unity Catalog for data discovery, lineage tracking, fine-grained access control, and compliance. • Define and enforce data quality rules using Great Expectations, DLT expectations, or equivalent frameworks. • Work with data governance teams to document metadata, business glossary, and data contracts. Platform & Infrastructure • Manage Databricks workspace configuration: clusters, pools, secrets, and access policies. • Collaborate with cloud and DevOps teams on infrastructure-as-code (Terraform) for Databricks on Azure / AWS / GCP. • Monitor platform health, SLAs, and cost using Databricks system tables and cloud-native monitoring tools. Collaboration & Mentorship • Partner with data consumers (analysts, data scientists, ML engineers) to define SLAs and publish clean, well-documented data products. • Review code and provide architectural guidance to junior engineers. • Contribute to and champion internal data engineering best practices, runbooks, and documentation. Required Skills & Experience Core Databricks & Spark • 4+ years of hands-on experience with Databricks (Unified Data Analytics Platform). • Strong proficiency in PySpark and Spark SQL for large-scale data transformation. • Deep knowledge of Delta Lake — ACID transactions, time travel, OPTIMIZE, VACUUM. • Experience with Databricks Workflows, Jobs, and Delta Live Tables (DLT). • Familiarity with Unity Catalog and Databricks governance features. Data Engineering Fundamentals • Solid understanding of data modelling paradigms: dimensional modelling, Data Vault, or medallion architecture. • Experience designing and operating streaming pipelines (Structured Streaming, Kafka, Event Hubs, or Kinesis). • Proficiency in SQL; experience with dbt is a strong plus. • Hands-on experience with cloud platforms: Azure (ADLS, ADF), AWS (S3, Glue), or GCP (BigQuery, GCS). Software Engineering Practices • Version control with Git; experience with branching strategies and code review workflows. • Ability to write testable, modular pipeline code with unit and integration tests. • Familiarity with CI/CD pipelines and infrastructure-as-code (Terraform preferred). Nice to Have • Databricks Certified Data Engineer Associate or Professional certification. • Experience with data mesh or data product frameworks. • Exposure to ML pipelines, MLflow, or Feature Store on Databricks. • Knowledge of data cataloguing tools (Alation, Collibra, or Databricks Unity Catalog). • Experience with Apache Iceberg or Apache Hudi as alternative table formats. • Familiarity with real-time analytics or OLAP systems (Druid, ClickHouse, Redshift). What We Offer • Competitive salary with performance-linked bonus. • Flexible / hybrid working arrangements. • Access to Databricks training and certification budget. • Collaborative, engineering-first data culture with modern tooling. • Clear career progression path to Senior Data Architect or Data Platform Lead. • Comprehensive health, wellness, and retirement benefits.
Below are some other jobs we think you might be interested in.
-
Data Architect
- Aptus Data Labs
- Bangalore North,Karnataka,India
Data Architect Location: Bangalore Job Type: Full Time Experience Level: Mid-Senior About the Role We are looking for an experienced Data Architect to...12 Jun -
Enterprise Data Architect
- Aptus Data Labs
- Bangalore,Karnataka,India
Enterprise Data Architect - Job Description Location: Bangalore Experience: 15+ years in Data Engineering & Data Platforms Employment Type: Full-time...12 Jun -
Snowflake Data Architect
- NTT DATA Services
- Bangalore,KA,IN
Req ID: 367467 NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an...22 May -
Lead Data Engineer/ Data Architect
- Enable Data Incorporated
- Hyderabad,Telangana,India
Enable Data Incorporated is seeking a highly skilled Senior Data Engineer with expertise in cloud technologies, Spark, and Databricks to join our...28 May -
Salesforce Data Cloud Architect
- NTT DATA Services
- Hyderabad,TG,IN
NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable,...07 Jun -
Solution Architect - Data Platform Modernization
- NTT DATA Services
- Bangalore,KA,IN
Roles & Responsibilities: Define the end-to-end migration strategy across: Own the target state architecture across data ingestion, storage,...09 Jun -
Data Architect
- Coforge
- Hyderabad, TG, IN
Job Description Coforge is Hiring: Data Architect – Oracle EBS to Workday\nLocations: Hyderabad / Greater Noida / Pune\nWork Mode: Hybrid (3 Days...16 Jun -
Data Architect
- METRO/MAKRO
- Pune,Maharashtra,India,411014
Company Description Metro Global Solution Center (MGSC) is internal solution partner for METRO, a 31.6 Billion international wholesaler...13 Jun -
Data Architect
- r3 Consultant
- Noida, Haryana, IN
Description r3 Consultant is looking for a Data Architect to join our dedicated team in Noida. This full-time, on-site position provides a unique...23 May -
Data Architect
- New Era Technology
- India
Job Title: Data Architect - Snowflake (Medical Devices / Healthcare)Mode: ContractDuration: 6 Months (Extendable)Work Timings: 3pm - 11.30pm ISTMode of...29 May -
Data Architect
- r3 Consultant
- Noida, UP, IN
Job DescriptionDescription\n\n r3 Consultant is looking for a Data Architect to join our dedicated team in Noida. This full-time, on-site position...24 May -
Data Architect
- Zenon Analytics Private limited
- Uttar Pradesh,Noida,India,201301
Sr. Data Architect / Data ArchitectExperience: 10+ YearsSummary:The Data Architect will design and implement data solutions that support the...12 Jun -
Data Architect
- WhiteLotus Talent Partners
- Bengaluru, KA, IN
Job Description Job Title- Data Architect Location- Bangalore, Pan India Mode- Onsite Job Type: Full-Time Overview We are looking for a highly...16 Jun -
29 May
-
Data Architect
- Innover Digital
- Bengaluru, KA, IN
Job Description Role: Data Architect Experience : 14–20 Years Location: Bangalore (Hybrid – minimum 3 days in office) About Innover Digital...12 Jun -
Data Architect
- R Systems
- Kanpur, UP, IN
Job Description About R Systems\nhttps://www.rsystems.com/\nR Systems is a Blackstone portfolio company and a leading digital transformation and...16 Jun -
Data Architect
- Tata Consultancy Services
- Bengaluru, KA, IN
Job Description Greetings from TCS Job Description for Database Architect – RDBMS & Non RDBMS Experience: 7–15 years Role Overview We are seeking...23 May -
Data Architect
- Artefact
- Pune, Maharashtra, India
Key Responsibilities Enterprise Data Strategy & Client Engagement: Develop and maintain a comprehensive data architecture strategy that aligns with...12 Jun -
Data Architect
- Persistent Systems
- Hyderabad, TG, IN
Job Description About Position: We are looking for a Data Architect with strong experience in data modelling and data warehousing, along with...04 Jun -
Data Architect
- Grid Dynamics
- Bengaluru, KA, IN
Job Description Total Experience- 12 Years - 18 Years Location - Bangalore (Preferred)/Chennai/Hyderabad Notice Period - Immediate - 15 Days Job...21 May