Posted 12 June, 2026
Azure Data Lake Engineer
ClifyX
India
Full Time
Reference: 365_594563_26-04003
| A new Job Posting has been submitted. Please respond as soon as possible. If you are unable to submit a qualified Job Seeker, please decline immediately. |
| The following URL will take you to the Details page: Click here If you are having trouble with the link above, please copy and paste the URL below into your browser's address bar: https://www.us.fieldglass.cloud.sap/job_posting_detail.do?id=z26050620585101970031208&buyerCode=INFYSY&ticb=INFYSY&docobj=yes |
Details |
|
Job Posting IDINFYSYJP00004960 |
Job Posting Title564777 - Azure Data Lake Engineer |
DescriptionJob Title:Job Summary : We are looking for a Data Engineer who is passionate about building scalable, secure, and high-performance data architectures. In this role, you will be the primary architect and developer for our Azure Data Lake ecosystem. Your expertise in Scala and Apache Spark will be critical in transforming massive raw datasets into actionable data products for our analytics Responsibilities: • Data Lake Architecture: Design and implement ADLS Gen2 storage structures (Bronze/Silver/Gold layers) ensuring optimal partitioning and security. • Pipeline Development: Build and maintain complex ETL/ELT pipelines using Scala on Azure Databricks or HDInsight. • Performance Tuning: Optimize Spark jobs written in Scala to ensure low latency and cost-effective compute usage. • Data Orchestration: Use Azure Data Factory (ADF) to schedule and monitor end-to-end workflows. • Streaming & Batch: Develop real-time data ingestion pipelines using Azure Event Hubs or Kafka integration. • Governance & Security: Implement role-based access control (RBAC), encryption, and data masking using Microsoft Purview . Must-Have Skills / Requirements: • 7+ years overall backend development experience in production environments • Primary Language: Advanced proficiency in Scala (Functional Programming, Spark APIs). • Azure Big Data Stack: Expert knowledge of Azure Data Lake Storage Gen2 , Databricks, and Data Factory. • Distributed Computing: Deep understanding of Apache Spark (Data Frames, Datasets, Spark SQL, and optimization techniques like Caching and Broadcast joins). • SQL Mastery: Ability to write and optimize complex SQL queries for Synapse or Azure SQL. • DevOps: Experience with Azure DevOps , CI/CD pipelines, and version control (Git). • Data Modeling: Familiarity with Star/Snowflake schemas |
Job Posting Start Date01/05/2026 |
Job Posting End Date31/10/2026 |
Practice Unit (PU)ENGCDPE |
SiteNot Applicable |
LocationNot Applicable |
Posting Information |
|
Job Posting OwnerSarath Krishna |
Comments |
|
Comments To Supplier- |