Posted 17 April, 2026

Cloud Data Engineer

Jefferies

Pune, IN, 411001 | Full Time

Reference: 218_611037_4089

Innovation Hub Overview

Jefferies is creating a Technology Innovation Hub in Pune, a greenfield opportunity to build the systems that power global markets. As our first India technology center, this hub brings together hands-on builders who engineer the platforms behind Jefferies' growth across capital markets, investment banking, and institutional securities. We're scaling toward an elite team of 500 engineers while maintaining the agility, ownership, and meritocratic spirit that define Jefferies. From cloud and data to AI, risk, and core business technologies, teams in Pune will lead high-impact work with a global mandate.

Team Overview

IT Infrastructure Technology

Jefferies' IT Infrastructure Technology team builds and runs the core technology backbone that enables the firm to operate globally. It covers enterprise infrastructure engineering and operations while driving modernization initiatives such as Cloud Adoption and Network Resiliency. The group supports critical platforms including networking, end-user computing, cloud, databases, server/Unix engineering, communications, and global data centers (including low-latency colocations near exchanges).

Role summary

Build and operate scalable, reliable data pipelines and infrastructure on AWS to power analytics, reporting, and data-driven decision-making. As a Cloud Data Engineer, you will design and implement data ingestion, transformation, and orchestration workflows, optimize data storage and processing, and ensure data quality and governance. You will partner with Analytics, Data Platform, ML Engineering, and business stakeholders to deliver high-quality data products that meet business needs.

You will collaborate with Analytics Engineers, Data Analysts, ML Engineers, Data Platform Engineers, the Cloud Security and Database teams, and business stakeholders across all divisions. Strong teamwork, a customer service orientation, and the ability to translate business requirements into technical solutions are essential. Experience working in Agile teams using Jira and Confluence is expected.

Key responsibilities

Design, build, and maintain scalable data pipelines and ETL/ELT workflows on AWS using services such as Glue, EMR, Lambda, Step Functions, Kinesis, and S3
Implement data ingestion from diverse sources (databases, APIs, streaming platforms, third-party providers); ensure reliability, performance, and error handling
Develop and optimize data transformation logic using SQL, Python, PySpark, or similar frameworks; ensure data quality, consistency, and lineage
Design and implement data storage solutions on AWS: S3 data lakes, Redshift data warehouses, DynamoDB, RDS, and Aurora; optimize for cost, performance, and access patterns
Build and maintain Infrastructure as Code (Terraform) for data infrastructure; follow team standards for modules, state management, and Terraform Enterprise workflows
Implement CI/CD pipelines for data workflows using GitHub, Bamboo, GitLab, or similar tools; ensure automated testing, deployment, and monitoring
Establish data governance and security controls: encryption at rest and in transit, IAM policies, data classification, audit logging, and compliance with regulatory requirements
Collaborate with Analytics Engineers, Data Analysts, and ML Engineers to understand data requirements and deliver datasets optimized for downstream consumption
Monitor and troubleshoot data pipeline performance, failures, and data quality issues; implement proactive alerting and remediation
Partner with Cloud Architecture, Cloud Security, and Database teams to ensure data infrastructure aligns with enterprise standards and best practices
Document data pipelines, data models, and operational procedures; contribute to team knowledge base
Drive automation and toil reduction; leverage GenAI and agentic workflows to improve engineering productivity
Proficiency using GenAI assistants (ChatGPT, Claude, GitHub Copilot) for SQL generation, pipeline code development, troubleshooting, and documentation
Ability to implement AI agent-driven automation (agentic workflows) for data engineering tasks (e.g., data quality checks, anomaly detection, pipeline optimization) with enterprise safety controls: human-in-the-loop validation, comprehensive logging and auditability, guardrails to prevent data corruption, rollback mechanisms, and secure credential handling
Proactive mindset to identify data engineering toil; ship automation that measurably reduces manual work and improves pipeline reliability
Monitor data pipeline health and SLAs; respond to failures and data quality incidents promptly
Participate in on-call rotation as needed for critical data workflows
Conduct post-incident reviews for data pipeline failures; track corrective actions to completion
Maintain runbooks and operational documentation for data infrastructure
Continuously improve pipeline performance, cost efficiency, and data quality

Requirements

7+ years in data engineering, data platform, or analytics engineering roles with strong cloud focus
Deep expertise in AWS data services: S3, Glue, EMR, Redshift, Athena, Lambda, Kinesis, Step Functions, and Lake Formation
Strong programming skills in Python and SQL; experience with PySpark or similar big data frameworks
Proficiency building ETL/ELT pipelines at scale; experience with data orchestration tools (Airflow, Step Functions, Prefect)
Solid understanding of data modelling, data warehousing, and dimensional design (star schema, snowflake schema)
Experience with Infrastructure as Code (Terraform) and CI/CD pipelines for data infrastructure
Strong understanding of data governance, security, and compliance best practices
Familiarity with streaming data platforms (Kafka, Kinesis, MSK) is a plus
Excellent problem-solving skills and attention to data quality and reliability
Strong communication and collaboration skills; ability to work with technical and non-technical stakeholders

Preferred

AWS Certified Data Analytics or Big Data Specialty certification
Experience with Snowflake or Databricks for data processing and analytics
Familiarity with dbt (data build tool) for transformation workflows
Background in software engineering or site reliability engineering
Experience with data catalog and metadata management tools (AWS Glue Catalog, Collibra, Alation)
Knowledge of DataOps practices and data quality frameworks

We have been made aware of bad actors who falsely claim to be associated with Jefferies Group, solicit individuals to attend virtual job interviews or complete online tests or courses, and send fictitious employment offer letters.

Please note that any email contact with Jefferies personnel will come from an "@jefferies.com" email address. Further, Jefferies will not notify shortlisted candidates through social media platforms (e.g. WhatsApp or Telegram) or ask candidates to make payment to participate in the hiring process.

Jefferies is a leading global, full-service investment banking and capital markets firm that provides advisory, sales and trading, research, and wealth and asset management services. With more than 40 offices around the world, we offer insights and expertise to investors, companies, and governments.

At Jefferies, we believe that diversity fosters creativity, innovation and thought leadership through the infusion of new ideas and perspectives. We have made a commitment to building a culture that provides opportunities for all employees regardless of our differences and supports a workforce that is reflective of the communities where we work and live. As a result, we are able to pool our collective insights and intelligence to provide fresh and innovative thinking for our clients.

Jefferies is an equal employment opportunity employer, and takes affirmative action to ensure that all qualified applicants will receive consideration for employment without regard to race, creed, color, national origin, ancestry, religion, gender, pregnancy, age, physical or mental disability, marital status, sexual orientation, gender identity or expression, veteran or military status, genetic information, reproductive health decisions, or any other factor protected by applicable law. We are committed to hiring the most qualified applicants and complying with all federal, state, and local equal employment opportunity laws. As part of this commitment, Jefferies will extend reasonable accommodations to individuals with disabilities, as required by applicable law.

This listing expired on 17 May. Applications are no longer accepted.
