Cloud Data Engineer
Innovation Hub Overview Jefferies is creating a Technology Innovation Hub in Pune, a greenfield opportunity to build the systems that power global...
Innovation Hub Overview
Jefferies is creating a Technology Innovation Hub in Pune, a greenfield opportunity to build the systems that power global markets. As our first India technology center, this hub brings together hands on builders who engineer the platforms behind Jefferies' growth across capital markets, investment banking, and institutional securities. We're scaling toward an elite team of 500 engineers while maintaining the agility, ownership, and meritocratic spirit that defines Jefferies. From cloud and data to AI, risk, and core business technologies, teams in Pune will lead high impact work with a global mandate.
Team Overview
IT Infrastructure Technology
Jefferies' IT Infrastructure Technology team builds and runs the core technology backbone that enables the firm to operate globally. It covers enterprise infrastructure engineering and operations while driving modernization initiatives such as Cloud Adoption and Network Resiliency. The group supports critical platforms including networking, end-user computing, cloud, databases, server/Unix engineering, communications, and global data centers (including low-latency colocations near exchanges).
Role summary
Build and operate scalable, reliable data pipelines and infrastructure on AWS to power analytics, reporting, and data-driven decision-making. As a Cloud Data Engineer, you will design and implement data ingestion, transformation, and orchestration workflows, optimize data storage and processing, and ensure data quality and governance. You will partner with Analytics, Data Platform, ML Engineering, and business stakeholders to deliver high-quality data products that meet business needs.
You will collaborate with Analytics Engineers, Data Analysts, ML Engineers, Data Platform Engineers, Cloud Security, Database, and business stakeholders all divisions. Strong teamwork, customer service orientation, and the ability to translate business requirements into technical solutions are essential. Experience working in Agile teams using Jira and Confluence is expected
Key responsibilities
- Design, build, and maintain scalable data pipelines and ETL/ELT workflows on AWS using services such as Glue, EMR, Lambda, Step Functions, Kinesis, and S3
- Implement data ingestion from diverse sources (databases, APIs, streaming platforms, third-party providers); ensure reliability, performance, and error handling
- Develop and optimize data transformation logic using SQL, Python, PySpark, or similar frameworks; ensure data quality, consistency, and lineage
- Design and implement data storage solutions on AWS: S3 data lakes, Redshift data warehouses, DynamoDB, RDS, and Aurora; optimize for cost, performance, and access patterns
- Build and maintain Infrastructure as Code (Terraform) for data infrastructure; follow team standards for modules, state management, and Terraform Enterprise workflows
- Implement CI/CD pipelines for data workflows using GitHub, Bamboo, GitLab, or similar tools; ensure automated testing, deployment, and monitoring
- Establish data governance and security controls: encryption at rest and in transit, IAM policies, data classification, audit logging, and compliance with regulatory requirements
- Collaborate with Analytics Engineers, Data Analysts, and ML Engineers to understand data requirements and deliver datasets optimized for downstream consumption
- Monitor and troubleshoot data pipeline performance, failures, and data quality issues; implement proactive alerting and remediation
- Partner with Cloud Architecture, Cloud Security, and Database teams to ensure data infrastructure aligns with enterprise standards and best practices
- Document data pipelines, data models, and operational procedures; contribute to team knowledge base
- Drive automation and toil reduction; leverage GenAI and agentic workflows to improve engineering productivity
- Proficiency using GenAI assistants (ChatGPT, Claude, GitHub Copilot) for SQL generation, pipeline code development, troubleshooting, and documentation
- Ability to implement AI agent-driven automation (agentic workflows) for data engineering tasks (e.g., data quality checks, anomaly detection, pipeline optimization) with enterprise safety controls: human-in-the-loop validation, comprehensive logging and auditability, guardrails to prevent data corruption, rollback mechanisms, and secure credential handling
- Proactive mindset to identify data engineering toil; ship automation that measurably reduces manual work and improves pipeline reliability
- Monitor data pipeline health and SLAs; respond to failures and data quality incidents promptly
- Participate in on-call rotation as needed for critical data workflows
- Conduct post-incident reviews for data pipeline failures; track corrective actions to completion
- Maintain runbooks and operational documentation for data infrastructure
- Continuously improve pipeline performance, cost efficiency, and data quality
Requirements
- 7+ years in data engineering, data platform, or analytics engineering roles with strong cloud focus
- Deep expertise in AWS data services: S3, Glue, EMR, Redshift, Athena, Lambda, Kinesis, Step Functions, and Lake Formation
- Strong programming skills in Python and SQL; experience with PySpark or similar big data frameworks
- Proficiency building ETL/ELT pipelines at scale; experience with data orchestration tools (Airflow, Step Functions, Prefect)
- Solid understanding of data modelling, data warehousing, and dimensional design (star schema, snowflake schema)
- Experience with Infrastructure as Code (Terraform) and CI/CD pipelines for data infrastructure
- Strong understanding of data governance, security, and compliance best practices
- Familiarity with streaming data platforms (Kafka, Kinesis, MSK) is a plus
- Excellent problem-solving skills and attention to data quality and reliability
- Strong communication and collaboration skills; ability to work with technical and non-technical stakeholders
Preferred
- AWS Certified Data Analytics or Big Data Specialty certification
- Experience with Snowflake or Databricks for data processing and analytics
- Familiarity with dbt (data build tool) for transformation workflows
- Background in software engineering or site reliability engineering
- Experience with data catalog and metadata management tools (AWS Glue Catalog, Collibra, Alation)
- Knowledge of DataOps practices and data quality frameworks
We have been made aware of bad actors falsely claiming to be associated with Jefferies Group soliciting individuals to attend virtual job interviews, complete online tests or courses and sending fictitious employment offer letters.
Please note that any email contact with Jefferies personnel will come from an "@jefferies.com" email address. Further, Jefferies will not notify shortlisted candidates through social media platforms (e.g. WhatsApp or Telegram) or ask candidates to make payment to participate in the hiring process.
#LI-MF1
Jefferies is a leading global, full-service investment banking and capital markets firm that provides advisory, sales and trading, research, and wealth and asset management services. With more than 40 offices around the world, we offer insights and expertise to investors, companies, and governments.
At Jefferies, we believe that diversity fosters creativity, innovation and thought leadership through the infusion of new ideas and perspectives. We have made a commitment to building a culture that provides opportunities for all employees regardless of our differences and supports a workforce that is reflective of the communities where we work and live. As a result, we are able to pool our collective insights and intelligence to provide fresh and innovative thinking for our clients.
Jefferies is an equal employment opportunity employer, and takes affirmative action to ensure that all qualified applicants will receive consideration for employment without regard to race, creed, color, national origin, ancestry, religion, gender, pregnancy, age, physical or mental disability, marital status, sexual orientation, gender identity or expression, veteran or military status, genetic information, reproductive health decisions, or any other factor protected by applicable law. We are committed to hiring the most qualified applicants and complying with all federal, state, and local equal employment opportunity laws. As part of this commitment, Jefferies will extend reasonable accommodations to individuals with disabilities, as required by applicable law.
Below are some other jobs we think you might be interested in.
-
Cloud Data Engineer
- Jefferies
- Pune,IN,411001
Innovation Hub Overview Jefferies is creating a Technology Innovation Hub in Pune, a greenfield opportunity to build the systems that power global...18 May -
Cloud Data Engineer
- ExlService Holdings, Inc.
- Gurugram, Haryana, India
We are looking for a highly motivated and detail-oriented Data Engineer to join our growing data and analytics team. In this role, the candidate will be...07 Jun -
AWS Cloud - Data Engineer
- Endava
- Bengaluru,Karnataka,India
Company Description Technology is our how. And people are our why. For over two decades, we have been harnessing technology to drive...15 May -
Sr. Cloud Data Engineer
- ClifyX
- India
Date Of Requisition 17-Oct-25 ECMS REQ ID/ Unique ID 543318 PU DNAHL Number of Openings 13 Sr. Cloud Data Engineer ...03 Jun -
Cloud Data Engineer - BLR
- Photon
- India
Location: Bangalore Experience: 10+ YearsThe Value You Deliver You will be responsible for designing, developing, and coordinating with other...19 May -
Google Cloud Data Engineer
- Muller`s Solutions
- Delhi,Delhi,India
Muller's Solutions is actively searching for a Google Cloud Data Engineer to enhance our data processing capabilities. In this vital role, you will be...13 May -
Amaze - Cloud Data Engineer
- Hexaware Technologies
- India
JD; Key Responsibilities Build LLM-based agents for profiling, modeling, quality, and transformation tasks and collaborate with architects on...06 Jun -
GCP Cloud Data engineer
- Diverse Lynx
- bengaluru,560063
JD: Total Yrs. of Experience* 5 Years Relevant Yrs. of experience* 5 Years Detailed JD *(Roles and Responsibilities) Retail banking...30 May -
Lead Cloud Data Engineer
- ExlService Holdings, Inc.
- Gurugram, Haryana, India
We are looking for an accomplished Lead Data Engineer to drive the architectural design, development, and optimization of our enterprise-scale data...07 Jun -
Sr Cloud Data Engineer
- ExlService Holdings, Inc.
- Gurugram, Haryana, India
We are seeking a highly skilled Senior Data Engineer to lead the design, development, and optimization of scalable data solutions that power enterprise...07 Jun -
Manager-Data Engineering-Cloud Data Engineering
- ExlService Holdings, Inc.
- Gurugram, Haryana, India
Data Engineer Consultant - Job DescriptionJob Summary:Data Engineer (DE) Consultant is responsible for designing, developing, and maintaining data...13 May -
Area Manager-Data Engineering-Cloud Data Engineering
- ExlService Holdings, Inc.
- Noida, Uttar Pradesh, India
Role Title: Azure Data Engineer BU/Segment: Healthcare AnalyticsResponsibilities Design, develop, test, and deploy robust and scalable data pipelines...13 May -
DE&A - Core - Cloud Data Engineering - Snowflake Data Engineering
- Zensar Technologies
- Hyderabad,Telangana,IN,500019
You are passionate about data, possessing the ability to build and operate data pipelines, and maintain data storage within distributed systems. This...13 May -
Lead Assistant Manager-Data Engineering-Cloud Data Engineering
- ExlService Holdings, Inc.
- Chennai, Tamil Nadu, India
The ideal candidate will have strong expertise in Snowflake, Hadoop ecosystem, PySpark, and SQL, and will play a key role in enabling data-driven...13 May -
Associate - Business Analyst-Data Engineering-Cloud Data Engineering
- ExlService Holdings, Inc.
- Noida, Uttar Pradesh, India
Strong hands-on programming skills in Python with multiple design patterns. Strong hands-on expertise in Kubernetes and Docker, having ability to build...13 May -
Cloud Data Engineer – GCP(100% Remote)- India
- VGreenTEK
- Trivandrum North, KL,Trivandrum South, KL,Chennai, TN,Bengaluru, KA,Ahmedabad, GJ,Mumbai, MH,Pune, MH,Lucknow, UP,Delhi, DL,Gurgaon, HR, KA, IN
- Quick Apply
Cloud Data Engineer - GCP Location: 100% Remote Job Type: Contract Duration: 6+ Months Experience: 6+ Years Skills Required: 5+ years of...14 May -
SF -Data Cloud
- NTT DATA Services
- Delhi,DL,IN
Req ID: 371568 NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an...04 Jun -
DE&A - Core - Cloud Data Engineering - Informatica Cloud
- Zensar Technologies
- Pune,Maharashtra,IN,411014
We are seeking an experienced Informatica Intelligent Cloud Services (IICS) Developer to design, develop, and maintain scalable cloud-based data...24 May -
Salesforce Data Cloud Architect
- NTT DATA Services
- Hyderabad,TG,IN
NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable,...07 Jun -
Data Engineering and Cloud
- Bosch Group
- bengaluru,India,560095
Company Description Bosch Global Software Technologies Private Limited is a 100% owned subsidiary of Robert Bosch GmbH, one of the...13 May