Posted 16 June, 2026

Senior Data Engineer - IBM Datastage

PwC Acceleration Center India

Bengaluru, KA, IN Full Time

Reference: fb055c34e9cebdb5

Job Description

Senior Data Engineer

Location : Bangalore/Hyderabad

Experience : 5+ Years Data Engineering | Healthcare Analytics Preferred

Position Summary

The Senior Data Engineer role takes ownership of complex pipeline design, architectural decisions, and end-to-end delivery accountability for enterprise data integration workflows. Beyond development, the Senior Data Engineer mentors junior engineers, drives platform stabilization in the AWS-migrated environment, and actively contributes to modernization planning. This individual operates with high autonomy and is expected to resolve the most complex data engineering challenges across a production footprint of ~2,000 stored procedures, ~225 scheduled jobs, and ~40 job streams.

Key Responsibilities

Lead the design and architecture of enterprise ETL/ELT pipelines, data integration workflows, and transformation frameworks across DB2, SQL Server, and AWS
Define and enforce data engineering standards — coding patterns, naming conventions, error handling, logging, and performance benchmarks
Architect scalable data integration solutions using IBM DataStage , Oracle GoldenGate, and SQL-based transformation layers
Evaluate emerging technologies and propose modernization opportunities aligned to Project Catalyst objectives
Own L2/L3 escalations for complex pipeline failures, data quality issues, and integration incidents across the production footprint
Manage and optimize IBM Workload Scheduler job streams, dependency chains, and SLA-driven scheduling configurations
Oversee DB2 and SQL Server platform health — object optimization, index management, storage monitoring, and release governance
Lead monthly release cycles end-to-end including environment coordination, change control, and production readiness validation
Drive automation of high-volume manual tasks — dataset refreshes, extract modifications, and scheduled distribution workflows
Maintain comprehensive data lineage documentation, pipeline dependency maps, and transformation logic runbooks
Define and implement data validation frameworks and reconciliation controls across critical pipeline assets
Lead documentation efforts for runbooks, standard operating procedures, onboarding guides, and disaster recovery playbooks
Support rebadged workforce integration by transferring institutional knowledge and onboarding new team members to PHS systems
Partner with Tableau and BusinessObjects developers, business analysts, and domain leads to ensure data reliability for reporting
Proactively communicate pipeline status, capacity constraints, and delivery timelines to the Analytics Delivery Manager
Support CSAT improvement by resolving data trust and availability issues that create friction for end-user analytics

Required Qualifications

Minimum Degree Required: Bachelor’s Degree in Engineering, Statistics, Mathematics, Computer Science, Data Science, Economics, or a related quantitative field
5-9 years of data engineering experience with deep expertise in ETL/ELT development, pipeline architecture, and enterprise data integration
Expert-level SQL proficiency in IBM DB2 and/or SQL Server — complex queries, stored procedures, performance tuning, and schema design
Hands-on IBM DataStage experience including job design, parallel processing, and enterprise deployment
Strong experience with IBM Workload Scheduler or equivalent enterprise job scheduling platforms
Proficiency in Oracle GoldenGate or equivalent CDC/replication technologies
Demonstrated experience in AWS data services — S3, Glue, RDS, Redshift, or Lambda — in a production data engineering context
Proven ability to lead technical design, conduct reviews, and mentor junior engineers
Strong RCA and troubleshooting skills across complex, multi-system data environments
Experience in regulated environments with HIPAA, audit trail, and data access control requirements

Preferred Qualifications

Healthcare data engineering experience — claims processing, clinical data (HL7/FHIR), EMR integration, or population health pipelines
AWS certification (Data Engineer – Associate, Solutions Architect, or equivalent)
Experience with modern data stack tools (dbt, Apache Airflow, Spark) as part of a modernization roadmap
Familiarity with Tableau or BusinessObjects data source architecture and performance optimization
Knowledge of HITRUST, CMS reporting requirements, or healthcare regulatory data standards
Experience supporting on-premises to AWS cloud migration for enterprise data platforms

Apply to this Job

Senior Data Engineer - IBM Datastage

Job Description

Sign up for Job Alerts

Share this Job