Skip to main content
Posted 16 June, 2026

Senior Data Engineer - IBM Datastage

PwC Acceleration Center India
Hyderabad, TG, IN Full Time
Reference: 3a7e8267a6861324

Job Description

Senior Data Engineer


Location : Bangalore/Hyderabad

Experience : 5+ Years Data Engineering | Healthcare Analytics Preferred


Position Summary

The Senior Data Engineer role takes ownership of complex pipeline design, architectural decisions, and end-to-end delivery accountability for enterprise data integration workflows. Beyond development, the Senior Data Engineer mentors junior engineers, drives platform stabilization in the AWS-migrated environment, and actively contributes to modernization planning. This individual operates with high autonomy and is expected to resolve the most complex data engineering challenges across a production footprint of ~2,000 stored procedures, ~225 scheduled jobs, and ~40 job streams.

Key Responsibilities

  • Lead the design and architecture of enterprise ETL/ELT pipelines, data integration workflows, and transformation frameworks across DB2, SQL Server, and AWS
  • Define and enforce data engineering standards — coding patterns, naming conventions, error handling, logging, and performance benchmarks
  • Architect scalable data integration solutions using IBM DataStage , Oracle GoldenGate, and SQL-based transformation layers
  • Evaluate emerging technologies and propose modernization opportunities aligned to Project Catalyst objectives
  • Own L2/L3 escalations for complex pipeline failures, data quality issues, and integration incidents across the production footprint
  • Manage and optimize IBM Workload Scheduler job streams, dependency chains, and SLA-driven scheduling configurations
  • Oversee DB2 and SQL Server platform health — object optimization, index management, storage monitoring, and release governance
  • Lead monthly release cycles end-to-end including environment coordination, change control, and production readiness validation
  • Drive automation of high-volume manual tasks — dataset refreshes, extract modifications, and scheduled distribution workflows
  • Maintain comprehensive data lineage documentation, pipeline dependency maps, and transformation logic runbooks
  • Define and implement data validation frameworks and reconciliation controls across critical pipeline assets
  • Lead documentation efforts for runbooks, standard operating procedures, onboarding guides, and disaster recovery playbooks
  • Support rebadged workforce integration by transferring institutional knowledge and onboarding new team members to PHS systems
  • Partner with Tableau and BusinessObjects developers, business analysts, and domain leads to ensure data reliability for reporting
  • Proactively communicate pipeline status, capacity constraints, and delivery timelines to the Analytics Delivery Manager
  • Support CSAT improvement by resolving data trust and availability issues that create friction for end-user analytics





Required Qualifications

  • Minimum Degree Required: Bachelor’s Degree in Engineering, Statistics, Mathematics, Computer Science, Data Science, Economics, or a related quantitative field
  • 5-9 years of data engineering experience with deep expertise in ETL/ELT development, pipeline architecture, and enterprise data integration
  • Expert-level SQL proficiency in IBM DB2 and/or SQL Server — complex queries, stored procedures, performance tuning, and schema design
  • Hands-on IBM DataStage experience including job design, parallel processing, and enterprise deployment
  • Strong experience with IBM Workload Scheduler or equivalent enterprise job scheduling platforms
  • Proficiency in Oracle GoldenGate or equivalent CDC/replication technologies
  • Demonstrated experience in AWS data services — S3, Glue, RDS, Redshift, or Lambda — in a production data engineering context
  • Proven ability to lead technical design, conduct reviews, and mentor junior engineers
  • Strong RCA and troubleshooting skills across complex, multi-system data environments
  • Experience in regulated environments with HIPAA, audit trail, and data access control requirements

Preferred Qualifications

  • Healthcare data engineering experience — claims processing, clinical data (HL7/FHIR), EMR integration, or population health pipelines
  • AWS certification (Data Engineer – Associate, Solutions Architect, or equivalent)
  • Experience with modern data stack tools (dbt, Apache Airflow, Spark) as part of a modernization roadmap
  • Familiarity with Tableau or BusinessObjects data source architecture and performance optimization
  • Knowledge of HITRUST, CMS reporting requirements, or healthcare regulatory data standards
  • Experience supporting on-premises to AWS cloud migration for enterprise data platforms

Sign up for Job Alerts