Skip to main content
Posted 16 June, 2026

Senior Data Specialist

Publicis Global Delivery (PGD)
Bengaluru, KA, IN Full Time
Reference: b8e194662f8301ab

Job Description

Hiring | Gen-AI Data Engineering Specialist\nLocation: Bangalore / Mumbai / Gurugram / Hyderabad / Pune\nExperience: 8-11 Years\nWork Model: Hybrid (2 Days Office)\n\nJob Description\nWe are looking for an experienced Gen-AI Data Engineering Specialist with strong expertise in building scalable data platforms and AI-ready data pipelines supporting LLMs, RAG systems, and enterprise Generative AI applications.\nThe ideal candidate should have deep experience in modern data engineering, cloud-native architectures, vector databases, and distributed data processing systems. This role will work closely with ML Engineers, AI teams, and Data Scientists to build high-performance data foundations powering next-generation AI platforms.\n\nKey Responsibilities\nDesign and develop scalable data pipelines for AI and Generative AI workloads\nBuild and optimize AI-ready data platforms supporting LLM and RAG use cases\nManage embeddings, vectorized datasets, and vector database integrations\nDevelop cloud-native batch and near real-time data processing systems\nEnsure data quality, reliability, scalability, and performance across platforms\nCollaborate with AI/ML teams to translate AI requirements into robust data solutions\nDrive data pipeline automation, orchestration, and CI/CD initiatives\nOptimize distributed data systems for cost, efficiency, and scalability\n\nMandatory Skills\nStrong hands-on experience with Python and SQL\nExpertise in Apache Spark and distributed data processing\nExperience with orchestration tools such as Airflow, Prefect, or Azure Data Factory\nStrong experience with cloud platforms such as Azure, AWS, or GCP\nExperience supporting AI/ML workloads, RAG pipelines, and LLM data systems\nHands-on experience with vector databases such as Pinecone, Milvus, or Chroma\nStrong understanding of big data and streaming technologies like Kafka and Spark\nExperience implementing CI/CD for data engineering workflows\nStrong debugging, optimization, and troubleshooting skills\n\nGood to Have\nExposure to Hugging Face, PyTorch, or TensorFlow\nExperience with Docker and Kubernetes\nFamiliarity with real-time data architectures\nExposure to enterprise AI and Generative AI platforms\n\nQualifications\nBachelor’s or Master’s degree in Computer Science, Engineering, Data Science, or related field\n5–8 years of experience in Data Engineering or AI Data Platforms\nProven experience working on large-scale cloud and distributed data systems\nExperience collaborating with AI/ML engineering teams is highly preferred

Sign up for Job Alerts