Posted 25 June, 2026

Senior Data Engineer - Teradata/Databricks & AI/ML

VARITE INDIA PRIVATE LIMITED
Bangalore, Karnataka, IN
Full Time
Reference: 26-35432-2522-2

Company Name: VARITE India Private Limited

About The Client:
The Client is a multinational IT services and consulting company headquartered in Tokyo, Japan. It offers a wide array of IT services, including application development, infrastructure management, and business process outsourcing. Its consulting services span business and technology, while its digital solutions focus on transformation and user experience design. The Client excels in data and intelligence services, emphasizing analytics, AI, and machine learning. Its cybersecurity, cloud, and application services round out a comprehensive portfolio designed to meet the diverse needs of businesses worldwide.

About The Job:
  • Client Americas is seeking a highly skilled Senior Data Engineer to support a Teradata Utilization Analysis engagement, which focuses on analyzing Teradata platform utilization to identify cost-reduction opportunities and deliver a repeatable, log-driven analysis solution.
  • The ideal candidate combines deep Teradata DBA expertise with hands-on Databricks engineering capability and applied AI/ML skills. The analysis will be performed primarily in Databricks, leveraging DBQL logs, system metadata, and external scheduling/orchestration data sources.
Essential Job Functions:
  • Ingest and validate 18+ months of Teradata DBQL logs including SQL text, object usage, timestamps, user/application IDs, row counts, and steps.
  • Integrate metadata from Autosys (scheduling), DataStage (orchestration), and MagicWand (observability) to supplement DBQL analysis.
  • Build end-to-end, log-driven analysis pipelines in Databricks to identify unused datasets, read-only (non-updating) datasets, and unused partitions within active datasets.
  • Capture and analyze CPU/IO resource usage and workload statistics (ResUsage) to quantify cost-reduction opportunities.
  • Classify data into cold, warm, and hot tiers; generate heatmaps of date/partition access patterns.
  • Develop a prioritized recommendation backlog with expected savings, risk levels, and required changes.
  • Apply AI/ML models or LLM-assisted analysis to detect access pattern anomalies, predict cold data candidates, and automate classification.
  • Produce and present deliverables: Observation Report, Workshop Notes & Action Log, and Final Readout for customer stakeholders.
Qualifications:
  • Experience (relevant): 10+ years
  • Shift timings: 2 pm to 11 pm IST
Top 3 mandatory skills:
  • Please focus on the Databricks AI/ML skills; we can be flexible on the cost for the right resource.
Required Skills & Experience
Teradata DBA Skills
Skill / Technology | Min. Experience | Relevance to Work Order
Teradata DBA / Administration | 7+ years | Primary platform under analysis; deep knowledge of system internals required
Teradata DBQL (Database Query Logging) | 5+ years | Core data source: SQL text, object usage, timestamps, user/app IDs, row counts, steps
Teradata System Views & Space Metadata | 5+ years | Object inventory, space analysis, last read/write tracking
Teradata SQL & Performance Tuning | 7+ years | Writing complex analytical queries across DBQL and system catalog tables
Teradata ResUsage / Workload Statistics | 3+ years | Quantifying CPU/IO cost impact for dataset removal/archival recommendations
Data Classification (hot/warm/cold) | 3+ years | Categorizing datasets for tiered storage management recommendations
ETL Pipeline Assessment (DataStage, Autosys) | 4+ years | Identifying and recommending decommission of stale ingestion pipelines
Teradata BTEQ / FastExport / TPT | 3+ years | Data extraction, log export, and metadata collection utilities

Databricks Engineering Skills
Skill / Technology | Min. Experience | Relevance to Work Order
Databricks Platform (Workspaces, Clusters, Jobs) | 4+ years | Core analysis and solution delivery platform for the engagement
Apache Spark (PySpark / Spark SQL) | 5+ years | Large-scale log ingestion, transformation, and analytical processing
Delta Lake | 3+ years | Building reliable, ACID-compliant data pipelines for log-driven analysis
Databricks Notebooks & Workflows | 3+ years | Developing repeatable, shareable analytical notebooks for customer handoff
Python (pandas, numpy, matplotlib) | 5+ years | Data wrangling, statistical analysis, and visualization of usage heatmaps
SQL (Spark SQL / ANSI SQL) | 7+ years | Analytical querying across large DBQL datasets in Databricks
Unity Catalog / Data Governance in Databricks | 2+ years | Organizing deliverable assets and maintaining data lineage within the solution
Dashboarding (Databricks SQL / Power BI) | 3+ years | Usage heatmaps, partition access patterns, and recommendation visualizations

AI / ML Skills
Skill / Technology | Min. Experience | Relevance to Work Order
Machine Learning (scikit-learn, MLflow) | 3+ years | Anomaly detection on access patterns, predictive cold-data classification
LLM / Generative AI Integration | 2+ years | LLM-assisted log interpretation, automated recommendation narrative generation
Feature Engineering on Time-Series Log Data | 3+ years | Extracting access frequency, recency, and seasonality signals from DBQL logs
Clustering & Classification Algorithms | 3+ years | Unsupervised grouping of datasets by usage behavior for cold/warm/hot tiers
MLflow / Experiment Tracking | 2+ years | Tracking model runs for reproducible analysis and customer handoff artifacts
Natural Language Processing (NLP) | 2+ years | Parsing and classifying SQL text from DBQL for workload pattern analysis
Data Visualization (Plotly, Matplotlib, Seaborn) | 3+ years | Generating usage heatmaps and partition access charts for stakeholder readouts

Preferred
  • Familiarity with DataStage orchestration log parsing
  • Prior work on data platform cost optimization or cloud migration readiness assessments
  • Experience working in healthcare payer data environments (HIPAA awareness)
  • Strong written communication skills for advisory deliverable authoring (Observation Reports, recommendation backlogs)
How to Apply: Interested candidates are encouraged to submit their updated resumes. For additional job opportunities, please visit Jobs In India – VARITE.

Unlock Rewards: Refer Candidates and Earn.
If you're not available for or interested in this opportunity, please pass it along to anyone in your network who might be a good fit for our open positions. VARITE offers a Candidate Referral program: you'll receive a one-time referral bonus, based on the following scale, if the referred candidate completes a three-month assignment with VARITE.

Experience Level | Referral Bonus
0-2 years | INR 5,000
2-6 years | INR 7,500
6+ years | INR 10,000

About VARITE: VARITE is a global staffing and IT consulting company providing technical consulting and team augmentation services to Fortune 500 companies in the USA, UK, Canada, and India. VARITE is currently a primary and direct vendor to leading corporations in the verticals of Networking, Cloud Infrastructure, Hardware and Software, Digital Marketing and Media Solutions, Clinical Diagnostics, Utilities, Gaming and Entertainment, and Financial Services.

Equal Opportunity Employer:
VARITE is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, marital status, veteran status, or disability status.
