Posted 25 June, 2026

Senior Data Engineer - Teradata/Databricks & AI/ML

VARITE INDIA PRIVATE LIMITED

Bangalore, Karnataka, IN Full Time

Reference: 26-35432-2522-2

Company Name: VARITE India Private Limited

About The Client:
The Client is a global, multinational IT services and consulting company headquartered in Tokyo, Japan. The Client offers a wide array of IT services, including application development, infrastructure management, and business process outsourcing. Its consulting services span business and technology, while its digital solutions focus on transformation and user experience design. It excels in data and intelligence services, emphasizing analytics, AI, and machine learning. Additionally, its cybersecurity, cloud, and application services round out a comprehensive portfolio designed to meet the diverse needs of businesses worldwide.

About The Job:

Client Americas is seeking a highly skilled Senior Data Engineer to support a Teradata Utilization Analysis engagement, which focuses on analyzing Teradata platform utilization to identify cost-reduction opportunities and deliver a repeatable, log-driven analysis solution.
The ideal candidate combines deep Teradata DBA expertise with hands-on Databricks engineering capability and applied AI/ML skills. The analysis will be performed primarily in Databricks, leveraging DBQL logs, system metadata, and external scheduling/orchestration data sources.

Essential Job Functions:

Ingest and validate 18+ months of Teradata DBQL logs including SQL text, object usage, timestamps, user/application IDs, row counts, and steps.
Integrate metadata from Autosys (scheduling), DataStage (orchestration), and MagicWand (observability) to supplement DBQL analysis.
Build end-to-end, log-driven analysis pipelines in Databricks to identify unused datasets, read-only (non-updating) datasets, and unused partitions within active datasets.
Capture and analyze CPU/IO resource usage and workload statistics (ResUsage) to quantify cost-reduction opportunities.
Classify data into cold, warm, and hot tiers; generate heatmaps of date/partition access patterns.
Develop a prioritized recommendation backlog with expected savings, risk levels, and required changes.
Apply AI/ML models or LLM-assisted analysis to detect access pattern anomalies, predict cold data candidates, and automate classification.
Produce and present deliverables: Observation Report, Workshop Notes & Action Log, and Final Readout for customer stakeholders.
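
For candidates gauging the scope, the cold/warm/hot tiering step listed above can be pictured roughly as follows. This is a minimal sketch, not the engagement's actual method: the 30-day and 180-day thresholds, the function name `classify_tier`, and the sample table names and dates are all assumptions made for illustration. In practice the last-access dates would be derived from DBQL object-usage logs in Databricks rather than hard-coded.

```python
from datetime import date

# Hypothetical thresholds (days since last access); the posting does not define them.
HOT_MAX_DAYS = 30
WARM_MAX_DAYS = 180


def classify_tier(last_access, as_of):
    """Return 'hot', 'warm', or 'cold' based on days since last access.

    A dataset with no recorded access (None) is treated as cold.
    """
    if last_access is None:
        return "cold"
    age_days = (as_of - last_access).days
    if age_days <= HOT_MAX_DAYS:
        return "hot"
    if age_days <= WARM_MAX_DAYS:
        return "warm"
    return "cold"


# Made-up last-access dates standing in for values derived from access logs.
last_access_by_table = {
    "sales.orders": date(2026, 6, 20),
    "sales.orders_2019": date(2025, 1, 15),
    "ref.country_codes": date(2026, 3, 1),
    "tmp.never_read": None,
}

as_of = date(2026, 6, 25)
tiers = {name: classify_tier(ts, as_of) for name, ts in last_access_by_table.items()}
for name, tier in sorted(tiers.items()):
    print(f"{name}: {tier}")
```

A recency-only rule like this is the simplest possible baseline; the role also calls for frequency, seasonality, and ML-based signals, which would replace or refine these fixed cutoffs.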

Qualifications:

Experience (Relevant): 10+ Years
Shift Timings: 2 pm to 11 pm IST

Top 3 mandatory skills:

Please focus on the Databricks AI/ML skills and we can be flexible on the cost for the right resource.

Required Skills & Experience
Teradata DBA Skills

Skill / Technology	Min. Experience	Relevance to Work Order
Teradata DBA / Administration	7+ years	Primary platform under analysis; deep knowledge of system internals required
Teradata DBQL (Database Query Logging)	5+ years	Core data source — SQL text, object usage, timestamps, user/app IDs, row counts, steps
Teradata System Views & Space Metadata	5+ years	Object inventory, space analysis, last read/write tracking
Teradata SQL & Performance Tuning	7+ years	Writing complex analytical queries across DBQL and system catalog tables
Teradata ResUsage / Workload Statistics	3+ years	Quantifying CPU/IO cost impact for dataset removal/archival recommendations
Data Classification (hot/warm/cold)	3+ years	Categorizing datasets for tiered storage management recommendations
ETL Pipeline Assessment (DataStage, Autosys)	4+ years	Identifying and recommending decommission of stale ingestion pipelines
Teradata BTEQ / FastExport / TPT	3+ years	Data extraction, log export, and metadata collection utilities

Databricks Engineering Skills

Skill / Technology	Min. Experience	Relevance to Work Order
Databricks Platform (Workspaces, Clusters, Jobs)	4+ years	Core analysis and solution delivery platform for the engagement
Apache Spark (PySpark / Spark SQL)	5+ years	Large-scale log ingestion, transformation, and analytical processing
Delta Lake	3+ years	Building reliable, ACID-compliant data pipelines for log-driven analysis
Databricks Notebooks & Workflows	3+ years	Developing repeatable, shareable analytical notebooks for customer handoff
Python (pandas, numpy, matplotlib)	5+ years	Data wrangling, statistical analysis, and visualization of usage heatmaps
SQL (Spark SQL / ANSI SQL)	7+ years	Analytical querying across large DBQL datasets in Databricks
Unity Catalog / Data Governance in Databricks	2+ years	Organizing deliverable assets and maintaining data lineage within the solution
Dashboarding (Databricks SQL / Power BI)	3+ years	Usage heatmaps, partition access patterns, and recommendation visualizations

AI / ML Skills

Skill / Technology	Min. Experience	Relevance to Work Order
Machine Learning (scikit-learn, MLflow)	3+ years	Anomaly detection on access patterns, predictive cold-data classification
LLM / Generative AI Integration	2+ years	LLM-assisted log interpretation, automated recommendation narrative generation
Feature Engineering on Time-Series Log Data	3+ years	Extracting access frequency, recency, and seasonality signals from DBQL logs
Clustering & Classification Algorithms	3+ years	Unsupervised grouping of datasets by usage behavior for cold/warm/hot tiers
MLflow / Experiment Tracking	2+ years	Tracking model runs for reproducible analysis and customer handoff artifacts
Natural Language Processing (NLP)	2+ years	Parsing and classifying SQL text from DBQL for workload pattern analysis
Data Visualization (Plotly, Matplotlib, Seaborn)	3+ years	Generating usage heatmaps and partition access charts for stakeholder readouts

Preferred

Familiarity with DataStage orchestration log parsing
Prior work on data platform cost optimization or cloud migration readiness assessments
Experience working in healthcare payer data environments (HIPAA awareness)
Strong written communication skills for advisory deliverable authoring (Observation Reports, recommendation backlogs)

How to Apply: Interested candidates are encouraged to respond/submit their updated resumes, and for additional job opportunities, please visit Jobs In India – VARITE.

Unlock Rewards: Refer Candidates and Earn.
If you're not available or interested in this opportunity, please pass this along to anyone in your network who might be a good fit and interested in our open positions. VARITE offers a Candidate Referral program, where you'll receive a one-time referral bonus based on the following scale if the referred candidate completes a three-month assignment with VARITE.

Experience Level	Referral Bonus

0-2 years	INR 5,000
2-6 years	INR 7,500
6+ years	INR 10,000

About VARITE: VARITE is a global staffing and IT consulting company providing technical consulting and team augmentation services to Fortune 500 companies in the USA, UK, Canada, and India. VARITE is currently a primary and direct vendor to leading corporations in the verticals of Networking, Cloud Infrastructure, Hardware and Software, Digital Marketing and Media Solutions, Clinical Diagnostics, Utilities, Gaming and Entertainment, and Financial Services.

Equal Opportunity Employer:
VARITE is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, marital status, veteran status, or disability status.
