ML Ops Engineer
Job Description
Role Brief:
We are seeking a skilled ML Ops Engineer to design, implement, and maintain scalable machine learning and large language...
Job Description
Role Brief:
We are seeking a skilled ML Ops Engineer to design, implement, and maintain scalable machine learning and large language model (LLM) pipelines in cloud environments, primarily using AWS services. This role is critical to ensuring the reliability, efficiency, and performance of ML systems in production.
The ideal candidate will have hands-on experience with AWS tools such as SageMaker, Lambda, Bedrock, Batch with Fargate, and infrastructure components like RDS, DynamoDB, and SQS. You will be responsible for automating CI/CD workflows, managing auto-scaling APIs, and provisioning cloud resources to support high-performance ML workloads, including RAG systems.
Primary Responsibilities:
- Strategizing and implementing scalable infrastructure for ML or LLM model pipelines using tools like and cloudservices such as AWS (e.g.,AWS Batch, Fargate,Bedrock)
- Manage auto-scaling mechanisms to handle varying workloads and ensure high availability of Rest APIs
- Automate CI/CD pipelines and Lambda functions for model testing, deployment, and updates, reducing manual errorsand improving efficiency.
- Amazon SageMaker Pipelines for end-to-end ML workflow automation. Optimize utilizing step-functions
- Conduct drift analysis to detect and respond to data drift, concept drift, and label drift. Implement mitigation strategies such as automated alerts, model retraining triggers, and performance audits.
- Set up reproducible workflows for data preparation, model training, and deployment.
- Provision and optimize cloud resources (e.g., GPUs, memory) to meet computational demands of large models like those used in RAG systems
- Automate retraining workflows to keep models updated as data evolves
- Work closely with data scientists, ML engineers, and DevOps teams to integrate models into production environments.
- Implement monitoring tools to track model performance and detect issues like drift or degradation in real- time. Monitoring dashboards with real-time alerts for pipeline failures or performance issues C Implementing ModelObservability frameworks.
Required Skills:
- Education Any Engineering (BE/Btech/ME/Mtech)
- Min 5 years of experience with AWS services such as Lambda, Bedrock, Batch with Fargate, RDS (PostgreSQL), DynamoDB, SQS, CloudWatch, API Gateway, SageMaker
- Should have hands-on experience in drift analysis, including detecting and mitigating data, concept, and label drift in production ML systems
- Knowledge of ML frameworks (e.g., PyTorch, TensorFlow) to understand model requirements during deployment
- Experience with Rest API Frameworks like Fast APIs, Flask
- Familiarity with model observability like Evidently, Nanny ML, Phoenix and monitoring tools (Grafana etc) and retraining tools like MLflow/ Kubeflow / Airflow
- AWS Certified Machine Learning – Specialty – Good to have this certification
Below are some other jobs we think you might be interested in.
-
ML Ops Engineer
- Blend360
- Hyderabad,TS,India
Company Description Blend is a premier AI services provider, committed to co-creating meaningful impact for its clients through the power...12 Jun -
Senior ML Ops Engineer
- Aditi Consulting
- Bangalore, Karnataka, IN
Summary:The Senior ML Ops Engineer is responsible for defining and owning the MLOps architecture for deep learning systems across the organization. This...08 Jun -
AI/ML OPS Engineer
- BNP Paribas
- Chennai, TN, IN
Job Description Position Overview We are seeking a Senior AI/ML OPS Engineer to join our team at BNP Paribas . The successful candidate will play a key...04 Jun -
Machine Learning (ML) Ops Engineer
- Bizmoni Corp.
- Mumbai, Maharashtra, India
Location: Fully remote Employment type: Part-time About Bizmoni Corp. Bizmoni is the worlds first AI Super App designed to help anyone earn, learn,...17 May -
ML-Ops
- Sia
- Mumbai,Maharashtra,India,400076
Company Description Sia Partners is a next-generation global management consulting firm, founded in 1999 and headquartered in Paris,...06 Jun -
ML OPS
- Aditi Consulting
- Bangalore,Karnataka
Experience: 5+ years Interview Mode: All Virtual Work Location: Remote Client Ops - 1 resource GPU cluster development...25 May -
Senior AI/ML Ops Engineer (Hybrid in Bangalore)
- Smartsheet
- Bangalore, INDIA
Our India Global Capability Center isn't just supporting global operations-we're leading global innovation. After scaling rapidly into a best-in-class...12 Jun -
Manager, AI/ML Ops Engineering (Hybrid in Bangalore)
- Smartsheet
- Bangalore, INDIA
For over 20 years, Smartsheet has helped people and teams achieve-well, anything. As the Intelligent Work Management Platform, we are redefining the...24 May -
Senior AI/ML Ops Engineer-II (Hybrid in Bangalore)
- Smartsheet
- Bangalore, INDIA
Our India Global Capability Center isn't just supporting global operations-we're leading global innovation. After scaling rapidly into a best-in-class...12 Jun -
Software Engineer (Data & ML Ops)
- NoBroker.com
- Bangalore,Karnataka,India,560035
About Us NoBroker is India's first PropTech Unicorn and world's largest NoBrokerage property platform, making the entire real-estate transaction...26 May -
Senior Software Engineer, Machine Learning (ML Ops)
- Roku
- Bengaluru, India
About the teamThe Advertising Performance group focuses on performance for all participants in the Advertising ecosystem - Advertisers, Publishers and...09 Jun -
ML Ops Developer - F2F Interview - Delhi
- Tata Consultancy Services
- New Delhi, DL, IN
Job Description Greetings from Tata Consultancy Services!!! Role: ML Ops Developer Experience: 5 to 12 Years Interview Mode: F2F Interview Date:...14 Jun -
ML Ops Developer - F2F Interview - Delhi
- Tata Consultancy Services
- Noida, UP, IN
Job Description Greetings from Tata Consultancy Services!!! Role : ML Ops Developer Experience: 5 to 12 Years Interview Mode: F2F Interview...12 Jun -
ML engineer
- Crescens Inc.
- Chennai, Tamil Nadu, IN
Job Title: ML engineer Location: Chennai, Tamilnadu (onsite) Total Experience: 5 to 7 years Mode of Interview: Face to Face (Mandatory) Pay...08 Jun -
ML Engineer
- Antino Labs
- Haryana,Gurugram,India,122018
We seek a highly skilled ML Engineer with 3-5 years of industry experience specializing in AI/ML, MLOps, Azure cloud computing, and Python to...12 Jun -
ML Engineer
- Veracity
- Pune,Maharashtra,India
ML Engineer Location – Pune Experience – 5 to 7 years Role Overview: Key Responsibilities: Develop, train, and fine-tune Machine Learning models for...28 May -
12 Jun
-
ML engineer
- National Staffing
- Chennai,Tamil Nadu,India
Job Title: ML engineer Location: Chennai, Tamilnadu (onsite) Total Experience: 5 to 7 years Mode of Interview: Face to Face (Mandatory) Pay...09 Jun -
ML engineer
- Crescens Inc.
- Chennai,Tamil Nadu,India
Job Title: ML engineer Location: Chennai, Tamilnadu (onsite) Total Experience: 5 to 7 years Mode of Interview: Face to Face (Mandatory) Pay...09 Jun -
ML Engineer
- Clean Harbors
- Hyderabad,TG,IN,500081
About CleanHarborsClean Harbors Inc. (www.cleanharbors.com) is a NYSE listed US based $6.5 billion company. Clean Harbors was founded in 1980 near...05 Jun