Senior Machine Learning Engineer- Vashi, Navi Mumbai
Job Description
Job Description
Title: Senior Machine Learning Engineer, AI & ML
Reporting To: Engineering Manager, AI & ML
Job Overview
As...
Job Description
Job Description
Title: Senior Machine Learning Engineer, AI & ML
Reporting To: Engineering Manager, AI & ML
Job Overview
As a Senior Machine Learning Engineer (MLE) on the AI & ML (Data Collection) team, you will play a critical role in delivering AI-powered systems that extracts meaningful data from PitchBook’s wealth of structured and unstructured data, including reports, news, and other textual content.
This role requires deep technical expertise in advanced data analytics and machine learning, as well as a hands-on approach to designing, building, and optimizing ML solutions that empower the PitchBook Platform.
You will be deeply involved in the end-to-end development and operationalization of ML models, including their architecture, training, deployment, and ongoing maintenance. Your focus will span across natural language processing (NLP), generative AI (GenAI), large language models (LLMs), and scalable data systems. You will be expected to tackle complex technical challenges, contribute to architectural decisions, and collaborate closely with cross-functional stakeholders, including product managers, engineers, and domain experts, to translate requirements into scalable AI/ML solutions. Strong communication, collaboration, and a focus on delivering reliable systems will be key to your success in this role.
Your contributions will help unlock value for PitchBook customers by improving the accuracy, scalability, and efficiency of data extraction, enabling faster and more reliable access to structured information across the platform. This includes developing models that can infer meaning and structure from millions of discrete data sources, and applying ML to enrich our datasets with predictive and generative intelligence. As a senior engineer, you will take ownership of key technical components and ensure that our systems meet the highest standards of performance, reliability, and security.
You will be expected to contribute to the team’s technical excellence by providing guidance through code and design reviews, sharing best practices, and supporting peers when needed. While this role does not involve direct people management, you will lead by example through strong ownership and high-quality execution. You will actively contribute to solving complex technical problems and ensure your work aligns with broader product and business objectives.
In addition to driving product impact, you will have opportunities to deepen your expertise and contribute to the broader AI/ML community through experimentation, technical contributions, or knowledge sharing. A strong interest in advancing capabilities in areas such as generative AI, Agentic AI, LLMs, and applied NLP will help you continuously improve systems and deliver meaningful impact.
Team Overview
You will be part of a multidisciplinary team of ML engineers and data scientists responsible for building AI & ML solutions and services as part of robust data collection pipelines handling large volumes of unstructured data. Team will focus on building scalable and reliable systems to process and categorize data that is essential for downstream data collection processing.
Please Note: This document is a summary of the primary duties and responsibilities of the identified role as it pertains to PitchBook. Employees must maintain a high degree of flexibility in our rapidly changing environment, and as such, it should be noted that from time to time, they may be required to perform additional duties beyond the scope described within.
Outline of Duties and Responsibilities
• AI & ML Extraction Contribution: Build and deliver high-impact AI/ML solutions focused on extracting structured data from unstructured sources. Ensure outputs improve data quality, coverage, and reliability across data collection pipelines.
• Technical Execution: Design, develop, and deploy ML/NLP/LLM-based extraction systems. Contribute to building scalable, efficient, and production-grade services with strong focus on accuracy, latency, cost, and robustness.
• Extraction System Development: Develop and optimize extraction workflows using techniques such as document parsing, chunking, embeddings, RAG, and LLM-based extraction methods.
• Evaluation & Quality Improvement: Define and implement evaluation frameworks (precision, recall, F1, field-level accuracy) and continuously improve extraction performance through iterative experimentation.
• Data Pipeline Contribution: Work on high-throughput data collection pipelines, ensuring seamless integration of extraction components with upstream and downstream systems.
• MLOps & Reliability: Contribute to model deployment, monitoring, logging, and CI/CD pipelines. Ensure models are observable and reliable in production environments.
• Collaboration & Stakeholder Alignment: Partner with Product, Data Collection Engineering, and
Platform teams to translate requirements into scalable extraction solutions aligned with business goals.
• Code Quality & Knowledge Sharing: Maintain high standards of code quality, participate in design and code reviews, and share knowledge to improve overall team capability.
• Innovation & Continuous Improvement: Explore and apply advancements in NLP, LLMs, and
extraction techniques to improve system performance, scalability, and cost efficiency.
• Process & Delivery Efficiency: Contribute to efficient development cycles by following Agile practices and continuously improving workflows and automation.
• Hiring & Onboarding Support: Support hiring efforts through technical interviews and help onboard new team members via documentation and knowledge sharing.
Experience, Skills and Qualifications
• Bachelor’s or Master’s degree in Computer Science, Data Science, Mathematics, or a related technical field.
• 5+ years of experience in machine learning engineering or data science, with a strong focus on
unstructured data processing and information extraction
• Strong hands-on experience in NLP and extraction-focused ML (transformers, embeddings, RAG, LLM, Agentic workflows) with practical experience handling large-scale unstructured datasets, including preprocessing, chunking, and feature engineering
• Experience building and deploying production-grade ML/LLM systems for tasks such as document parsing, information extraction, and text processing
• Proficiency in Python and SQL, with experience using standard ML/data libraries such as scikit-learn, pandas, numpy, and deep learning frameworks like PyTorch or TensorFlow
• Experience with the LangChain and equivalent ecosystem for building and orchestrating LLM-based extraction workflows is a plus.
• Familiarity with data pipeline and orchestration tools such as Kafka, Airflow, or similar technologies.
• Experience with cloud platforms (AWS or GCP) and building scalable, production-ready systems
• Working knowledge of containerization (Docker) and exposure to Kubernetes is a plus
• Understanding of MLOps practices including model deployment, monitoring, evaluation, and iterative improvement.
• Effective communication and collaboration skills, with experience working cross-functionally with product, engineering, and data teams.
• Experience working in fast-paced, data-driven environments; prior exposure to financial data or similar domains is a plus.
Please Note: This document is a summary of the primary duties and responsibilities of the identified role as it pertains to PitchBook. Employees must maintain a high degree of flexibility in our rapidly changing environment, and as such, it should be noted that from time to time, they may be required to perform additional duties beyond the scope described within.
Working Conditions
The job conditions for this position are in a standard office setting. Employees in this position use PC and phones on an ongoing basis throughout the day. Limited corporate travel may be required to remote offices or other business meetings and events.
Morningstar is an equal opportunity employer!
Please Note: This document is a summary of the primary duties and responsibilities of the identified role as it pertains to PitchBook. Employees must maintain a high degree of flexibility in our rapidly changing environment, and as such, it should
be noted that from time to time, they may be required to perform additional duties beyond the scope described within.
Below are some other jobs we think you might be interested in.
-
Optometrist - Navi Mumbai (Vashi)
- Reliance Retail
- Navi Mumbai, Maharashtra, IN
Role: Optometrist Job Description - Dispenses and Fits Accurate Prescriptions Dispenses accurate prescriptions and fits spectacles, contact lenses and...15 Jun -
Senior Machine Learning Engineer
- Morningstar
- Mumbai, MH, IN
Job Description About the Role We are looking for a Senior Machine Learning Engineer to design, build, and scale production-grade ML and GenAI systems...16 Jun -
Senior Machine Learning Engineer
- Real
- Mumbai, Maharashtra, India
Real (Nasdaq: REAX) is a publicly traded, fast-growing global real estate brokerage powered by technology and driven by people. Since our founding in...12 Jun -
Senior Machine Learning Engineer
- BRAHMA
- Bengaluru,Karnataka,India
We are looking for a Gen AI Researcher for Audio to join our team and help develop next-generation voice synthesis models. You'll research and build...24 May -
Senior Machine Learning Engineer
- Veracity
- Bangalore,Karnataka,India
Senior Machine Learning Engineer Bangalore, Karnataka, India Joining time / Notice Period: Less than 15 Days, Immediate Preferred MUST be from...16 Jun -
Senior Machine Learning Engineer
- BRAHMA
- India
We are looking for a Gen AI Researcher for Audio to join our team and help develop next-generation voice synthesis models. You'll research and build...24 May -
Senior Machine Learning Engineer
- Sumo Logic
- Noida, Uttar Pradesh, India
Senior Machine Learning EngineerLocation: (Bangalore or Noida) The proliferation of machine log data has the potential to give organizations...12 Jun -
Senior Machine Learning Engineer
- Referrals Only
- Bangalore, India
Senior Machine Learning Engineers at Thoughtworks build, maintain and test the architecture and infrastructure for managing machine learning...05 Jun -
Senior Machine Learning Engineer
- SELECTED RECRUITMENT
- Area, JH, IN
Job Description Senior Machine Learning Engineer (MLOps & LLMs) — Relocation to Abu Dhabi or Dubai I'm partnering with a fast-growing AI company in...12 Jun -
Senior Machine Learning Engineer
- PubMatic
- Pune, IN
About the Role We are hiring a results-oriented Senior Machine Learning Engineer to join our growing team in Pune on a hybrid schedule. Reporting to...22 May -
Senior Machine Learning Engineer
- SELECTED RECRUITMENT
- Hyderabad, TG, IN
Job Description Senior Machine Learning Engineer (MLOps & LLMs) — Relocation to Abu Dhabi or Dubai I'm partnering with a fast-growing AI company in...12 Jun -
Senior Machine Learning Engineer
- Apex Systems
- Bengaluru, KA, IN
Job Description As a Senior Machine Learning Engineer (Model Training & Evaluation) , you will own the end-to-end training and evaluation cycle for our...16 Jun -
Senior Machine Learning Engineer
- RZR Global Inc.
- Bengaluru
Who are we? RZR Global is an AI-driven company specializing in mobile advertising solutions designed to fuel revenue growth. We leverage AI to...22 May -
Senior Machine Learning Engineer
- Voicegain
- Bengaluru, KA, IN
Job Description Job Title : Senior Machine Learning Engineer (ML SDE-3) Location: Bangalore. Hybrid Type: Full-time Experience: 7 - 10 Years...21 May -
Senior Machine Learning Engineer
- Barracuda Networks Inc.
- Bangalore ,Karnataka,,India
Envision yourself at Barracuda: We are looking for a talented Senior Machine Learning Engineer to join our growing team and help us develop cutting-edge...16 Jun -
Senior Machine learning Engineer
- Bosch Group
- bangalore,India,560030
Company Description Bosch Global Software Technologies Private Limited is a 100% owned subsidiary of Robert Bosch GmbH, one of the...26 May -
Senior Machine Learning Engineer
- Cognite - AI for Industry
- India (Bengaluru)
About the Opportunity We are building the next generation of contextual AI for Industrial Operations. Our team focuses on transforming unstructured,...16 Jun -
Senior Machine Learning Engineer
- Inovalon
- Gurugram, India
Job Title: Senior Full Stack Machine Learning EngineerAbout Us:Inovalon is a leading healthcare technology company dedicated to revolutionizing the...12 Jun -
Senior Machine Learning Engineer
- VARITE INDIA PRIVATE LIMITED
- Bangalore, Karnataka, IN
Company Name: VARITE India Private Limited About The Client: Client is an Indian multinational technology company specializing in information...27 May -
Senior Machine Learning Engineer
- Aqilea (formerly Soltia)
- Bangalore, Karnataka, India
We are a consulting company with a bunch of tech-savvy and happy people! We love technology, we love design, and we love quality. Our diversity makes...19 May