Posted 18 May, 2026
Data Scientist
Veracity
Pune,Maharashtra,India
Full Time
Reference: 365_621153_24-02017
Position: Data Scientist
Location: Pune- Twice a month in the office. Maharashtra candidates only.
Job Summary:
We are looking for an experienced Data Scientist with 5-6 years of hands-on
experience in developing machine learning and deep learning models,
specifically in the domain of text analytics and natural language processing
(NLP). The ideal candidate will take ownership of the entire process, from
understanding requirements to proposing and designing effective solutions.
This role requires a professional capable of developing innovative projects
in NLP, text analytics and text generation.
Key Responsibilities:
Collaborate with stakeholders to gather and understand business
requirements.
Propose and design innovative solutions using machine learning and
deep learning techniques.
Develop and validate models for text analytics, NLP, and text
generation applications, using machine learning, deep learning
techniques
Analyze and assess the effectiveness of solutions through testing and
validation.
Conduct comprehensive data analysis and statistical modeling to
extract insights and drive data-driven decision-making
Communicate insights and results clearly to both technical and non-
technical audiences.
Stay updated on industry trends and advancements in data science
and NLP.
Mentor and guide juniors, fostering a culture of continuous learning
and improvement.
Required Skill Set (Must):
Good analytical skills and an inherent ability to solve problems.
Bachelor's or master's degree in Data Science, Statistics,
Mathematics, Computer Science, or a related field.
Proven expertise in text analytics, NLP, machine learning, and deep
learning.
Strong proficiency in Python, demonstrating algorithmic thinking.
Proven experience with machine learning libraries such as NLTK,
spaCy, Scikit-learn, Pandas, and NumPy.
Must have experience in training and validating models using deep
learning frameworks like TensorFlow or PyTorch, including Hugging
Face.
Must have experience in tuning and validating models from the
families of BERT, T5, BART, or FLAN-T5.
Must have experience in performing hyperparameter tuning in at
least one project.
Good To Have Skill Set:
In-depth knowledge of the fundamentals of transformer-based
models and related techniques.
Hands-on experience with generative models (e.g., GANs, VAEs,
GPT).
Experience developing with frameworks like LangChain and
LlamaIndex
Other GenAI related skills RAG pipeline design, Multi agentic
solution
Location: Pune- Twice a month in the office. Maharashtra candidates only.
Job Summary:
We are looking for an experienced Data Scientist with 5-6 years of hands-on
experience in developing machine learning and deep learning models,
specifically in the domain of text analytics and natural language processing
(NLP). The ideal candidate will take ownership of the entire process, from
understanding requirements to proposing and designing effective solutions.
This role requires a professional capable of developing innovative projects
in NLP, text analytics and text generation.
Key Responsibilities:
Collaborate with stakeholders to gather and understand business
requirements.
Propose and design innovative solutions using machine learning and
deep learning techniques.
Develop and validate models for text analytics, NLP, and text
generation applications, using machine learning, deep learning
techniques
Analyze and assess the effectiveness of solutions through testing and
validation.
Conduct comprehensive data analysis and statistical modeling to
extract insights and drive data-driven decision-making
Communicate insights and results clearly to both technical and non-
technical audiences.
Stay updated on industry trends and advancements in data science
and NLP.
Mentor and guide juniors, fostering a culture of continuous learning
and improvement.
Required Skill Set (Must):
Good analytical skills and an inherent ability to solve problems.
Bachelor's or master's degree in Data Science, Statistics,
Mathematics, Computer Science, or a related field.
Proven expertise in text analytics, NLP, machine learning, and deep
learning.
Strong proficiency in Python, demonstrating algorithmic thinking.
Proven experience with machine learning libraries such as NLTK,
spaCy, Scikit-learn, Pandas, and NumPy.
Must have experience in training and validating models using deep
learning frameworks like TensorFlow or PyTorch, including Hugging
Face.
Must have experience in tuning and validating models from the
families of BERT, T5, BART, or FLAN-T5.
Must have experience in performing hyperparameter tuning in at
least one project.
Good To Have Skill Set:
In-depth knowledge of the fundamentals of transformer-based
models and related techniques.
Hands-on experience with generative models (e.g., GANs, VAEs,
GPT).
Experience developing with frameworks like LangChain and
LlamaIndex
Other GenAI related skills RAG pipeline design, Multi agentic
solution