Posted 03 June, 2026
Agentic AI Engineer
ExlService Holdings, Inc.
Pune, Maharashtra, India
Full Time
Reference: 218_689623_14947
We are looking for an Agentic AI Engineer to design, build, execute, test, and orchestrate autonomous AI agent systems that operate across complex, multi-step workflows. You will work at the intersection of large language models, tool-use frameworks, and enterprise data pipelines to deliver reliable, production-grade agentic solutions.
EXL (NASDAQ: EXLS) is a leading data analytics and digital operations and solutions company. We partner with clients using a data and AI-led approach to reinvent business models, drive better business outcomes and unlock growth with speed. EXL harnesses the power of data, analytics, AI, and deep industry knowledge to transform operations for the world's leading corporations in industries including insurance, healthcare, banking and financial services, media and retail, among others. EXL was founded in 1999 with the core values of innovation, collaboration, excellence, integrity and respect. We are headquartered in New York and have more than 54,000 employees spanning six continents. For more information, visit www.exlservice.com.
- Minimum 4 years of AI engineering experience, with at least 3 years focused on LLM/agent systems in production.
- Hands-on experience designing agentic architectures: ReAct, plan-and-execute, reflection loops, tool-use patterns.
- Proficiency in Python; experience with at least one agent framework (LangChain/LangGraph, AutoGen, CrewAI, Semantic Kernel, or equivalent).
- Strong understanding of prompt engineering, context window management, and structured output extraction.
- Experience building and testing tool-use integrations: REST APIs, code interpreters, vector databases, SQL executors.
- Familiarity with evaluation frameworks for LLM outputs (RAGAS, custom eval harnesses, LLM-as-judge patterns).
- Understanding of agent safety concerns: prompt injection, tool misuse, hallucination detection, and mitigation strategies.
- Experience with cloud infrastructure (AWS/GCP/Azure) and containerization (Docker, Kubernetes).
- Experience with MLOps, AIOps tooling (MLflow, Weights & Biases, experiment tracking).
Strong experience designing and building memory and caching layers for agentic AI systems, including conversational memory, semantic retrieval, context optimization, and token cost reduction strategies for scalable production deployments
- Design and implement agentic AI systems (single and multi-agent) with tool use, memory, and fallback mechanisms.
- Build production-grade agents using frameworks like LangGraph, AutoGen, CrewAI, or custom LLM orchestration layers.
- Implement agent reasoning loops including planning, tool selection, execution, observation, and re-planning with safety guardrails.
- Develop prompt and context engineering strategies for reliable, grounded LLM outputs.
- Design agent orchestration workflows include task routing, parallel execution, state management, retries, and human-in-the-loop escalation.
- Build evaluation frameworks for LLMs and agents including automated testing, adversarial testing, and performance benchmarking.
- Implement retrieval and grounding using vector databases, embeddings, and knowledge graphs for contextual accuracy.
- Ensure observability of agent systems by tracing LLM calls, tool usage, and decision paths using monitoring tools.
- Apply security and governance controls including prompt injection defense, access control, and safe tool execution.
- Optimize agent systems for latency, cost, and scalability in production environments.
- Build CI/CD pipelines for agent workflows including versioning, testing, and controlled deployments.
- Integrate agents with enterprise systems and APIs to automate end-to-end business workflows.
- Design feedback loops using production traces and evaluation signals to continuously improve agent performance.
- Experience with Model Context Protocol (MCP) systems to design database connections, integrate APIs, and enable secure tool orchestration for AI agents.
- Hands-on experience in fine-tuning LLMs for domain-specific applications using LoRA, PEFT, QLoRA, RLHF, instruction tuning, and other parameter-efficient adaptation techniques.
- Stay current with emerging agentic AI frameworks, research, and best practices for production deployment.