Posted 04 June, 2026
Generative AI Engineer
HCLTech
Chennai, TN, IN
Full Time
Reference: a1bff7e5ca35ff65
Job Description
AI Engineer (LLM / Generative AI)\n\nThe AI Engineer will design, build, deploy, and operate production-grade Large Language Model (LLM) solutions. This role focuses on engineering rigor—turning LLM capabilities into reliable, scalable, and secure enterprise applications, and continuously improving them based on performance, cost, and user feedback.\n\nKey Responsibilities\n\n• Design and build LLM-powered applications using proprietary and open-source models.\n• Implement prompt engineering, Retrieval-Augmented Generation (RAG), tool/function calling, and agent workflows.\n• Deploy LLM solutions into cloud and enterprise environments with scalability and reliability.\n• Build inference APIs, microservices, and CI/CD pipelines for AI applications.\n• Monitor model quality, latency, cost, drift, and hallucinations in production.\n• Fine-tune and enhance models using parameter-efficient techniques where required.\n• Optimize inference performance using caching, batching, quantization, and prompt optimization.\n• Ensure security, privacy, and responsible AI guardrails in all deployments.\n• Collaborate with product, platform, and engineering teams to deliver enterprise AI solutions.\n\nRequired Skills & Experience\n\n• Strong programming skills in Python; experience with backend APIs.\n• Hands-on experience with LLM frameworks (LangChain, LlamaIndex, or equivalent).\n• Experience building RAG pipelines using vector databases.\n• Knowledge of Docker, Kubernetes, cloud platforms, and CI/CD pipelines.\n• Familiarity with LLMOps / MLOps tools and monitoring systems.\n\nExperience Level\n• 3–15 years of overall software or ML engineering experience.\n\nSuccess Expectations\n\n• Production-grade LLM solutions running reliably at scale.\n• Continuous improvement in quality, cost-effectiveness, and user outcomes.\n• Reusable AI engineering patterns that accelerate enterprise AI adoption.