Skip to main content
Posted 07 June, 2026

**Urgent**Edge AI Engineer | Edge AI Platforms| Embedded Programming| RAG|LLM| Vector DB| 4-6Yrs...

Senzcraft
Bengaluru, KA, IN Full Time
Reference: 13cfac0c7ee8993f

Job Description About Senzcraft: Founded by IIM Bangalore and IEST Shibpur Alumni, Senzcraft is a hyper-automation company. Senzcraft vision is to...

Job Description

About Senzcraft:


Founded by IIM Bangalore and IEST Shibpur Alumni, Senzcraft is a hyper-automation company. Senzcraft vision is to Radically Simplify Today's Work. And Design Business Process For The Future. Using intelligent process automation technologies.


We have a suite of SaaS products and services, partnering with automation product companies.

Please visit our website - https://www.senzcraft.com for more details


  • Our AI Operations SaaS platform – https://MagicOps.ai
  • Senzcraft on linkedin -> https://www.linkedin.com/company/senzcraft


Senzcraft is awarded by Analytics India Magazine in it’s report “State of AI in India” as a “Niche AI startup”. Senzcraft is also recognized by NY based SSON as a top hyper-automation solutions provider.


About the Opportunity:

We are looking for an engineer experienced in deploying and optimizing Large Language Models on edge devices, preferably NVIDIA Jetson platforms. The role is not limited to model inference; the candidate should be able to design practical LLM-based solutions for real-world scenarios using prompt engineering, input preprocessing, caching strategies, data creation, and model fine-tuning when required.

The ideal candidate should understand how to make LLM applications reliable, efficient, and context-aware under edge-device constraints such as limited compute, memory, latency, and power.


Location: Bangalore (Hybrid)

Experience: 4-6 Years


Mandatory Skills

  • Hands-on experience with LLMs, prompt engineering, and scenario-specific prompt design.
  • Experience running AI/ML models on edge devices with compute and memory constraints.
  • Practical knowledge of preprocessing techniques for text, speech transcripts, and structured inputs.
  • Experience implementing caching, context management, and optimization techniques for LLM applications.
  • Ability to create datasets and fine-tune or adapt models for domain-specific use cases.
  • Strong Python programming skills.
  • Understanding of NLP tasks such as intent handling, entity extraction, and text classification.
  • Experience with model evaluation, latency optimization, and debugging AI behavior.
  • Familiarity with NVIDIA Jetson or similar edge AI platforms.

Good to Have Skills

  • Experience with speech processing, speech-to-text systems, and audio preprocessing.
  • Knowledge of noise handling, speech enhancement, and robust voice input pipelines.
  • Experience with NER models and entity extraction pipelines.
  • Familiarity with TensorRT, ONNX, PyTorch, Hugging Face, or similar model deployment tools.
  • Experience with quantization, pruning, distillation, or other model compression techniques.
  • Knowledge of retrieval-augmented generation, vector databases, or local knowledge caching.
  • Experience building real-time AI applications on embedded Linux systems.
  • Familiarity with multilingual or domain-specific language processing.
  • Experience integrating LLMs with sensors, robotics, industrial systems, or IoT devices.






This listing expired on 15 Jun. Applications are no longer accepted.

Below are some other jobs we think you might be interested in.