Skip to main content
Posted 06 June, 2026

ATARI- Principal AI System Engineer

Nexthire
Delhi,IN Full Time
Reference: 136_762505_163a3c9d03a5

About Us: Founded in 1972, Atari is one of the world's most iconic consumer brands and a pioneer in the

video game industry, known for creating classics like Pong, Asteroids, and Centipede. Today, Atari Inc.

continues to build on its legacy by developing games, hardware, and experiences that honor the past while

driving innovation for the future.

Over the past two years, we've been building Atari India, a growing team that plays a critical role in supporting

our global operations. We're proud of the team we've assembled so far, and we're just getting started. As part

of a lean, high-impact organization, the team in India works closely with colleagues in North America and

Europe on projects that move the company forward. Whether you're helping launch a new game, keeping our

infrastructure secure, or supporting day-to-day operations, your work here matters. Join us as we continue to

grow Atari India and build the future of a legendary brand.

Position: Principal AI Systems Engineer

Experience: 8+ Years

Location: Netaji Subhash Place, Pitampura, New Delhi.

Employment Type: Full-Time (Hybrid)

Reports to: Senior Director of Technology, India

About the Role

Architect, build, and own AI systems that automate expert-intensive technical workflows end-to end -

from CLI frameworks, MCP servers, and agent tooling through to production deployment, business

outcome tracking, and continuous improvement. You solve real business problems with AI, ensure

solutions are fully implemented and adopted, and measure whether they are actually working.

Responsibilities

System Architecture

Own end-to-end architecture of AI automation systems: workflow decomposition, component

communication, human checkpoints, and failure behaviour

Design and build internal CLI frameworks, reusable libraries, and agent scaffolding

Author and maintain agent instruction files (SKILL.md, CLAUDE.md, system prompts) and MCP

server definitions

Configure Claude Code and Codex CLI environments: MCP wiring, tool permissions, slash

commands, and engineering standards

Evaluate and document architectural trade-offs across reliability, latency, cost, and

maintainability

Pipeline Development

Build production-grade AI pipelines in Python: orchestration, structured prompting, context

assembly, schema validation, and retry strategies

Integrate AI systems with external tooling - version control, build pipelines, SDKs, compliance

Design context assembly: how domain knowledge, runtime state, retrieved documents, and

tool outputs compose into the precise input each pipeline stage needs

Build and operate multi-agent systems: orchestrator-worker patterns, agent memory,

structured handoffs, and conflict resolution

Prompt & Context Engineering

Design, version, and maintain system prompts and agent instructions as first-class engineering

artefacts

Own output schema design and prompt regression testing with a maintained ground-truth eval

set

Engineer context windows with precision - balancing accuracy, token cost, and latency

through compression and selective retrieval

Partner with the RAG Engineer to define retrieval requirements - what knowledge is needed,

under what conditions, and at what granularity

Build and maintain structured runtime knowledge assets: curated document corpora, rule sets,

decision trees, and validation reference libraries

Work with domain experts to translate specialist knowledge into agent behaviour: decision

logic, edge cases, and failure modes

Evaluation & Reliability

Build and own the evaluation framework: test suites, regression benchmarks, LLM-as-judge

pipelines, and per-stage quality metrics

Implement production monitoring using LangFuse, Arize, or equivalent - latency, token usage,

success rates, and output quality drift

Run structured failure analysis and implement targeted fixes across context assembly,

orchestration, and tool integration

Define automation rate as a first-class metric and report on business effectiveness of deployed

systems

Governance & Technical Leadership

Implement full audit trails - inputs, tools called, outputs, and human review triggers

Enforce versioning of all agent instructions and system prompts as engineered artefacts with

controlled rollout

Set the technical standard for AI development across the organization - architecture patterns,

eval practices, and quality gates

Collaborate with engineering, product, and domain teams; engage leadership on roadmap

priorities and technical risk.

Requirements

Proven track record of building production AI automation systems from scratch - end-to-end from

architecture through deployment.

Hands-on expertise with Claude Code, Codex CLI, Cursor, or equivalent - including MCP server

configuration and agent instruction authoring

Experience designing and deploying MCP servers and custom tools: tool schema, authentication, and

permission boundaries

Experience building internal CLI frameworks, agent scaffolding, and reusable libraries that others build

on.

Experience creating internal tooling and automation that measurably improved engineering team

efficiency - reducing manual processes and accelerating workflows

Experience working with data scientists and domain experts to implement AI solutions that

measurably improved team productivity

Deep prompt and context engineering: system prompts, few-shot design, chain-of-thought, token

budget management, and prompt versioning

Proficiency with LLM orchestration frameworks - LangChain, LangGraph, LlamaIndex, AutoGen, or

equivalent

Experience building AI evaluation frameworks: test suites, regression benchmarks, LLM-as-judge, and

production quality monitoring

Production Python engineering: modular, testable, well-logged code with proper error handling

Cloud platform experience (AWS, Azure, or GCP): deploying and monitoring AI workloads with

containerisation

Experience integrating AI systems with external APIs - tool definition, permission management, and

failure handling

Experience defining and tracking AI productivity metrics: automation rate, time-to-completion, and

human intervention rate

Bonus Points

Experience in gaming: game development pipelines, Unity/Unreal engine architectures, or platform

certification processes

Familiarity with game engine scripting, asset pipelines, or platform SDKs (Xbox GDK, PlayStation SDK,

or similar)

Shift Timings: 9AM TO 6PM IST

Employment Type: FULL_TIME

Sign up for Job Alerts