Posted 28 June, 2026
Network Engineer(Enterprise / AI Infrastructure)
HCLTech
Bengaluru, KA, IN
Full Time
Reference: b8fc6ecae67b511e
Job Description
Job Title: Network (Enterprise / AI Infrastructure)
Location: Noida/Bangalore
Skill Requirement
- Strong hands-on expertise in enterprise networking and Linux systems
- Responsible for troubleshooting, performance tuning, and operational stability of network and OS layers supporting AI/HPC/Kubernetes workloads
Networking (L3)
- Certifications (Preferred):
- CCNA (Mandatory baseline)
- CCNP (Strongly preferred)
- Experience:
- 5–10 years in data center / cloud networking operations
- Hands-on experience with:
- Routing & Switching (BGP, OSPF, VLANs)
- Load balancing, firewall basics
- InfiniBand card and switches
- Nvidia UFM
- Skill Depth:
- Strong in incident troubleshooting and RCA
- Ability to diagnose:
- Latency, packet loss, connectivity failures
- Network issues affecting GPU / distributed workloads
- Working knowledge of cloud networking (AWS/Azure/GCP)
- Working Knowledge of AI/HPC networking for distributed training for GPU-GPU communication.
- Define and maintain production readiness standards across platform, data, model, application, and security layers.
- Establish SLO/SLI frameworks for latency, availability, quality, safety, and drift implement error budget policies.
- Maintain compliance mappings (e.g., ISO 27001, SOC 2, GDPR/DPDP, HIPAA where applicable).
- Author PRR checklists, runbooks/playbooks, and DR/BCP blueprints (RTO/RPO, multi‑region/site failover). Drive enablement (trainings, brown-bags) and maintain knowledge repositories and decision records.
- Implement observability (tracing, metrics, logs), dashboards, and SLO burn and cost anomaly alerting.
Execute safe releases (canary/shadow/blue green), prompt/model versioning, feature flags, and rollback plans.