AI DevOps Engineer - CST Time Zone
Job Description
Senior DevOps Engineer
Location: Remote (must be available to work in the CST time zone)
Experience Required: 8+ Years
Department: Engineering
Employment Type: Full-Time
Role Overview
Ekshvaku Tech Innovations is looking for a highly experienced Senior DevOps Engineer who excels at building scalable, automated, and resilient infrastructure for modern distributed systems and AI-driven platforms.
You will lead DevOps strategy, modernize our infrastructure, and ensure high performance across multi-environment deployments. This role requires strong hands-on expertise, architectural thinking, excellent documentation discipline, and the ability to collaborate with cross-functional engineering teams.
Key Responsibilities
CI/CD & Automation
- Design, implement, and maintain scalable CI/CD pipelines for Prerel, QA, and Production.
- Continuously improve build, release, and deployment workflows.
- Reduce manual ops through automation and scripting.
Infrastructure Engineering
- Automate provisioning and configuration using Terraform, Ansible, and similar IaC tools.
- Architect, deploy, and maintain cloud infrastructure (AWS/GCP).
- Lead modernization initiatives: containerization, orchestration, microservices optimization.
- Implement high availability setups, DR strategies, and cost-optimized infrastructure.
AI/ML Operations
- Support AI/ML model training and deployment pipelines.
- Manage GPU orchestration and scalable model-serving environments.
- Work closely with AI and Data Engineering teams.
Monitoring, Security & Observability
- Implement centralized monitoring, logging, and alerting systems.
- Improve system reliability and incident response times.
- Ensure cloud security best practices (IAM, network security, zero-trust).
- Contribute to compliance efforts for HIPAA, SOC2, ISO 27001.
Documentation & Collaboration
- Maintain clear documentation, runbooks, and architecture diagrams.
- Collaborate with Engineering, Product, QA, and AI teams.
- Evaluate new DevOps tools and AI-powered automation capabilities.
Required Skills & Experience
- 8+ years in DevOps or Cloud Infrastructure roles.
- Strong expertise in GitHub Actions and CI/CD pipeline architecture.
- Deep understanding of AWS or GCP (VPC, IAM, security, autoscaling).
- Production-level experience with Docker & Kubernetes (EKS/GKE).
- Proficiency in scripting: Python, Bash, or Go.
- Strong experience with Terraform, Ansible, CloudFormation.
- Solid understanding of monitoring tools (Prometheus, Grafana, CloudWatch, ELK).
- Experience supporting AI/ML pipelines (MLflow, Kubeflow, SageMaker, Vertex AI).
- Knowledge of security best practices & compliance frameworks.
- Excellent communication and documentation skills.
Preferred Qualifications
- Experience with GPU workloads and ML orchestration.
- Knowledge of event-driven or serverless architectures (Lambda, Cloud Run).
- Exposure to GitOps tools (Argo CD, Flux).
- Background in AIOps (Dynatrace Davis, New Relic AI).
- AWS DevOps Engineer Professional or equivalent certification.
Success Indicators
- 90%+ reduction in manual deployment tasks within 6 months.
- Fully standardized infrastructure documentation across all environments.
- Reduced downtime and faster incident resolution through observability.
- Improved developer velocity and increased deployment frequency.
- Adoption of AI-assisted automation and predictive monitoring.