Posted 17 June, 2026
Staff Software Engineer (DevOps)
Kaseya Careers
Pune, India
Full Time
Reference: 102_699653_5977499004
About the Role:
- Kaseya's SaaS Backup products protect M365, Google Workspace, and Salesforce data for hundreds of thousands of businesses globally.
- We are building the next generation of our backup storage platform - a unified, cloud agnostic system built to operate at massive scale across trillions of objects and petabytes of data.
- As a Staff Engineer, you will own one or more platform services end to end: design, implementation, testing, and production readiness. You work with high autonomy on well-scoped problems and are the go-to technical owner for your domain.
What You Will Do:
- Own the full lifecycle of a platform service - from design through production deployment and on call support
- Build reliable, well tested backend services in Go with a strong focus on correctness and operational simplicity
- Design and implement integrations with SaaS APIs (M365, Google Workspace, Salesforce) handling rate limits, delta sync checkpointing, and failure recovery
- Collaborate closely with principal and senior engineers on cross service interfaces and data models
- Write high quality code that sets the standard for the engineers around you
- Participate in design reviews and contribute to architectural decisions within your domain
Required Skills:
- 8-12 years of DevOps / SRE experience, with at least 3 years managing large-scale infrastructure
- Kubernetes - cluster operations, resource management, custom controllers, multi-tenant workload isolation
- Infrastructure as Code - Terraform or Pulumi at production scale; versioned, modular, reusable
- CI/CD pipeline ownership - designing and maintaining pipelines (GitHub Actions, Jenkins, ArgoCD or equivalent)
- Observability stack - metrics, logs, and traces in production (Prometheus, Grafana, Datadog or equivalent); defining SLOs/SLAs, not just dashboards
- Incident management at scale - structured on-call, alert triage, runbooks, post-mortems; experience reducing alert noise (1K+ alerts/month environment)
- Networking fundamentals - DNS, load balancing, firewalls, VPC/overlay networks in hybrid environments
- Security & compliance mindset - secrets management (Vault), RBAC, image scanning, audit logging; critical for a backup product handling customer data
- Scripting proficiency - Go or Python for automation; shell scripting for ops tooling
- Linux systems depth - performance tuning, kernel parameters, storage I/O, process management at scale
Desired Skills:
- OpenStack operations - managing Nova, Swift, Neutron, Cinder at scale
- Multi-cloud abstraction - managing workloads across AWS, GCP, Azure and private cloud with consistent tooling
- Large-scale infrastructure (5K+ nodes) - capacity planning, hardware lifecycle, rack-level failure domains
- Cost optimization / FinOps - cloud spend analysis, rightsizing, storage tiering strategies
- Chaos engineering - fault injection, game days, resilience testing (Chaos Monkey, Litmus)
- Bare metal provisioning - PXE boot, IPMI, automated OS provisioning at scale (Ironic, MaaS)
- Backup/DR domain awareness - understanding RPO/RTO, storage replication, data protection pipelines
- Go proficiency - reading and debugging Go services, contributing to internal tooling
Why Join Kaseya:
- High ownership over a real production service from day one
- Greenfield platform with strong technical leadership and a clear roadmap
- Small team where your contributions are visible and impactful
- Competitive salary and benefits