Clearwateranalytics

Clearwateranalytics

Staff Cloud Engineer

Role

Staff Cloud Engineer

Location

India

Job type

Full time

Posted

20 hours ago

Salary

Not disclosed by employer

Job description

Cloud Architecture & Infrastructure

  • Design, architect, and implement scalable, secure, and highly available cloud infrastructure on AWS across multi-account, multi-region environments.
  • Define and enforce cloud architecture standards, best practices, and governance policies using AWS Organizations, Control Tower, and SCPs.
  • Build and maintain Infrastructure as Code (IaC) using Terraform and AWS CloudFormation — writing reusable modules consumed across all product teams.
  • Improve and optimize cloud environments for cost, performance, and reliability — owning FinOps practices including Savings Plans, Spot strategy, and Graviton adoption.
  • Collaborate with engineering, data, and security teams to build resilient distributed systems.
  • Drive innovation and continuous improvement initiatives across the platform.
  • Design, deploy, and manage production EKS clusters at multi-tenant financial-services scale.
  • Plan and execute cluster upgrades, patching, and Kubernetes version lifecycle management with zero customer impact.
  • Build and maintain internal Helm chart libraries and GitOps-driven cluster configuration using ArgoCD or Flux.
  • Implement zero-trust network principles and enforce IAM least-privilege across all AWS accounts.
  • Drive SRE practices: define and enforce SLOs for EKS, API Gateway etc.
  • Lead incident response, postmortem analysis, and blameless RCA processes for platform-level outages.
  • Build chaos engineering exercises and disaster recovery testing across availability zones and regions.
  • Partner with software engineering teams to deliver end-to-end solutions from design through production.
  • Evaluate new AWS services and open-source tooling to continuously improve infrastructure capabilities.

Required Qualifications

  • Strong, hands-on experience with AWS cloud services: EC2, VPC, IAM, EKS, S3, CloudWatch, API Gateway, Route 53, and more.
  • Proven experience operating Amazon EKS in production: cluster lifecycle, RBAC, IRSA, node groups, and autoscaling.
  • Proficiency in Infrastructure as Code with Terraform and AWS CloudFormation.
  • Solid understanding of containerization: Docker, Kubernetes architecture, and container lifecycle management.
  • Experience with monitoring and logging tools: Prometheus, Grafana, Dynatrace, OpenSearch, ELK/Loki.
  • Strong Linux/Unix systems administration and scripting in Bash, Python, or similar.
  • Deep knowledge of cloud security best practices: IAM, RBAC, secrets management, and network security.
  • Solid networking fundamentals: VPCs, subnets, load balancing, DNS, and Kubernetes ingress controllers.
  • Ability to troubleshoot distributed systems and debug complex production issues at scale.
  • Strong problem-solving skills with the ability to drive technical decisions across teams in a fast-paced environment.

Preferred Skills

  • AWS Certifications: Solutions Architect Professional or DevOps Engineer Professional.
  • Kubernetes Certifications: CKA or CKAD.
  • Experience with Helm and GitOps tools (ArgoCD, Flux).
  • Experience with Rancher/ArgoCD or similar tools for EKS node provisioning.
  • Exposure to microservices architecture and distributed systems at scale.
  • Experience with AWS API Gateway and Lambda Authorizers for JWT/OIDC-based auth flows.
  • Background in cost optimization and performance tuning (Graviton, Spot, Savings Plans).
  • Familiarity with CIAM/identity federation: OIDC, OAuth2, SAML, Auth0 integration.
  • Understanding AI/ML infrastructure: model training pipelines, deployment on EKS, and model monitoring.
Resume ExampleCover Letter Example

Explore more

Similar jobs