Description
Must Haves:
- Bachelor’s or Master’s degree in Computer Science, Information Technology, Engineering, or a closely related field, or equivalent practical experience.
- 4+ years of DevOps experience with hands-on expertise in designing, implementing, and operating secure, resilient AWS infrastructure using AWS services (EKS, ECS, Lambda, S3, RDS, Route53, VPC, CloudFormation, EC2, SNS, Secrets Manager, etc.
- Strong operational experience with Kubernetes (cluster management, networking, storage, upgrades).
- Experience implementing CI/CD pipelines using GitHub Actions or equivalent tooling.
- Proficiency with shell scripting and experience in at least one programming language (e.g., Python, Go, Java).
- Demonstrated experience supporting deployments for scalable production systems.
- Solid understanding of computer networking fundamentals — DNS, domains, firewalls, SSL/TLS certificate management, and encryption.
- Experience working in an Agile development environment.
- Strong documentation and communication skills; ability to produce clear runbooks and release notes.
- Proficiency with Terraform and Ansible for automated deployments
Additional Guidelines:
- This is work from office role in Noida/ Pune. You will have to relocate in your own expense if you are willing to relocate.
Vacancy details- Role + Responsibilities
Designation: DevOps Engineer
Experience required- 4-8 years.
Reporting To: Individual contributor role , reporting manager is in USA
Vacancy: 1
Interview Processes- 3 technical (company + Client) + 1 HR
Role Overview:
We are seeking an experienced DevOps Engineer to join our engineering organization to design, operate, and scale the infrastructure that supports our production services. The successful candidate will combine strong hands-on AWS and Kubernetes experience with a rigorous approach to automation, reliability, security, and operational excellence in a fast-paced, agile environment.
Key Responsibilities
- Ensure production services are highly available and reliable, with proactive monitoring and incident management to support 24×7 operations.
- Design, implement, and operate secure, resilient AWS infrastructure using EKS, ECS, Lambda, S3, RDS, Route 53, VPC, EC2, CloudFormation, SNS, and Secrets Manager.
- Operate and optimize Kubernetes clusters and containerized workloads; implement best practices for scaling, resource utilization, and upgrades.
- Build, maintain, and improve CI/CD pipelines (preferably GitHub Actions) to automate builds, tests, and deployments with safe rollback strategies.
- Own deployment activities for microservices and distributed applications; collaborate with engineering teams on release coordination and deployment automation.
- Drive infrastructure automation, configuration-as-code, and IaC best practices; maintain and evolve CloudFormation templates and related tooling.
- Implement and maintain observability: logging, metrics, alerting, and dashboards to support SLO/SLI objectives.
- Conduct vulnerability assessments and coordinate remediation with engineering and security teams; participate in incident response and post-incident reviews.
- Enforce and administer access controls; handle onboarding/offboarding and secret management with least-privilege principles.
- Produce and maintain runbooks, operational procedures, and release documentation.
- Support security and compliance initiatives; participation in ISO 20000 or SOC 2 efforts is an advantage.
- Mentor and assist new team members on operational practices, access processes, and onboarding.
Required Qualifications (Must-have)
- Bachelor’s or Master’s degree in Computer Science, Information Technology, Engineering, or a closely related field, or equivalent practical experience.
- 4+ years of DevOps experience with hands-on expertise in designing, implementing, and operating secure, resilient AWS infrastructure using AWS services (EKS, ECS, Lambda, S3, RDS, Route53, VPC, CloudFormation, EC2, SNS, Secrets Manager, etc.
- Strong operational experience with Kubernetes (cluster management, networking, storage, upgrades).
- Experience implementing CI/CD pipelines using GitHub Actions or equivalent tooling.
- Proficiency with shell scripting and experience in at least one programming language (e.g., Python, Go, Java).
- Demonstrated experience supporting deployments for scalable production systems.
- Solid understanding of computer networking fundamentals — DNS, domains, firewalls, SSL/TLS certificate management, and encryption.
- Experience working in an Agile development environment.
- Strong documentation and communication skills; ability to produce clear runbooks and release notes.
- Proficiency with Terraform and Ansible for automated deployments
Preferred Qualifications (Good-to-have)
- Well-versed in AI-assisted developer tools (for example: GitHub Copilot, ChatGPT, and similar platforms) and experienced using these tools to accelerate code generation, create infrastructure templates, author runbooks, automate repetitive tasks, and support faster troubleshooting.
- Awareness of DDoS mitigation strategies and network protection controls.
- Experience with vulnerability management programs and formal incident management processes.
- Operational experience with Elasticsearch at scale.
- Prior work with microservice deployment patterns and distributed application environments.
- Experience participating in or implementing ISO 20000 / SOC 2 compliance activities.
- Hands-on experience managing distributed application environments at scale.
Personal Attributes
- Strong ownership mentality and pragmatic problem-solver.
- Detail-oriented with a bias for automation and repeatability.
- Effective collaborator who can communicate operational and security concerns to engineering and business stakeholders.
Comfortable working under pressure and participating in on-call rotations
