Responsibilities and Qualifications
- Bachelor’s or master’s degree in computer science or computer engineering.
- 3-4 years of experience in DevOps.
- Passionate about Continuous build, integration, test, and delivery of systems.
- Good understanding of distributed systems, APIs, microservices, and cloud computing.
- Experience in implementing applications in private, and public cloud infrastructure and container technologies such as Kubernetes.
- Kubernetes experience with public clouds like AWS, and GCP platforms through migrations, scaling, and day-to-day operations.
- Must have a working knowledge of AWS services like VPC, EC2, EKS, S3, IAM, etc.
- Knowledge of source control management such as Git, GitHub, and GitLab.
- Hands-on experience with logging tools.
- Experience working with network load balancers (Ngnix, Netscaler).
- Solid grasp of API gateways like KONG API, Kubernetes, Postgresql, NoSQL databases, and Kafka.
- Built S3 buckets and managed policies for S3 buckets and used S3 bucket and Glacier for storage and backup on AWS.
- Experience in responding to production incidents and taking on-call responsibilities.
- Willing to work on multiple cloud providers as per the demand and design of applications.
- Hands-on experience in owning and operating mission-critical, large-scale product operations like provisioning, deployment, upgrades, patching, and incidents in Production on the cloud.
- Should ensure high availability and scalability of Production systems by working with engineering wherever required.
- Continuously raise the standard of engineering excellence by implementing best DevOps practices.
- Quick learner and must know when to listen, and when to take charge.
- You will be exposed to a variety of challenges supporting our infrastructure development team.
- Developing and implementing new tools to streamline manual operations and processes.
- Develop and maintain CI/CD pipeline systems for application development teams.
- Prioritizing production-related issues along with other operational team members.
- Conduct root cause analysis, resolve, and implement long-term fixes.
- Expand the capacity and improve the performance of current operational systems.