Research, design, develop and implement solutions to improve operability, elastic capacity and performance, change management, business continuity, security, continuous integration and deployment of technical solutions.
Involved in all aspects of running and scaling the service, including server platforms, operating systems, automation, and overall systems management.
Working in a CI/CD agile environment with a dedicated team of engineers and technologies including AWS, Java, Jetty, Kubernetes, Hadoop, HBase, Docker, Elastic Search, Apache Traffic Server (ATS) and more.
Engage in building operational requirements, design considerations for new development efforts including service migration from staging environments to AWS cloud or on premise datacenters and building elastic capabilities in Kubernetes clusters running on either.
Designing the final integration of development efforts into the production environment.
Identifying production stability concerns via stress testing tools impacting analysis and design phases of software development lifecycle, and ability to troubleshoot performance issues within the production system technical stack down to source code and network packet level.
Ensuring the operational health of application systems 24x7 and providing tier 2 support for escalated issues and working with developers to solve code related issues.
Establishing and maintaining internal SLAs with business stakeholders.
Expertise in DevOps Production Engineer roles.
Computer Science degree (or in relevant specialization) with 3+ years of industry experience
Strong problem solving skills with experience in troubleshooting and system administration In UNIX/RHEL and AWS production environments.
Strong knowledge of TCP/IP, HTTPS, DNS, and Load balancing solutions and ability to analyze the output of tcpdump.
Understanding of Database concepts and SQL writing skills in Oracle / MySQL
Experience with capacity planning and forecasting
Strong coding and scripting automation solutions skills in one of: Bash, Perl, Python, Golang.
Experience with configuration management, CI/CD and automation tools (Ansible, Puppet or Chef, Splunk, Elastic Search, Jenkins/Screwdriver, Docker/Kubernetes).
Experience in managing Java applications at scale- Jetty, ATS that are often high-volume and low-latency.
Understanding big data technologies i.e. Hadoop, Storm, Hbase, Hive, Kafka and/or other open source technologies related to this area.
Ability to work independently and in teams with ability to drive projects end-to-end.
Excellent communication, interpersonal and teamwork skills.