Job Responsibilities:
Primary responsibilities include monitoring, troubleshooting/ correcting and ensuring 100% uptime of our cloud hosted application. Will also be responsible for operational tasks like bringing up new environments, deployment, handling authorization requests, cloud best practices, backup/DR etc.
The engineer will have to understand and debug/ correct the technology stack used – Elastic Search, Apache big data stack including Solr, Kafka, Zookeeper, Storm, Hbase, Hadoop, Spark and other infra components like Netty/ NGINX, Docker/ Docker swarm etc.
The engineer will need to have public cloud knowledge and experience on:
AWS: VPC, EC2, Load balancers, Auto scaling, EBS, Kinesis, S3, Lambda, CFT, CloudWatch
Will be working different shifts in a 24/7 roster plan and tracking against SLAs for different operations
Requirements:
Technical skills Mandatory
Cloud: Cloud platform knowledge on Azure and AWS (Cloud formation scripting / Azure resource manage scripting etc.)
OS: Linux
Scripting: Python, Bash or Ruby
Technology: ELK stack, Elastic, Hbase, Hadoop, Kafka/Zookeeper, Netty/ Nginx, Spark
(Two or more of these technologies)
Nice to have:
Monitoring: Any monitoring tool like Zabbix, Nagios, Sensu etc.
Container Orchestration: Docker or Kubernetes knowledge
Programming skills will be added advantage
Experience/ Background: 3-6 years, should have worked in a cloud support environment, should have current certifications in 2 or more of the related technologies/ stack
Qualification: BE/ BTech/ MCA