Detailed understanding of application flows and dependent systems
Proactive monitoring of application and server health including availability, performance and capacity constraints to mitigate potential incidents before they impact the users
Provide primary operational support for multiple large, distributed software applications
Provide primary test data management support for multiple large, distributed software applications
Work with several technology partners to troubleshoot and drive problem resolution utilising the available tools
Lead root cause analysis (RCA) effectively and escalate as part of the incident management process.
Debug programs in Go to troubleshoot any ongoing issue.
Job Responsibilities
Good communication, collaboration and partnering skills with peer teams across different locations
Self-motivated with a strong sense of urgency and dedication to deadlines
A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
Direct application monitoring and work towards implementing automated monitoring scripts
Strong technical troubleshooting and analytical skills with the ability to resolve application issues in a preproduction environment
Ability to troubleshoot code (structured and object-oriented) in one or more high-level languages, such as Python, Java, or Golang
Experience with debugging techniques for root cause analysis of issues.
Experience with identifying application and infrastructure risks, defining mitigation strategies, and working with a team to ensure risks are mitigated.
Experience with infrastructure as code, with Kubernetes and Kubernetes satellite technologies
Experience with monitoring large-scale distributed systems with Splunk, Prometheus and Grafana, or similar log analysis and querying tools; writing queries, building dashboards, configuring alerts, and producing reports is a plus
Analytical knowledge of and exposure to troubleshooting and root cause identification using analysis tools such as Splunk, ELK, and Dynatrace
Experience with using command line tools
Experience with CI as code in a GitHub environment
Experience with containerization
Exposure to databases such as Cassandra, Postgres, and Redis, and to message queues such as Solace and Kafka
Experience with System/Functional/Performance Testing
Location
Bengaluru, Karnataka, India
About Company
HuQuo (derived from Human Quotient) is the next stage in the evolution of HR. In an industry thirsty for innovation, HuQuo offers a fresh perspective on people and organizations. At HuQuo, we do not look at human beings as mere resources, but as an ocean of untapped potential. Human Quotient goes beyond IQ and EQ. It’s the holistic representation of all human potential. We strategically manage this Human Quotient. We believe that every organization has a human-like individuality that can nurture the individuals who resonate with it and help them flourish. Our endeavor is to bring such high-resonance individuals and organizations together. We will achieve this through extensive use of analytics and deep immersion to understand the core of the organizations and people we work with.