As an SRE at Rakuten Symphony, you will play a pivotal role in ensuring the stability, scalability, and reliability of Symcloud production deployments.
Your responsibilities will include supporting the Symcloud Kubernetes platform and Symcloud orchestration products, and ensuring the overall high availability of the platform.
Require a minimum of 3 years of hands-on experience in overseeing and managing Kubernetes clusters.
Minimum of 4 years of expertise in Linux networking and administration, with proficiency in Red Hat, Rocky, or CentOS environments
Require a minimum of 3 years of experience in providing platform support, demonstrating proficiency in maintaining and troubleshooting platform-related issues.
Minimum 3 years of experience in shell scripting or Python, showcasing proficiency in scripting languages to automate tasks and enhance operational efficiency.
Desirable skillsets include CKA Certification, expertise in Docker, proficiency in Ansible, experience with virtualization technologies (KVM), familiarity with storage technologies (e.g., NFS, block storage, SAN, hyper-converged storage), expertise in solution architecture, experience in observability tools such as Grafana, Prometheus & ELK and a working knowledge of managed Kubernetes or private cloud deployments.
Job Responsibilities
Implement & Ensuring platform capabilities and containerization plan using Symcloud Kubernetes platform, Docker & service mesh
Proactively identify and address issues to meet service level agreements (SLA), ensuring the continuous reliability and performance of the system
Collaborate closely with the engineering team to troubleshoot and resolve issues, leveraging strong problem-solving skills and technical expertise.
Develop comprehensive runbooks to streamline and document operational procedures, ensuring effective and standardized processes for the team
Serve as member of platform team to closely work with application teams to ensure
Act as a key member of the platform team, collaborating closely with application teams to ensure uptime, seamless integration and optimal performance of the overall system.
Optimize Symcloud platform infrastructure for high availability and scalability.
Create tools using shell scripting or Python to empower L2 Engineers, enhancing efficiency and streamlining processes within the team.
Collaborate within a global team spanning Japan, Europe, and North America, operating on a shift basis to ensure around-the-clock support and seamless coordination across different time zones.
Location
Bengaluru, Karnataka, India
About Company
We are India's Leading Recruitment Group, providing best-in-breed hiring solutions to our customers for hiring top-notch technical & Non-technical talent.