An exciting Senior, Site Reliability Engineer (Cloud) job has just been made available at a top tier IT payments solution company based in Kuala Lumpur.
About the Senior, Site Reliability Engineer (Cloud) Role: In this role, you will be responsible for overseeing and ensure the constant availability and reliability of the company's services. You will also monitor system performance and implement solutions for ongoing improvement.
Oversee and ensure the constant availability and reliability of our services. Monitor system performance and implement solutions for ongoing improvement
Act as the primary responder in the event of cloud infrastructure disruptions. Efficiently identify, mitigate, and resolve issues, and conduct thorough post-incident analyses to prevent recurrence
Continuously work towards enhancing cloud infrastructure efficiency. This includes refining code, scaling infrastructure effectively and ensuring resource optimisation
Anticipate future system demands and ensure our infrastructure is prepared to meet these needs through thoughtful capacity planning and appropriate scaling measures
Establish and enforce best practices for system reliability and maintainability, including policies for version control, code reviews, and deployment processes
Set up and manage monitoring and alerting systems to promptly detect and address system health issues
Prepare for potential system failures by developing comprehensive disaster recovery plans, including backup systems and failover procedures
To succeed in this Senior, Site Reliability Engineer (Cloud) job, you must have three years experience managing cloud-based services and infrastructure.
Bachelor’s degree in Computer Science, a related field, or equivalent practical experience
Successful experience in building technical teams
Experience in system design, system architecture, distributed systems, software engineering and development
Experience in technology risk domain, focusing on system stability for more than three years
Understanding of the factors and scenarios that generate technology risks, knowledge in how to manage and prevent these risks and ability to design general technology risk solutions/systems/products, etc. through systematic abstraction
Excellent communication skills and team management experience
The scope of the offer, the size of the business, the freedom and autonomy to drive your career forward all add up to a great place to work.
If you have a successful track record in DevOps/SRE, you can take your career forward with this exciting Senior, Site Reliability Engineer (Cloud) job.
Apply today or email me at Sarah.Nunis@robertwalters.com.my to discuss this new opportunity.
Do note that we will only be in touch if your application is shortlisted.
Agensi Pekerjaan Robert Walters Sdn Bhd Business Registration Number : 729828-T Licence Number : JTKSM 423C