Site Reliability Engineer

Our client is seeking a Site Reliability Engineer (SRE) to play a pivotal role in maintaining the reliability and performance of their critical services.

This role offers an exciting opportunity to bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. The successful candidate will be instrumental in driving reliability improvements and fostering a culture of continuous learning and accountability.

* Key role in maintaining reliability and performance of critical services

* Opportunity to bridge the gap between development and operations

* Drive reliability improvements and foster a culture of continuous learning

* This is a home-based role and is open to expatriate applications

What you'll do:

As a Site Reliability Engineer (SRE), your primary focus will be on maintaining the reliability and performance of our client's critical services. You will have the opportunity to design resilient system architectures that support high availability and scalability. Your expertise in developing automation tools will be crucial in enhancing operational efficiency. You will also be responsible for defining, tracking, and analysing SLOs and SLIs, ensuring that they meet business needs. Your ability to conduct thorough post-mortem analyses following incidents will drive continuous improvement.

* Design and implement resilient system architectures that support high availability and scalability.

* Develop automation tools and scripts to enhance operational efficiency.

* Define, track, and analyse Service Level Objectives (SLOs) and Service Level Indicators (SLIs).

* Conduct thorough post-mortem analyses following incidents.

* Collaborate with diverse teams to establish best practices in system reliability.

* Troubleshoot issues related to database performance, network connectivity, and deployment failures.

* Ensure issues are resolved within stipulated Service Level Agreements (SLAs).

* Identify performance bottlenecks across systems, providing actionable recommendations for enhancements.

What you bring:

The ideal Site Reliability Engineer (SRE) candidate brings proficiency in programming languages such as Python, Golang, or Java. They have demonstrated experience in system architecture design and a strong understanding of SRE principles, including SLOs and SLIs. Their experience extends to working with cloud environments such as AWS, Azure, or Google Cloud. They possess expertise in Linux system administration, along with proven experience troubleshooting application support issues, with a focus on performance and connectivity.

* Proficiency in programming languages such as Python, Golang, or Java.

* Demonstrated experience in system architecture design.

* Strong understanding of SRE principles, including SLOs and SLIs.

* Experience with cloud environments such as AWS, Azure, or Google Cloud.

* Expertise in Linux system administration.

* Proven experience troubleshooting application support issues, with a focus on performance and connectivity.

* Familiarity with networking concepts and effective troubleshooting techniques.

* Excellent problem-solving abilities.

What sets this company apart:

Our client is committed to fostering an environment where continuous learning and accountability are at the forefront. They value the importance of bridging the gap between development and operations, ensuring a robust, scalable, and responsive infrastructure. This is an opportunity to join a team that values your expertise and encourages growth and development in your career.

What's next:

Ready to take the next step in your career as a Site Reliability Engineer? Don't miss this exciting opportunity!

Apply today by clicking on the link or emailing me at tenghong.khoo@robertwalters.com.my to discuss this new opportunity. We look forward to receiving your application!

Please note that we will be in touch only if your application is shortlisted.

Agensi Pekerjaan Robert Walters Sdn Bhd
Business Registration Number: 729828-T
Licence Number: JTKSM 423C

Contract Type: Full-time

Specialism: Banking & Financial Services

Focus: Finance & Accounting

Industry: Financial Services

Salary: MYR168,000 - MYR240,000 per annum

Workplace Type: Remote

Experience Level: Mid Management

Location: Penang

Job Reference: HHNDNP-93CADB4C

Date posted: 26 November 2024

Consultant: TengHong Khoo
