System Reliability Engineer
We are seeking a System Reliability Engineer to ensure the reliability, scalability, and performance of enterprise systems and services. This role bridges software development and IT operations by applying engineering principles to operations, driving automation, and fostering a culture of resilience and observability. The ideal candidate will have hands-on experience in monitoring front-end applications, automating operational tasks, and optimizing system performance.
Key Skills: SRE Principles, Front-End Performance Monitoring, Dynatrace/APM Tools, Automation & Scripting (Python/Bash), Cloud Technologies (AWS/Azure), CI/CD Pipelines, Observability Practices, Docker/Kubernetes, Financial Compliance Standards
What you'll do:
- Ensure System Reliability & Availability: Monitor application performance, identify issues, and collaborate with developers to implement durable fixes.
- Incident Management & Root Cause Analysis: Act as a Subject Matter Expert during incidents, analyse root causes, and support post-mortem reviews.
- Automation & Tooling: Automate monitoring, alerts, and recovery processes; build scripts to eliminate manual tasks.
- Monitoring & Observability: Implement telemetry practices using tools like Dynatrace; design dashboards for system health tracking.
- Security & Compliance: Ensure systems meet regulatory standards (e.g., PCI-DSS) by implementing access controls and encryption.
- Capacity Planning & Optimization: Analyse usage trends to forecast demand and optimize cost-performance across infrastructure.
- Documentation & Knowledge Sharing: Maintain operational documentation and champion SRE principles across teams.
What you bring:
- Minimum 3 years in SRE or DevOps
- Proficiency with APM tools like Dynatrace, New Relic, or Datadog.
- Hands-on experience with Real User Monitoring (RUM), Synthetic Monitoring, and distributed tracing (OpenTelemetry).
- Strong scripting skills in Python, Bash, or JavaScript.
- Familiarity with CI/CD pipelines (e.g., GitHub Flow) and cloud platforms (AWS/Azure).
- Knowledge of Docker/Kubernetes and secure coding practices for front-end applications.
- Awareness of financial compliance standards such as PCI-DSS.
What sets this company apart:
Our client is a leading financial institution with a rich history and a strong presence in the industry. They prioritize innovation and technology, offering exciting opportunities for IT professionals. Join their dynamic team and be part of shaping the future of financial technology through cutting-edge solutions. They invest in employee growth and provide a supportive work culture. With a competitive compensation package and a commitment to corporate social responsibility, this organization offers a rewarding and impactful career opportunity.
What's next:
Ready for a career that skyrockets your professional development? Apply now!
Apply today by sending your latest CV to Eugene.Lim@robertwalters.com.my !
Do note that we will only be in touch if your application is shortlisted.
Agensi Pekerjaan Robert Walters Sdn Bhd
Business Registration Number : 729828-T
Licence Number : JTKSM 423C
About the job
Contract Type: Perm
Specialism: Tech & Transformation
Focus: Cloud and DevOps
Industry: IT
Salary: MYR12,000 - MYR16,000 per month + Great Benefits
Workplace Type: Hybrid
Experience Level: Mid Management
Location: Kuala Lumpur
FULL_TIMEJob Reference: 7YLBVK-8AC3CD1B
Date posted: 6 April 2026
Consultant: Eugene Lim
kuala-lumpur tech-transformation/cloud-and-devops 2026-04-06 2026-06-05 it Kuala Lumpur MY MYR 12000 16000 16000 MONTH Robert Walters https://www.robertwalters.com.my https://www.robertwalters.com.my/content/dam/robert-walters/global/images/logos/web-logos/square-logo.png true