Enable job alerts via email!

DevOps Engineer

Asia Digital Engineering (ADE)

Sepang

On-site

MYR 50,000 - 90,000

Full time

9 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative company is seeking a skilled Site Reliability Engineer to design and maintain robust cloud infrastructure. In this pivotal role, you will implement best practices for monitoring, incident response, and CI/CD pipelines, ensuring optimal performance and security. You will collaborate with development teams to enhance application reliability while managing incidents and driving improvements. If you are passionate about cloud technologies and thrive in a dynamic environment, this opportunity offers a chance to make a significant impact in a forward-thinking organization.

Qualifications

  • Proven experience in Site Reliability Engineering or DevOps roles.
  • Strong knowledge of cloud platforms and cloud-native technologies.

Responsibilities

  • Design and maintain scalable infrastructure using cloud technologies.
  • Collaborate with teams to enhance application reliability and performance.

Skills

Cloud Technologies
Scripting and Automation
Site Reliability Engineering
DevOps Practices
Networking Concepts
Problem-Solving Skills
Communication Skills
Microservices Architecture

Education

Bachelor's Degree in Computer Science
Equivalent Work Experience

Tools

GCP
Terraform
Ansible
Docker
Kubernetes
Prometheus
Grafana
Datadog

Job description

What you will do:

  • Design, implement, and maintain scalable, reliable, and secure infrastructure using cloud technologies, currently GCP.
  • Develop and automate monitoring, alerting, and incident response processes to ensure the highest service availability level.
  • Collaborate with development teams to enhance the reliability and performance of applications through best practices and automation.
  • Manage and resolve software development incidents or system failures by performing root cause analysis, implementing timely fixes, corrective measures, and conducting post-mortems to prevent future occurrences.
  • Develop and maintain comprehensive documentation for infrastructure, processes, and procedures.
  • Participate in on-call rotations to provide 24/7 support for critical systems and respond to incidents promptly.
  • Continuously improve system observability and monitoring using tools such as Prometheus, Grafana, Datadog, etc.
  • Implement and manage CI/CD pipelines to streamline the deployment process and ensure rapid, reliable software releases.
  • Drive initiatives to optimize the cost, performance, and security of the infrastructure.
  • Stay up-to-date with industry trends and best practices in site reliability engineering and cloud technologies.

Your experience and skills

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent work experience.
  • Proven experience as a Site Reliability Engineer, DevOps Engineer, or similar role.
  • Strong knowledge of cloud platforms (AWS, GCP, Azure) and cloud-native technologies.
  • Proficiency in scripting and automation using languages such as JavaScript, NodeJS, Python, Go, Bash, or similar.
  • Experience with configuration management tools (Terraform, Ansible, Chef, Puppet).
  • Solid understanding of networking concepts and protocols.
  • Familiarity with containerization technologies (Docker, Kubernetes).
  • Experience with monitoring and observability tools (Prometheus, Grafana, Datadog, ELK stack).
  • Strong problem-solving skills and the ability to troubleshoot complex issues in a distributed system.
  • Excellent communication and collaboration skills.
  • Ability to work in a fast-paced, dynamic environment and manage multiple priorities.
  • Experience with microservices architecture and related technologies.
  • Knowledge of database administration and optimization (SQL, NoSQL).
  • Familiarity with security best practices and compliance standards.
  • Contributions to open-source projects or active participation in the SRE community.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.