Enable job alerts via email!

Senior Site Reliability Engineer

HALA

Saudi Arabia

Hybrid

SAR 60,000 - 100,000

Full time

2 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a forward-thinking fintech company as a Senior Site Reliability Engineer, where you will play a crucial role in ensuring the reliability and performance of high-traffic applications. With a focus on innovation and collaboration, you will design scalable infrastructure, automate operations, and enhance system resilience. This role offers the opportunity to work with cutting-edge technologies in a diverse team, fostering personal growth and professional development. If you are passionate about optimizing complex systems and thrive in a dynamic environment, this is the perfect opportunity for you.

Benefits

Inclusive and diverse culture
Highly competitive compensation
Personal development opportunities
Flexible work setups
Annual learning stipend
Mentoring and growth opportunities

Qualifications

  • 5+ years of experience in site reliability engineering and managing complex systems.
  • Proficient in automation and incident management.

Responsibilities

  • Design and implement scalable infrastructure for high availability of applications.
  • Collaborate with teams to enhance software reliability and resilience.

Skills

Linux Management
Python Scripting
Bash Scripting
Cloud Platforms (GCP, OCI, Alibaba)
CI/CD Tools
Kubernetes
Docker
Monitoring Tools (Prometheus, Grafana)

Education

Bachelor's Degree in Information Technology
Relevant Certifications (CKA, Terraform Associate)

Tools

Terraform
Ansible
GitLab CI/CD
Jenkins
Patroni Postgres
MongoDB
SQL/NoSQL Databases
ELK Stack

Job description

Who Are We

HALA is a leading fintech player in the MENAP region that aims to redefine financial services and build the future bank of SMEs. HALA empowers SMEs to start, run, and grow their businesses by providing cutting-edge financial and technological tools.

HALA has multiple entities in the UAE, Saudi Arabia, and Egypt, including HALA Payments, HALA Cashier, and HALA Logistics. The company offers solutions enabling merchants to digitize payments and manage sales and operations.

Founded in 2017, HALA is licensed by the Saudi Arabian Central Bank and the Financial Services Regulatory Authority (FSRA) in Abu Dhabi Global Market.

Job Summary

We are seeking a result-oriented Senior Site Reliability Engineer with 5+ years of experience. The role involves maintaining and enhancing the reliability, scalability, and performance of complex distributed systems. The candidate should be proficient in industry-leading tools and technologies to optimize infrastructure and automate operations, with strong incident management, monitoring, and deployment automation skills. Excellent problem-solving and collaboration abilities are essential to ensure seamless operations for high-traffic FinTech applications.

Key Responsibilities
  • Design and implement scalable, reliable infrastructure ensuring high availability and optimal performance of HALA applications.
  • Collaborate with development teams to embed reliability and resilience into the software development lifecycle.
  • Conduct post-incident reviews and root cause analyses to identify improvements and prevent future failures.
  • Implement monitoring and alerting systems to proactively detect and resolve issues.
  • Automate routine tasks to improve operational efficiency and reduce manual interventions.
  • Participate in on-call rotations to provide 24/7 support and rapid incident resolution.
  • Ensure compliance with industry standards and best practices for system reliability and security.
Skills & Experience
  • Deep expertise in managing Linux-based systems.
  • Proficiency in Python, Bash, and Shell scripting for automation.
  • Experience with Patroni Postgres, MongoDB, SQL/NoSQL databases.
  • Hands-on experience with cloud platforms such as GCP, OCI, or Alibaba Cloud.
  • Strong knowledge of observability and monitoring tools like Prometheus, Grafana, ELK stack.
  • Experience with CI/CD tools and Infrastructure as Code (Terraform, Ansible, GitLab CI/CD, Jenkins).
  • Understanding of Kubernetes, Docker, and security best practices.
Education & Certifications

Bachelor's degree in information technology or a related field. Relevant certifications such as CKA, Terraform Associate, GCP/AWS/OCI certifications are a plus.

What We Offer
  • An inclusive, diverse culture encouraging innovation and flexibility in remote, in-office, and hybrid work setups.
  • Highly competitive compensation packages, including potential shares.
  • Focus on personal development with regular training and an annual learning stipend.
  • Opportunity to work with a talented team from over 30 nationalities across 7 countries.
  • Autonomy, mentoring, and challenging goals fostering growth for you and the company.
  • Responsibility and trust to enable you to deliver your best work.

If you believe you have what it takes to join our remarkable team, #apply_now.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.