Enable job alerts via email!

Senior Engineer - Site Reliability

Presight

Abu Dhabi

On-site

AED 120,000 - 180,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a meticulous Senior Engineer in Site Reliability to enhance their delivery model. This role involves managing cloud infrastructure, deploying updates, and ensuring system health through proactive monitoring. The ideal candidate will possess extensive experience with Kubernetes, observability platforms, and a strong analytical mindset. Join a diverse and inclusive environment that fosters personal growth and innovative projects, and make a significant impact in the AI-driven analytics landscape. If you are a performance-driven individual eager to tackle challenges and collaborate with stakeholders, this opportunity is perfect for you.

Benefits

Healthcare
Education Support
Leave Benefits
Training Programs
Innovative Projects

Qualifications

  • 5+ years of experience in managing Kubernetes clusters and observability platforms.
  • Strong analytical mind with excellent problem-solving skills.

Responsibilities

  • Manage infrastructure for cloud-deployed solutions and ensure system health.
  • Drive scope definition and architecture for CI/CD services.

Skills

Kubernetes Management
Observability Platforms Configuration
Linux Knowledge
Message Queues Understanding
Elasticsearch (ELK Stack)
Network Concepts
Python Programming
Problem Solving
Analytical Skills
Communication Skills

Job description

Overview

Role: Senior Engineer – Site Reliability

Location: Abu Dhabi

About Presight

Presight, an ADX-listed public company limited by shares whose majority shareholder is Abu Dhabi company G42, is the region’s leading big data analytics company powered by Artificial Intelligence (“AI”). It combines big data, analytics, and AI expertise to serve every sector, of every scale, to create business and positive societal impact. With its world-class computer vision, AI and omni-analytics platform as its engine, Presight leverages all-source data to support insight-driven decision making that shapes policy and creates safer, healthier, happier, and more sustainable societies.

The Opportunity:

Seeking a meticulous and expert Senior Engineer - Site Reliability to build and support the Presight delivery model that empowers product & technology teams to develop & deliver high-quality products, improve platform infrastructure and strengthen the reliability of products and solutions. You play a key role in defining & establishing the delivery model deployed in the development of cutting edge, next-gen analytics solutions & services at Presight.

Responsibilities

As a Senior Engineer – Site Reliability, you will be responsible for working with relevant stakeholders to drive scope definition, specification, and architecture for all services in the CI/CD pipeline that power the Presight delivery model.

  • Managing the infrastructure required to run our solutions deployed to public or private cloud (air-gapped)
  • Deploying application updates
  • Ensuring the health of the environment by monitoring technical and business metrics, setting up alerts for things going wrong, acting proactively to prevent disasters
  • Ensuring emergency events can be responded to, quickly and precisely
  • Enabling the engineering team to execute the roadmap addressing roadblocks as needed
  • Identifying, evaluating, and conducting proof-of-concepts for new technologies
  • Contributing to the knowledge base
  • Comply with QHSE (Quality Health Safety and Environment), Business Continuity, Information Security, Privacy, Risk, Compliance Management and Governance of Organizations policies, procedures, plans and related risk assessments.
Qualifications
  • 5+ years of experience in managing Kubernetes clusters
  • 5+ years experience in configuring/tuning observability platforms (preferably Prometheus; any of the following is also relevant: Datadog / Splunk / NewRelic / CloudWatch)
  • Very good Linux knowledge
  • Good understanding of message queues (preferably RabbitMQ; any of the following is also relevant: ActiveMQ / Kafka / SNS / SQS)
  • Experience using Elasticsearch (ELK stack)
  • Good understanding of network concepts
  • Basic knowledge of at least one programming language (preferably Python)
  • A highly detail-oriented and methodical approach to problem solving.
  • A passion for technology, troubleshooting and customer service.
  • A strongly analytical mind.
  • Great verbal and written communication skills

What we look for: If you are a performance-driven, inquisitive mind with the agility to adapt to ambiguity, you will fit right in. You should be eager to explore opportunities to build meaningful collaborations with stakeholders and aspire to create unique customer-centric solutions. Bias for action and a passion to conquer new frontiers in the AI space is at the heart of the Presight community.

What working at Presight offers:

Culture: An open, diverse and inclusive environment with a global vision that encourages personal growth and focuses on ground-breaking, industry-first innovations.

Career: Outstanding learning, development & growth opportunities via structured training programs and innovative, high-tech projects.

Rewards: A competitive remuneration package with a host of perks including healthcare, education support, leave benefits and more.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.