Ativa os alertas de emprego por e-mail!

Senior Software Engineer, Platform Reliability Operations - 100% Remote

Georgia IT, Inc.

Teletrabalho

BRL 120.000 - 160.000

Tempo integral

Hoje

Torna-te num dos primeiros candidatos

Cria um currículo personalizado em poucos minutos

Consegue uma entrevista e ganha mais. Sabe mais

Resumo da oferta

A leading IT solutions provider is seeking a Senior Software Engineer for Platform Reliability Operations. This fully remote position requires 5+ years of software development experience, solid skills in Golang and JavaScript, and expertise in building service-oriented APIs and microservices. The role involves analyzing and improving system designs, collaborating with teams, and responding to incidents. Ideal candidates will possess a deep understanding of distributed systems and experience in high-demand environments.

Qualificações

5+ years' experience in software development.
Solid engineering and coding skills.
Experience in building service-oriented APIs.
Technical hands-on server software experience.
Understanding of large-scale distributed systems.

Responsabilidades

Analyze and improve system design to reduce failure modes.
Establish and maintain robust observability systems.
Shape architecture and designs for reliability.
Collaborate with engineers on SLAs and SLOs.
Respond to incidents and solve critical issues.

Conhecimentos

Software development

Golang

JavaScript

APIs and cloud services

Microservices

Linux environment

Distributed systems

TCP / IP stack

Formação académica

Degree in Computer Science or equivalent

Ferramentas

Kubernetes

Terraform

Datadog

Senior Software Engineer, Platform Reliability Operations – 100% Remote Location : Brazil / Mexico

Start date - ASAP

12 Months plus Contract

Responsibilities

Analyze and improve system design to reduce failure modes and promote self-healing systems
Establish and maintain robust systems that facilitate observability, encompassing logging, monitoring, distributed tracing, alerting, and offline test tools.
Work with development partners to shape the architecture, design, and implementations of new and existing systems to enhance their reliability, performance, efficiency, and scalability
Ability to work both independently as well as part of a geographically dispersed yet integrated team.
Collaborate with service engineers to establish Service Level Agreements (SLAs) and Service Level Objectives (SLOs) for backend services.
Being able to identify the indications or cues that demonstrate the effectiveness of an application and having the knowledge to improve or repair its performance
Ability to assess options and suggest solutions when there is limited or unclear information. This position requires a level of comfort and assurance in dealing with uncertain situations.
Ability to work seamlessly within a team as well as manage individual tasks
Respond to emerging incidents, solve critical issues, and follow through with a plan for resolution or future mitigation
Act as an SME on the Engineering Operations team, partnering with backend services teams and application teams to overcome challenges across all the platforms where we stream our service

Required Skills

5+ years' experience in software development
Degree in Computer Science or related or equivalent work experience
You have solid engineering and coding skills, data structure knowledge, and the ability to write high-performance production-quality code.
Experience building service-oriented APIs and cloud services
Experience designing, implementing, and deploying microservices
Extremely technical hands-on server software experience
Proficient in Golang, and JavaScript, and quick to learn new languages.
Experience in the Linux environment and a good understanding of its fundamentals and internals : filesystems and modern memory management, threads, and processes, the user / kernel-space divide, etc.
A good understanding of large-scale distributed systems in practice, including multi-tier architectures, application security, monitoring, and storage systems.
Working knowledge of the TCP / IP stack, internet routing, and load balancing.
Grit, drive, and a deep feeling of ownership.

Bonus Points for Experience with the following

Golang
Typescript
Kubernetes
Terraform
Open telemetry
eBPF
Datadog
Helm Charts

HLS video transcoding, distribution & playback

Experience designing, implementing, and running services in high demand high-traffic environments
Experience with high-availability services

Obtém a tua avaliação gratuita e confidencial do currículo.

ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.

Principais localizações

Melhores empresas

Principais cargos