Job Search and Career Advice Platform

Ativa os alertas de emprego por e-mail!

Senior Software Engineer, Platform Reliability Operations - 100% Remote

Georgia IT, Inc.

Teletrabalho

BRL 120.000 - 160.000

Tempo integral

Hoje
Torna-te num dos primeiros candidatos

Cria um currículo personalizado em poucos minutos

Consegue uma entrevista e ganha mais. Sabe mais

Resumo da oferta

A leading IT solutions provider is seeking a Senior Software Engineer for Platform Reliability Operations. This fully remote position requires 5+ years of software development experience, solid skills in Golang and JavaScript, and expertise in building service-oriented APIs and microservices. The role involves analyzing and improving system designs, collaborating with teams, and responding to incidents. Ideal candidates will possess a deep understanding of distributed systems and experience in high-demand environments.

Qualificações

  • 5+ years' experience in software development.
  • Solid engineering and coding skills.
  • Experience in building service-oriented APIs.
  • Technical hands-on server software experience.
  • Understanding of large-scale distributed systems.

Responsabilidades

  • Analyze and improve system design to reduce failure modes.
  • Establish and maintain robust observability systems.
  • Shape architecture and designs for reliability.
  • Collaborate with engineers on SLAs and SLOs.
  • Respond to incidents and solve critical issues.

Conhecimentos

Software development
Golang
JavaScript
APIs and cloud services
Microservices
Linux environment
Distributed systems
TCP / IP stack

Formação académica

Degree in Computer Science or equivalent

Ferramentas

Kubernetes
Terraform
Datadog
Descrição da oferta de emprego

Senior Software Engineer, Platform Reliability Operations – 100% Remote Location : Brazil / Mexico

Start date - ASAP

12 Months plus Contract

Responsibilities
  • Analyze and improve system design to reduce failure modes and promote self-healing systems
  • Establish and maintain robust systems that facilitate observability, encompassing logging, monitoring, distributed tracing, alerting, and offline test tools.
  • Work with development partners to shape the architecture, design, and implementations of new and existing systems to enhance their reliability, performance, efficiency, and scalability
  • Ability to work both independently as well as part of a geographically dispersed yet integrated team.
  • Collaborate with service engineers to establish Service Level Agreements (SLAs) and Service Level Objectives (SLOs) for backend services.
  • Being able to identify the indications or cues that demonstrate the effectiveness of an application and having the knowledge to improve or repair its performance
  • Ability to assess options and suggest solutions when there is limited or unclear information. This position requires a level of comfort and assurance in dealing with uncertain situations.
  • Ability to work seamlessly within a team as well as manage individual tasks
  • Respond to emerging incidents, solve critical issues, and follow through with a plan for resolution or future mitigation
  • Act as an SME on the Engineering Operations team, partnering with backend services teams and application teams to overcome challenges across all the platforms where we stream our service
Required Skills
  • 5+ years' experience in software development
  • Degree in Computer Science or related or equivalent work experience
  • You have solid engineering and coding skills, data structure knowledge, and the ability to write high-performance production-quality code.
  • Experience building service-oriented APIs and cloud services
  • Experience designing, implementing, and deploying microservices
  • Extremely technical hands-on server software experience
  • Proficient in Golang, and JavaScript, and quick to learn new languages.
  • Experience in the Linux environment and a good understanding of its fundamentals and internals : filesystems and modern memory management, threads, and processes, the user / kernel-space divide, etc.
  • A good understanding of large-scale distributed systems in practice, including multi-tier architectures, application security, monitoring, and storage systems.
  • Working knowledge of the TCP / IP stack, internet routing, and load balancing.
  • Grit, drive, and a deep feeling of ownership.
Bonus Points for Experience with the following
  • Golang
  • Typescript
  • Kubernetes
  • Terraform
  • Open telemetry
  • eBPF
  • Datadog
  • Helm Charts
HLS video transcoding, distribution & playback
  • Experience designing, implementing, and running services in high demand high-traffic environments
  • Experience with high-availability services
Obtém a tua avaliação gratuita e confidencial do currículo.
ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.