Enable job alerts via email!

Senior Observability Platform Engineer

SS&C Technologies

United States

Remote

USD 80,000 - 120,000

Full time

27 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player in financial services and healthcare technology is seeking passionate software engineers to join their innovative team. This role focuses on developing and maintaining a comprehensive observability stack using cutting-edge open source solutions. You will ensure the reliability and performance of services across diverse cloud platforms while collaborating with cross-functional teams to tackle infrastructure challenges. If you thrive in a dynamic environment and are eager to stay ahead of the latest trends in observability, this opportunity is perfect for you to make a significant impact in a growing organization.

Benefits

Health Insurance
Dental Insurance
401k Plan
Tuition Reimbursement
Professional Development Reimbursement

Qualifications

  • Experience in observability and system monitoring in cloud environments.
  • Hands-on experience with Kubernetes and observability tools.

Responsibilities

  • Design and maintain observability stack including monitoring and visualization.
  • Collaborate with teams to integrate observability practices.

Skills

Observability
System Performance Analysis
Problem-Solving
Scripting (Go, Python, Shell)
Communication Skills

Education

Bachelor’s degree in Computer Science
Master’s degree in Computer Science

Tools

Prometheus
Grafana
Kubernetes
ELK Stack
Terraform
Zabbix

Job description

As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology.

Job Description

The position offers an exciting opportunity for software engineers passionate about open source software, Linux, Kubernetes, and Observability. The monitoring stack will provide comprehensive monitoring across system metrics, database performance, network health, and message queues. It will also oversee applications running on diverse cloud platforms, including Kubernetes and ESXi, as well as on bare-metal servers, virtual machines, and containers in the SS&C Private Cloud.

Responsibilities:

  • Responsible for designing, developing, implementing, and maintaining our comprehensive observability stack, including tracing, telemetry, logging, health monitoring, visualization, and dashboards. You will play a key role in ensuring the reliability, performance, and operational efficiency of our services.
  • Design and implement a robust observability framework using composable open source solutions like Prometheus, Alertmanager, OpenTelemetry, Grafana, Alloy, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar.
  • Develop and maintain health monitoring and alerting systems for our compute platforms, databases, network infrastructure as well as Kubernetes-based platforms including GPU-supported environments.
  • Create and manage visualization dashboards to monitor system performance, resource utilization, and operational health.
  • Implement scalable, distributed logging and tracing solutions to diagnose, troubleshoot, and resolve system issues effectively.
  • Collaborate with development and operations teams to integrate observability practices into the development lifecycle.
  • Conduct performance analysis and optimization to ensure system reliability and efficiency.
  • Stay updated with the latest trends and technologies in observability and performance monitoring.
  • Collaborate with cross-functional teams (Cloud Engineering, Network, and DevOps/Solutions Engineering) to troubleshoot and resolve infrastructure issues.

Preferred Qualifications:

  • Proven experience in observability, system and network monitoring, and system performance analysis, particularly in a cloud or data center environment.
  • Expertise in implementing and managing observability tools and technologies such as composable open source solutions like Prometheus, Alertmanager, OpenTelemetry, Grafana, Alloy, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar commercial solutions.
  • Hands-on experience with Kubernetes.
  • Experience with infrastructure-as-code and configuration management tools such as Consul, GitHub, Salt Stack, Terraform, etc.
  • Proficiency in scripting and automation using languages such as Go, Python, Shell.
  • Excellent problem-solving skills and the ability to work independently or as part of a team.
  • Strong communication skills and the ability to work in a fast-paced, dynamic environment.

Educational Qualifications:

  • Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.

SS&C offers excellent benefits including health, dental, 401k plan, tuition and professional development reimbursement plan.

SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.