Site Reliability Engineer / Observability Engineer
Public Cloud - Offerings and Delivery – Workforce Mgmt & Delivery Ops /
Full - Time / Remote
Rackspace is building up its Professional Services Center of Excellence on Application Performance Monitoring Suites.
If you enjoy solving complex business problems and can contribute to building the next generation of modern applications for our customers—helping them understand the connections between application performance, user experience, and business outcomes—creating amazing customer experiences with modern interpretations of SRE and Observability using Datadog, New Relic, AppDynamics, or Dynatrace, then join us!
Rackspace enables businesses to accelerate digital transformation through our innovative data and integration solutions that help you fix problems quickly, maintain complex systems, and improve code. We believe Datadog, AppDynamics, or New Relic will be significant contributors to our work, and we seek talented, creative, and thoughtful individuals to shape Observability Engineering for our customers.
You Will- Work with customers to implement Observability solutions.
- Build and maintain scalable systems and automation supporting engineering goals.
- Develop and maintain monitoring tools, alerts, and dashboards to provide visibility into system health and performance.
- Analyze metric and log data to perform anomaly detection, performance tuning, capacity planning, and fault isolation.
- Collaborate with development teams to deploy new features, ensuring they meet reliability, security, and performance standards.
- Document and share solutions with team members.
- Maintain a deep understanding of the customer’s business and technical environment.
- Identify performance bottlenecks, anomalous behavior, and root causes of service issues.
You Have:- Bachelor’s degree in engineering/computer science or equivalent.
- Senior experience with Site Reliability Engineering, DevOps, application support, AWS infrastructure, automation, and related areas.
- Experience with observability tools like Splunk, Datadog, SignalFx, etc.
- Experience deploying and supporting applications/services in AWS.
- Proactive problem-solving skills.
- Proficiency in languages like Python, PHP, Perl, Ruby, or Linux Shell.
- Experience with Terraform or CloudFormation scripting.
- Experience with configuration management tools like Ansible, Chef, or Puppet.
- Familiarity with software development best practices and tools such as Git.
- Experience working in agile environments.
- Understanding of AWS pricing models, especially compute, storage, and databases.
- Good knowledge of network and system management solutions.
- Strong organizational, project management, communication, critical thinking, and analytical skills.
About Rackspace Technology- We are multicloud solutions experts, combining leading technologies to deliver end-to-end solutions. We advise, design, build, manage, and optimize solutions tailored to our customers' needs. Recognized as a top place to work by Fortune, Forbes, and Glassdoor, we attract and develop world-class talent. Join us to embrace technology, empower customers, and shape the future.
More on Rackspace Technology- We value diversity and believe that unique perspectives fuel innovation. We foster an inclusive environment where everyone can thrive. We are committed to equal employment opportunity and welcome applicants with disabilities or special needs for accommodations. Apply today and become part of our inspiring mission.