Enable job alerts via email!

Site Reliability Engineering Manager, GCP

Motorsport Hackers

Dearborn (MI)

Remote

USD 90,000 - 150,000

Full time

28 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a forward-thinking company as a Site Reliability Engineering Manager, where you'll lead a team in enhancing cloud services and driving innovation in mobility. This role offers the chance to shape the future of transportation, ensuring reliability and performance across critical systems. You'll collaborate with diverse teams, promote best practices, and foster a culture of continuous improvement. With a focus on operational excellence, your leadership will empower engineers to excel and make a significant impact. If you're passionate about technology and ready to make a difference, this opportunity is for you!

Benefits

Immediate medical, dental, and prescription drug coverage
Flexible family care and parental leave
Vehicle discount program
Paid time off for community service
Generous paid holidays
Option to purchase additional vacation time

Qualifications

  • 5+ years in Site Reliability Engineering with strong SRE principles.
  • Proven leadership experience in building high-performing teams.

Responsibilities

  • Lead and mentor a team of Site Reliability Engineers.
  • Develop strategic vision for Site Reliability Engineering.

Skills

Site Reliability Engineering
Cloud Computing (GCP)
Leadership
Incident Management
Communication Skills
Automation Tools
Scripting Languages (Python, Golang)

Education

Bachelor's degree in Computer Science
Master's degree in Computer Science

Tools

Prometheus
Grafana
Datadog
Terraform
Docker
Kubernetes

Job description

Site Reliability Engineering Manager, GCP

Dearborn, MI, United States (Remote)

Job Description

Lead the Charge in Mobility's Future: Manager, Site Reliability Engineering at Ford!

Enterprise Technology is at the heart of Ford's transformation, and we're seeking a dynamic Manager of Site Reliability Engineering (SRE) to lead our team in redefining mobility. In this role, you'll empower a team of talented engineers to leverage cutting-edge technology, enhance customer experiences, improve lives, and build vehicles as smart as you are.

As the SRE Manager, you'll be a key driver in developing, enhancing, and expanding our global monitoring and observability platform. You'll guide your team to blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll champion the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.

If you're a passionate and experienced leader with a vision for the future of transportation, this is your opportunity to make a significant impact. Join us and lead a team that's building the future of mobility!

Responsibilities
  • Team Leadership and Management:
    • Lead, mentor, and develop a team of Site Reliability Engineers, providing technical guidance, coaching, and performance feedback.
    • Foster a culture of collaboration, innovation, and continuous improvement within the team.
    • Set clear goals and expectations for the team and individual members, aligning with the overall organizational objectives.
    • Conduct regular performance reviews and provide opportunities for professional growth and development.
    • Manage team resources, including budget, tools, and training.
  • Technical Strategy and Vision:
    • Develop and execute a strategic vision for Site Reliability Engineering, aligning with the company's overall technology roadmap.
    • Identify and evaluate new technologies and methodologies to improve the reliability, performance, and scalability of our systems.
    • Drive the adoption of SRE best practices and principles across the organization.
    • Collaborate with other engineering leaders to ensure alignment on technical direction and priorities.
  • Operational Excellence:
    • Ensure the reliability, performance, and scalability of our critical systems and services.
    • Oversee the implementation and maintenance of monitoring, alerting, and incident response systems.
    • Drive the automation of operational tasks and processes to improve efficiency and reduce toil.
    • Lead or participate in incident management, root cause analysis, and postmortem reviews.
    • Be the escalation point for the on-call of our observability product.
    • Develop and maintain disaster recovery plans and procedures.
  • Collaboration and Communication:
    • Collaborate with development, operations, security, and other teams to ensure the reliability, performance, and security of our systems.
    • Communicate effectively with stakeholders at all levels, providing updates on team progress, challenges, and opportunities.
    • Represent the SRE team in cross-functional meetings and initiatives.
  • Security and Compliance:
    • Ensure compliance with security policies and industry best practices.
    • Participate in security audits and vulnerability assessments.
    • Promote a security-conscious culture within the team.
Qualifications

Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or work equivalence.
  • 5+ proven experience in a Site Reliability Engineering role, with a strong understanding of SRE principles and practices.
  • 3+ years in demonstrated experience in a leadership or management role, with a track record of building and developing high-performing teams.
  • Deep technical expertise in areas such as cloud computing (GCP preferred), systems administration, networking, and software development.
  • Experience with monitoring and alerting tools (e.g., Prometheus, Grafana, Datadog).
  • Experience with automation tools and scripting languages (e.g., Terraform, Golang, Python).
  • Strong understanding of incident management and root cause analysis.
  • Excellent communication, interpersonal, and leadership skills.
  • Ability to work effectively in a fast-paced, dynamic environment.

Preferred Qualifications:

  • Master's degree in Computer Science, Engineering, or a related field.
  • Experience with Agile development methodologies.
  • Experience with DevOps practices and tools.
  • Experience with containerization technologies (e.g., Docker, Kubernetes).
  • Experience with security best practices and compliance frameworks (e.g., SOC 2, ISO 27001).
  • Experience with budgeting and resource management.

As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder…or all of the above? No matter what you choose, we offer a work life that works for you, including:

  • Immediate medical, dental, and prescription drug coverage
  • Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
  • Vehicle discount program for employees and family members, and management leases
  • Established and active employee resource groups
  • Paid time off for individual and team community service
  • A generous schedule of paid holidays, including the week between Christmas and New Year’s Day
  • Paid time off and the option to purchase additional vacation time.

*Please note: This is a remote role but if you live within 50 miles of Dearborn, MI, you will be expected to commute on-site up to 3 times a week*

*Visa Sponsorship is NOT provided for this role*

Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.

We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call 1-888-336-0660.

#LI-DS2

Job Info
  • Job Identification 44233
  • Job Category Enterprise Technology
  • Posting Date 04/11/2025, 04:25 PM
  • Degree Level Bachelor's Degree or equivalent
  • Job Schedule Full time
  • Locations 21931 Michigan Ave, Dearborn, MI, 48124, US (Remote)
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Site Reliability Engineering Manager, GCP

Ford Pro

Dearborn

Remote

USD 100.000 - 160.000

28 days ago

Site Reliability Engineering Manager, GCP

Ford Motor Company

Dearborn

Hybrid

USD 90.000 - 150.000

28 days ago