Enable job alerts via email!

Site Reliability Engineering Manager, GCP

Ford Pro

Dearborn (MI)

Remote

USD 100,000 - 160,000

Full time

28 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a dynamic Site Reliability Engineering Manager to lead a talented team in transforming mobility. This role involves developing a global monitoring platform, ensuring the reliability and scalability of cloud services, and fostering a culture of innovation. If you are passionate about technology and leadership, this is your chance to make a significant impact in the future of transportation. Join a forward-thinking company that values collaboration and offers a flexible work environment, empowering you to shape your career while making a difference.

Benefits

Immediate medical, dental, and prescription drug coverage
Flexible family care and parental leave
Vehicle discount program
Paid time off for community service
Generous paid holidays
Option to purchase additional vacation time

Qualifications

  • 5+ years in Site Reliability Engineering with strong SRE principles.
  • 3+ years in leadership roles, building high-performing teams.

Responsibilities

  • Lead and mentor a team of Site Reliability Engineers.
  • Develop and execute strategic vision for Site Reliability Engineering.
  • Ensure reliability and performance of critical systems and services.

Skills

Site Reliability Engineering principles
Cloud computing (GCP preferred)
Systems administration
Networking
Software development
Incident management
Leadership skills
Communication skills

Education

Bachelor's degree in Computer Science
Master's degree in Computer Science

Tools

Prometheus
Grafana
Datadog
Terraform
Golang
Python
Docker
Kubernetes

Job description

Site Reliability Engineering Manager, GCP

Dearborn, MI, United States (Remote)

Job Description

Lead the Charge in Mobility's Future: Manager, Site Reliability Engineering at Ford!

Enterprise Technology is at the heart of Ford's transformation, and we're seeking a dynamic Manager of Site Reliability Engineering (SRE) to lead our team in redefining mobility. In this role, you'll empower a team of talented engineers to leverage cutting-edge technology, enhance customer experiences, improve lives, and build vehicles as smart as you are.

As the SRE Manager, you'll be a key driver in developing, enhancing, and expanding our global monitoring and observability platform. You'll guide your team to blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll champion the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.

If you're a passionate and experienced leader with a vision for the future of transportation, this is your opportunity to make a significant impact. Join us and lead a team that's building the future of mobility!

Responsibilities
  • Team Leadership and Management:
    • Lead, mentor, and develop a team of Site Reliability Engineers, providing technical guidance, coaching, and performance feedback.
    • Foster a culture of collaboration, innovation, and continuous improvement within the team.
    • Set clear goals and expectations for the team and individual members, aligning with the overall organizational objectives.
    • Conduct regular performance reviews and provide opportunities for professional growth and development.
    • Manage team resources, including budget, tools, and training.
  • Technical Strategy and Vision:
    • Develop and execute a strategic vision for Site Reliability Engineering, aligning with the company's overall technology roadmap.
    • Identify and evaluate new technologies and methodologies to improve the reliability, performance, and scalability of our systems.
    • Drive the adoption of SRE best practices and principles across the organization.
    • Collaborate with other engineering leaders to ensure alignment on technical direction and priorities.
  • Operational Excellence:
    • Ensure the reliability, performance, and scalability of our critical systems and services.
    • Oversee the implementation and maintenance of monitoring, alerting, and incident response systems.
    • Drive the automation of operational tasks and processes to improve efficiency and reduce toil.
    • Lead or participate in incident management, root cause analysis, and postmortem reviews.
    • Be the escalation point for the on-call of our observability product.
    • Develop and maintain disaster recovery plans and procedures.
  • Collaboration and Communication:
    • Collaborate with development, operations, security, and other teams to ensure the reliability, performance, and security of our systems.
    • Communicate effectively with stakeholders at all levels, providing updates on team progress, challenges, and opportunities.
    • Represent the SRE team in cross-functional meetings and initiatives.
  • Security and Compliance:
    • Ensure compliance with security policies and industry best practices.
    • Participate in security audits and vulnerability assessments.
    • Promote a security-conscious culture within the team.
Qualifications

Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or work equivalence.
  • 5+ proven experience in a Site Reliability Engineering role, with a strong understanding of SRE principles and practices.
  • 3+ years in demonstrated experience in a leadership or management role, with a track record of building and developing high-performing teams.
  • Deep technical expertise in areas such as cloud computing (GCP preferred), systems administration, networking, and software development.
  • Experience with monitoring and alerting tools (e.g., Prometheus, Grafana, Datadog).
  • Experience with automation tools and scripting languages (e.g., Terraform, Golang, Python).
  • Strong understanding of incident management and root cause analysis.
  • Excellent communication, interpersonal, and leadership skills.
  • Ability to work effectively in a fast-paced, dynamic environment.

Preferred Qualifications:

  • Master's degree in Computer Science, Engineering, or a related field.
  • Experience with Agile development methodologies.
  • Experience with DevOps practices and tools.
  • Experience with containerization technologies (e.g., Docker, Kubernetes).
  • Experience with security best practices and compliance frameworks (e.g., SOC 2, ISO 27001).
  • Experience with budgeting and resource management.

As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder…or all of the above? No matter what you choose, we offer a work life that works for you, including:

  • Immediate medical, dental, and prescription drug coverage
  • Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
  • Vehicle discount program for employees and family members, and management leases
  • Established and active employee resource groups
  • Paid time off for individual and team community service
  • A generous schedule of paid holidays, including the week between Christmas and New Year’s Day
  • Paid time off and the option to purchase additional vacation time.

*Please note: This is a remote role but if you live within 50 miles of Dearborn, MI, you will be expected to commute on-site up to 3 times a week

*Visa Sponsorship is NOT provided for this role

Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.

We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call 1-888-336-0660.

Job Info
  • Job Identification 44233
  • Job Category Enterprise Technology
  • Posting Date 04/11/2025, 04:25 PM
  • Degree Level Bachelor's Degree or equivalent
  • Job Schedule Full time
  • Locations 21931 Michigan Ave, Dearborn, MI, 48124, US (Remote)
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Site Reliability Engineering Manager, GCP

Motorsport Hackers

Dearborn

Remote

USD 90,000 - 150,000

28 days ago

Site Reliability Engineering Manager, GCP

Ford Motor Company

Dearborn

Hybrid

USD 90,000 - 150,000

28 days ago