Enable job alerts via email!

Site Reliability Engineering Manager, GCP

Ford Motor Company

Dearborn (MI)

Hybrid

USD 90,000 - 150,000

Full time

28 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a dynamic Manager of Site Reliability Engineering to lead a talented team in redefining mobility. This role involves enhancing customer experiences and building a global monitoring platform. As a key driver of innovation, you'll blend software and systems engineering, ensuring the uptime and scalability of critical cloud services. Join a forward-thinking company where your leadership can make a significant impact on the future of transportation, and enjoy a collaborative environment that values your contributions.

Benefits

Immediate medical, dental, and prescription drug coverage
Flexible family care and parental leave
Vehicle discount program
Tuition assistance
Paid time off for community service
Generous holiday schedule
Option to purchase additional vacation time

Qualifications

  • 5+ years in Site Reliability Engineering with strong SRE principles.
  • 3+ years in leadership roles, building high-performing teams.
  • Deep expertise in cloud computing and systems administration.

Responsibilities

  • Lead and mentor a team of Site Reliability Engineers.
  • Develop strategic vision for Site Reliability Engineering.
  • Ensure reliability and scalability of critical systems.

Skills

Site Reliability Engineering
Leadership
Cloud Computing (GCP preferred)
Incident Management
Root Cause Analysis
Communication Skills

Education

Bachelor's degree in Computer Science or Engineering
Master's degree in Computer Science or Engineering

Tools

Prometheus
Grafana
Datadog
Terraform
Golang
Python
Docker
Kubernetes

Job description

Lead the Charge in Mobility's Future: Manager, Site Reliability Engineering at Ford!

Enterprise Technology is at the heart of Ford's transformation, and we're seeking a dynamic Manager of Site Reliability Engineering (SRE) to lead our team in redefining mobility. In this role, you'll empower a team of talented engineers to leverage cutting-edge technology, enhance customer experiences, improve lives, and build vehicles as smart as you are.

As the SRE Manager, you'll be a key driver in developing, enhancing, and expanding our global monitoring and observability platform. You'll guide your team to blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll champion the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.

If you're a passionate and experienced leader with a vision for the future of transportation, this is your opportunity to make a significant impact. Join us and lead a team that's building the future of mobility!

Qualifications

Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or work equivalence.
  • 5+ proven experience in a Site Reliability Engineering role, with a strong understanding of SRE principles and practices.
  • 3+ years in demonstrated experience in a leadership or management role, with a track record of building and developing high-performing teams.
  • Deep technical expertise in areas such as cloud computing (GCP preferred), systems administration, networking, and software development.
  • Experience with monitoring and alerting tools (e.g., Prometheus, Grafana, Datadog).
  • Experience with automation tools and scripting languages (e.g., Terraform, Golang, Python).
  • Strong understanding of incident management and root cause analysis.
  • Excellent communication, interpersonal, and leadership skills.
  • Ability to work effectively in a fast-paced, dynamic environment.

Preferred Qualifications:

  • Master's degree in Computer Science, Engineering, or a related field.
  • Experience with Agile development methodologies.
  • Experience with DevOps practices and tools.
  • Experience with containerization technologies (e.g., Docker, Kubernetes).
  • Experience with security best practices and compliance frameworks (e.g., SOC 2, ISO 27001).
  • Experience with budgeting and resource management.

You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!

As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder…or all of the above? No matter what you choose, we offer a work life that works for you, including:

  • Immediate medical, dental, and prescription drug coverage
  • Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
  • Vehicle discount program for employees and family members, and management leases
  • Tuition assistance
  • Established and active employee resource groups
  • Paid time off for individual and team community service
  • A generous schedule of paid holidays, including the week between Christmas and New Year’s Day
  • Paid time off and the option to purchase additional vacation time.

*Please note: This is a remote role but if you live within 50 miles of Dearborn, MI, you will be expected to commute on-site up to 3 times a week.

*Visa Sponsorship is NOT provided for this role.

*Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.

We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call 1-888-336-0660.

Responsibilities
  • Team Leadership and Management:
    • Lead, mentor, and develop a team of Site Reliability Engineers, providing technical guidance, coaching, and performance feedback.
    • Foster a culture of collaboration, innovation, and continuous improvement within the team.
    • Set clear goals and expectations for the team and individual members, aligning with the overall organizational objectives.
    • Conduct regular performance reviews and provide opportunities for professional growth and development.
    • Manage team resources, including budget, tools, and training.
  • Technical Strategy and Vision:
    • Develop and execute a strategic vision for Site Reliability Engineering, aligning with the company's overall technology roadmap.
    • Identify and evaluate new technologies and methodologies to improve the reliability, performance, and scalability of our systems.
    • Drive the adoption of SRE best practices and principles across the organization.
    • Collaborate with other engineering leaders to ensure alignment on technical direction and priorities.
  • Operational Excellence:
    • Ensure the reliability, performance, and scalability of our critical systems and services.
    • Oversee the implementation and maintenance of monitoring, alerting, and incident response systems.
    • Drive the automation of operational tasks and processes to improve efficiency and reduce toil.
    • Lead or participate in incident management, root cause analysis, and postmortem reviews.
    • Be the escalation point for the on-call of our observability product.
    • Develop and maintain disaster recovery plans and procedures.
  • Collaboration and Communication:
    • Collaborate with development, operations, security, and other teams to ensure the reliability, performance, and security of our systems.
    • Communicate effectively with stakeholders at all levels, providing updates on team progress, challenges, and opportunities.
    • Represent the SRE team in cross-functional meetings and initiatives.
  • Security and Compliance:
    • Ensure compliance with security policies and industry best practices.
    • Participate in security audits and vulnerability assessments.
    • Promote a security-conscious culture within the team.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Site Reliability Engineering Manager, GCP

Motorsport Hackers

Dearborn

Remote

USD 90,000 - 150,000

28 days ago

Site Reliability Engineering Manager, GCP

Ford Pro

Dearborn

Remote

USD 100,000 - 160,000

28 days ago