May Mobility is transforming cities through autonomous technology to create a safer, greener, more accessible world. Based in Ann Arbor, Michigan, May develops and deploys autonomous vehicles (AVs) powered by our innovative Multi-Policy Decision Making (MPDM) technology that reimagines the way AVs think.
Our vehicles do more than just drive themselves - they provide value to communities, bridge public transit gaps, and move people safely, easily, and enjoyably. We’re building the world’s best autonomy system to reimagine transit by minimizing congestion, expanding access, and encouraging better land use to foster more green, vibrant, and livable spaces. Since our founding in 2017, we’ve provided over 300,000 autonomy-enabled rides globally. We’re just getting started. Join us if you’re passionate about building the future and solving real-world problems.
Job Summary
The Network Reliability Engineer is responsible for ensuring 99.99% uptime for our headquarters and data center operations. We are investing in infrastructure and team growth to achieve unparalleled reliability and performance. The Network Reliability Engineer will collaborate with our Network Engineer and IT team to enhance the stability, performance, and resilience of our critical network and server infrastructure. This role emphasizes a proactive, automation-first approach, focusing on developing internal tools, automating routine tasks, and building robust testing mechanisms. The ideal candidate will have a deep understanding of underlying technologies, be vendor-agnostic, and embody a strong DevOps mentality.
Essential Responsibilities
- Proactive Maintenance & Monitoring: Conduct regular health checks, configuration audits, firmware updates, and preventative maintenance on network devices (routers, switches, firewalls) and server infrastructure to identify and mitigate issues before they impact services.
- Automation & Tooling Development: Design, develop, and implement internal tools and scripts (e.g., Python, Golang, Bash) to automate network and system monitoring, configuration management, and failover testing. Build automation-driven processes to streamline IT operations.
- Infrastructure Reliability & Resilience: Contribute to designing and implementing highly available and redundant infrastructure solutions, ensuring adherence to disaster recovery and business continuity best practices.
- Troubleshooting & Incident Response: Provide advanced support for complex network and system issues, participate in incident response, and perform root cause analysis to prevent recurrence.
- Documentation & Knowledge Sharing: Maintain comprehensive documentation for network diagrams, configurations, automation scripts, and operational procedures. Cross-train with senior engineers to build team knowledge redundancy.
- Technology Evaluation: Research, evaluate, and recommend new technologies to enhance infrastructure reliability, performance, and automation, focusing on underlying principles rather than vendor lock-in.
- Vendor Management: Collaborate with hardware and software vendors on support, maintenance, and technology integration.
- On-Call: Participate in the on-call rotation as needed.
Qualifications
Required:
- 5+ years of progressive experience in IT infrastructure, emphasizing network engineering and systems administration.
- Deep understanding of IPv6, routing protocols (BGP, OSPF), switching, VLANs, VPNs, and firewall concepts, with experience in network reliability technologies (BFD, VRRP) and underlying principles.
- Solid experience with BSD/Linux server OS and cloud platforms (AWS, GCP).
- Proficiency in scripting languages such as Python, Go, or Bash for automation and data analysis.
- Experience with open-source networking software (e.g., TNSR, pfSense, WireGuard) and Linux/BSD systems.
- Strong DevOps mindset, including Infrastructure as Code (IaC), CI/CD pipelines, and automation of repetitive tasks.
- Excellent problem-solving, analytical, and diagnostic skills.
- Effective communication skills for technical and non-technical audiences.
- Reside within an hour of Ann Arbor, MI.
Preferred:
- Relevant certifications (e.g., CCNA/CCNP, JNCIA/JNCIS, CompTIA Network+, Server+).
- Experience with configuration management tools (Ansible, Puppet, Chef).
- Knowledge of cloud platforms (AWS preferred) and hybrid cloud networking.
- Experience with network performance monitoring tools (Prometheus, SNMP, Zabbix).
- Experience with IPv6 dynamic routing and Cisco L2/L3 networking.
- 5+ years using configuration management tools and IaC (Terraform, CloudFormation).
- 5+ years Linux system administration and scripting experience.
- Experience with cloud services, compute, storage, and containers.
Desirable:
- Leadership in adopting DevOps practices.
- Experience with container orchestration and CI/CD pipelines.
Physical Requirements
- Standard office conditions: prolonged sitting, standing, and computer use.
- Travel: moderate (11%-25%).
Salary Range: $95,000 - $120,000 USD
Benefits and Perks
- Comprehensive healthcare including medical, dental, vision, life, and disability.
- Health Savings and Flexible Spending Accounts.
- Retirement plans with employer match.
- Paid parental leave and phased return to work.
- Flexible vacation and paid holidays.
- Wellness resources and programs.
We encourage applicants from diverse backgrounds to apply, even if you don’t meet every qualification. If you’re excited about the role, we want to hear from you!
Learn more about our culture & benefits on our website.
May Mobility is an equal opportunity employer. All applicants are considered without regard to race, color, religion, sex, national origin, age, disability, sexual orientation, gender identity or expression, veteran status, genetics, or other protected categories. You may voluntarily share demographic info to help us improve our hiring processes. Accommodation requests are welcome.
Note to recruitment agencies: We do not accept unsolicited resumes or pay fees for candidates submitted by agencies outside our approved partners.