Enable job alerts via email!

Staff Network Operations Engineer

Crusoe Energy Systems LLC

San Francisco (CA)

Hybrid

USD 195,000 - 230,000

Full time

9 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a pioneering company at the forefront of AI-first cloud infrastructure, where you'll play a crucial role in managing and optimizing a global network. As a Staff Network Operations Engineer, you'll ensure high performance and reliability of cutting-edge technologies while working collaboratively with cross-functional teams. This role offers the opportunity to make a significant impact in a fast-paced environment committed to sustainability and innovation. If you're passionate about environmental technologies and network engineering, this is the perfect chance to thrive in a dynamic and supportive atmosphere.

Benefits

Hybrid work schedule
Restricted Stock Units
Health insurance options
Paid Parental Leave
401(k) with 100% match
Generous paid time off
Tuition reimbursement
Company-paid commuter benefit
Pet-friendly offices
Subscription to the Calm app

Qualifications

  • 10+ years experience in production environments.
  • In-depth knowledge of network protocols and monitoring tools.

Responsibilities

  • Manage Crusoe Energy Cloud's global network and optimize connectivity.
  • Lead operational excellence initiatives for high network availability.

Skills

TCP/IP
BGP
OSPF/IS-IS
EVPN
VXLAN
MPLS
SNMP
Python
Ansible
Public Cloud Connectivity

Education

Bachelor's degree in Computer Science
Equivalent work experience (3+ years)

Tools

Kentik
Arbor
Thousand Eyes
Catchpoint

Job description

Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the climate. Our AI platform is recognized as the "gold standard" for reliability and performance. Our data centers are optimized for AI workloads and are powered by clean, renewable energy.

Be part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About the Role

The Crusoe Cloud Network Engineering team seeks an ambitious, experienced team player to join our Network Operations team. This team is responsible for designing, building, and operating the global edge, backbone, and data center network for High-Performance Compute (HPC) Clusters with GPUs. The ideal candidate will be highly motivated, self-directed, and passionate about working on cutting-edge environmental technologies. Excellent analytical, communication skills, and teamwork are essential.

As a Staff Network Operations Engineer, you will be part of the Network Engineering team, overseeing the operations of the global Crusoe Cloud Network. Your responsibilities include ensuring network uptime through monitoring, outage fixes, and participating in a 24/7 on-call rotation. This role offers valuable experience in managing edge, backbone, and HPC-based data center networking at a large scale.

A Day in the Life:
  1. Manage and optimize Crusoe Energy Cloud's global network, including edge, backbone, data center, and public cloud connectivity.
  2. Collaborate with Network Engineering and cross-functional teams such as Software Infrastructure and Product teams to drive network innovation and evolution.
  3. Lead operational excellence initiatives by developing monitoring, alerting, and self-healing systems to ensure high network availability.
  4. Perform advanced troubleshooting and root cause analysis for incidents, guiding post-mortem reviews and improvements.
  5. Mentor network engineers and establish best practices for incident response, documentation, and operational readiness.
  6. Participate in a 24/7 On-call Support rotation for the Crusoe Network.

You Will Thrive In This Role If:

  1. You have 10+ years of experience building and operating at scale in a production environment.
  2. You possess in-depth knowledge of network protocols such as TCP/IP, QoS, BGP, OSPF/IS-IS, EVPN, VXLAN, QoS, and MPLS-related technologies like RSVP-TE, LDP.
  3. You understand network monitoring protocols and tools like SNMP, IPFIX, Sflow/netflow, and Telemetry.
  4. You have experience with tools such as Kentik, Arbor, Thousand Eyes, Catchpoint, and packet design.
  5. You are familiar with data center network architectures like Fat Tree, CLOS, BGP-TE, and peering for edge.
  6. You have hands-on experience with network devices from Mellanox, Cisco, Arista, Juniper, and other vendors.
  7. You are familiar with mainstream switch/router chipsets like Broadcom and Barefoot.
  8. Knowledge of RDMA, Infiniband, and RoCE is a plus.
  9. You have in-depth knowledge of public cloud connectivity options (AWS, GCP, Azure, Ali Cloud, OCI).
  10. You understand IPv6 and IPv4-IPv6 coexistence technologies.
  11. Programming or scripting experience in Python, Ansible, Puppet, Chef, or similar languages is a plus.
  12. You are self-motivated with good communication and writing skills.
  13. You are a team player willing to participate in the global on-call rotation.
  14. You hold a Bachelor's degree in Computer Science, Information Science, Engineering, Mathematics, or have equivalent work experience (3+ years).

Benefits:

  • Hybrid work schedule
  • Industry-competitive pay
  • Restricted Stock Units in a fast-growing, well-funded tech company
  • Health insurance options including HDHP and PPO, vision, dental for you and dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc access
  • Pet-friendly offices
  • 401(k) with 100% match up to 4%
  • Generous paid time off and holidays
  • Cell phone reimbursement
  • Tuition reimbursement
  • Subscription to the Calm app
  • Company-paid commuter benefit of $200 per pay period

Compensation Range:

Salary between $195,000 and $230,000, including Restricted Stock Units. Final compensation depends on education, experience, skills, and internal equity considerations.

Crusoe is an Equal Opportunity Employer, committed to diversity and inclusion in the workplace.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Staff Network Operations Engineer

Crusoe

San Francisco

Hybrid

USD 195,000 - 230,000

7 days ago
Be an early applicant

Staff Security Architect

QSC

Boulder

Remote

USD 164,000 - 214,000

2 days ago
Be an early applicant

Staff Security Architect

QSC

California

Remote

USD 164,000 - 214,000

2 days ago
Be an early applicant

Staff Security Architect

QSC

Fort Wayne

Remote

USD 164,000 - 214,000

2 days ago
Be an early applicant

Staff Security Operations Engineer, Observability & Automation Engineering

Affirm

Charlotte

Remote

USD 200,000 - 250,000

3 days ago
Be an early applicant

Staff Security Operations Engineer, Observability & Automation Engineering

Affirm

Richmond

Remote

USD 200,000 - 250,000

4 days ago
Be an early applicant

Staff Security Operations Engineer, Observability & Automation Engineering

Affirm

Remote

USD 200,000 - 250,000

5 days ago
Be an early applicant

Staff Security Operations Engineer, Observability & Automation Engineering

Affirm

Connecticut

Remote

USD 225,000 - 275,000

6 days ago
Be an early applicant

Staff Security Architect

QSC

Remote

USD 164,000 - 214,000

3 days ago
Be an early applicant