Production Engineer, Network

Meta

Dublin

On-site

EUR 70,000 - 90,000

Full time

Today

Job summary

A leading technology company in Dublin seeks a Production Engineer to design and implement automation for its complex global network. The role involves collaborating with experts to build scalable, reliable systems and improve operational efficiency. Candidates should have a Bachelor's degree in a technical field and extensive experience with network configurations and software development.

Qualifications

  • 5+ years of experience developing network device configurations.
  • 5+ years of coding experience in at least one programming language.
  • Experience developing software to automate operations.

Responsibilities

  • Maintain automation and tools for network products.
  • Improve operational workflows through automation.
  • Resolve complex technical issues across networks.
  • Participate in on-call rotation for service escalation.

Skills

Network device configurations
Software automation
Coding (Python, Go, etc.)
TCP/IP knowledge

Education

Bachelor's degree in Computer Science or equivalent
Master's degree or graduate work (preferred)

Job description

Overview

Summary: Meta is seeking a Production Engineer with an in-depth understanding of networking, systems, automation, and tooling to join the PE Network team. This team is responsible for deploying and managing one of the world’s largest and most complex networks. Meta’s network is a foundational component in achieving the company’s AI goals, and this role is central to supporting it. Given the scale and demands of our infrastructure, automation is critical. In this position, you will design, develop, and implement automation and tooling to streamline network operations while ensuring the scalability and reliability of Meta’s global network. You’ll collaborate with top engineers in the industry to build and maintain the systems that power this network, supporting billions of users across our applications.

Responsibilities
  1. Conceptualize, build, and maintain automation and tools to support the next generation of network products, network deployment, release engineering and operations
  2. Develop operational process improvements and implement them in scalable, automated workflows to enhance operational efficiency
  3. Design and develop solutions that scale across a variety of network platforms
  4. Lead enhancements of automation for continuous integration, validations, testing infrastructure, release, and configuration management across our global data center network fleet
  5. Conduct thorough investigations into complex technical issues across networks, ranging from automated tooling to hardware failures and network issues
  6. Participate in a weekly on-call rotation with the team and be an escalation contact for your service
  7. Proactively identify operational gaps that affect your team's efficiency, develop an execution plan, and drive the project both directly and by influencing other team members
  8. Contribute to team growth and development through peer mentorship

Minimum Qualifications
  1. Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  2. Experience developing software to automate operations
  3. 5+ years of experience developing and understanding network device configurations for at least one network vendor (e.g. Arista, Juniper, Cisco, Brocade, Ciena, Infinera, Nokia, etc.)
  4. 5+ years of coding experience in at least one programming language (e.g. Python, Go, C++)
  5. Demonstrated knowledge of TCP, IPv4/IPv6, routing protocols (one or more of BGP, MPLS, IS-IS, or similar), or related network services (e.g. DHCP and DNS)

Preferred Qualifications
  1. Master's degree or graduate work experience in Computer Science, Computer Engineering, or a related technical field
  2. 6+ years of experience building software solutions for managing network infrastructure, with a focus on scalability and reliability
  3. In-depth knowledge of software and network debugging, profiling, and instrumentation techniques to ensure optimal system performance
  4. Proven experience designing, developing, and operating distributed systems at scale, with an in-depth understanding of the challenges and opportunities in this space
  5. Experience designing and maintaining automated testing infrastructure to ensure the quality and reliability of our systems
  6. Knowledge of InfiniBand (IB), RDMA, and RoCE networks, including RDMA congestion control mechanisms, and of AI training workloads and the demands they place on networks

Industry: Internet
