Job Search and Career Advice Platform
3,347

It Support Engineer jobs in United Arab Emirates

Evals Software Engineer

Evals Software Engineer
COL Limited
London
GBP 60,000 - 85,000
I want to receive the latest job alerts for “It Support Engineer” jobs

Software Implementation Consultant

Software Implementation Consultant
IMC AG
England
GBP 40,000 - 60,000

Kafka developer

Kafka developer
Avance Consulting
Sheffield
GBP 70,000 - 100,000

Android Developer

Android Developer
Nixor Recruitment
Oxford
GBP 30,000 - 50,000

C++ Engineer

C++ Engineer
Oxford Knight
London
GBP 70,000 - 100,000
Discover more opportunities than anywhere else.
Find more jobs now

Senior Application Engineer

Senior Application Engineer
Luminance
Cambridge
GBP 60,000 - 85,000

Applications Engineer

Applications Engineer
Luminance
Cambridge
GBP 30,000 - 50,000

Full stack Angular software engineer

Full stack Angular software engineer
Avaloq
City of Edinburgh
GBP 35,000 - 55,000
HeadhuntersConnect with headhunters to apply for similar jobs

Senior Software Engineer

Senior Software Engineer
Arqit
Belfast
GBP 50,000 - 75,000

Senior Android Engineer

Senior Android Engineer
Burns Sheehan
London
GBP 60,000 - 80,000

Software Developer C++

Software Developer C++
BAE Systems
Guildford
GBP 45,000 - 60,000

Senior Software Engineer, Pixel Graphics, GPU Software

Senior Software Engineer, Pixel Graphics, GPU Software
Google Inc.
London
GBP 50,000 - 80,000

Applications Engineer Control Systems

Applications Engineer Control Systems
ZipRecruiter
England
GBP 40,000 - 70,000

Affiliate Android Developer (6 Month Contract)

Affiliate Android Developer (6 Month Contract)
AND Digital
Leeds
GBP 40,000 - 60,000

Software Engineer - Agentic

Software Engineer - Agentic
Pendo
Sheffield
GBP 40,000 - 60,000

Software Engineer II

Software Engineer II
Cadence Design Systems
Cambridge
GBP 30,000 - 45,000

C# Senior Software Engineer

C# Senior Software Engineer
Adria Solutions Ltd.
Stockport
GBP 50,000 - 70,000

Software engineer

Software engineer
Pearson Carter
London
GBP 38,000 - 44,000

Senior Software Engineer

Senior Software Engineer
ZipRecruiter
Rotherham
GBP 50,000 - 60,000

Software Engineer II

Software Engineer II
APEX Fintech Services LLC
Belfast
GBP 40,000 - 55,000

Python Software Engineer - Renewables

Python Software Engineer - Renewables
Clarehill Associates
Bristol
GBP 50,000 - 70,000

Fire Alarm Engineer

Fire Alarm Engineer
ZipRecruiter
London
GBP 30,000 - 45,000

Senior Platform Software Engineer

Senior Platform Software Engineer
FLIR
Fareham
GBP 50,000 - 75,000

Software Engineer II (C# or C++)

Software Engineer II (C# or C++)
Bentley Systems
Horsham
GBP 45,000 - 65,000

Software Development Engineer

Software Development Engineer
Chartsign Limited
Knutsford
GBP 40,000 - 60,000

Top job titles:

Nhs jobsAdministration jobsWork From Home jobsWarehouse jobsPart Time jobsCustomer Care Advisor jobsRemote jobsBusiness Analyst jobsProject Manger jobsSoftware Developer jobs

Top companies:

Jobs at NhsJobs at TescoJobs at AsdaJobs at AmazonJobs at GuardianJobs at Marks And SpencerJobs at Royal MailJobs at WmJobs at McdonaldsJobs at Morrisons

Top cities:

Jobs in LondonJobs in ManchesterJobs in BirminghamJobs in LeedsJobs in BristolJobs in GlasgowJobs in EdinburghJobs in BelfastJobs in LiverpoolJobs in Nottingham

Similar jobs:

Software Engineer jobsElectrical Engineer jobsMechanical Engineer jobsSecurity Guard jobsCivil Engineer jobsIt Manager jobsEngineer jobsWaiter jobsWaitress jobsSecurity Supervisor jobs

Evals Software Engineer

COL Limited
London
GBP 60,000 - 85,000
Job description

Applications deadline: We review applications on a rolling basis and encourage early submissions.

ABOUT APOLLO RESEARCH

The capabilities of current AI systems are evolving at a rapid pace. While these advancements offer tremendous opportunities, they also present significant risks, such as the potential for deliberate misuse or the deployment of sophisticated yet misaligned models. At Apollo Research, our primary concern lies with deceptive alignment, a phenomenon where a model appears to be aligned but is, in fact, misaligned and capable of evading human oversight.

Our approach focuses on behavioral model evaluations, which we then use to audit real-world models. We also combine black-box approaches with applied interpretability. In our evaluations, we focus on LM agents, i.e. LLMs with agentic scaffolding similar to AIDE or SWE agent. We also study model organisms in controlled environments (see our security policies), e.g. to better understand capabilities related to scheming.

At Apollo, we aim for a culture that emphasizes truth-seeking, being goal-oriented, giving and receiving constructive feedback, and being friendly and helpful. If you’re interested in more details about what it’s like working at Apollo, you can find more information here.

THE OPPORTUNITY

We're seeking a Software Engineer who will enhance our capability to evaluate Large Language Models (LLMs) through building critical tools and libraries for our Evals team. Your work will directly impact our mission to make AI systems safer and more aligned.

What You'll Accomplish in Your First Year

1. Accelerate our frontier LLM evaluations research by leading the design and implementation of software libraries and tools that underpin our end-to-end research workflows

2. Ensure the reliability of our experimental results by building tools that identify subtle changes in LLM behavior and maintain integrity across our research

3. Shape the vision for our internal software platform, leading key decisions about how researchers will run workloads, interact with data, analyze results, and share insights

4. Increase team productivity by providing design guidance, debugging, and technical support to unblock researchers and enable them to focus on their core research

5. Build expertise working with state of the art (SOTA) AI systems and tackling the unique challenges posed when building software around them

Key Responsibilities

- Rapidly prototype and iterate on internal tools and libraries for building and running frontier language model evaluations

- Lead the development of major features from ideation to implementation

- Collaboratively define and shape the software roadmap and priorities

- Establish and advocate for good software design practices and codebase health

- Establish design patterns for new types of evaluations

- Build LLM agents that automate our internal software development and research

- Work closely with researchers to understand what challenges they face

- Assist researchers with implementation and debugging of research code

- Communicate clearly about technical decisions and tradeoffs

Job Requirements

You must have experience writing production-quality python code. We are looking for strong generalist software engineers with a track record of taking ownership. Candidates may demonstrate these skills in different ways. For example, you might have one of more of these:

- Led the development of a successful software tool or product over an extended period (e.g. 1 year or more)

- Started and built the tech stack for a company

- Worked your way up in a large organisation, repeatedly gaining more responsibility and influencing a large part of the codebase

- Authored and/or maintained a popular open-source tool or library

- 5+ years of professional software engineering experience

The following experience would be a bonus:

- Experience working with LLM agents or LLM evaluations

- Infosecurity / cybersecurity experience

- Experience working with AWS

- Interest in AI Safety

We want to emphasize that people who feel they don’t fulfill all of these characteristics but think they would be a good fit for the position nonetheless are strongly encouraged to apply. We believe that excellent candidates can come from a variety of backgrounds and are excited to give you opportunities to shine.

Representative projects

- Implement an internal job orchestration tool which allows researchers to run evals on remote machines.

- Build out an eval runs database which stores all historical results in a queryable format.

- Implement LLM agents to automate internal software engineering and research tasks.

- Design and implement research tools for loading, viewing and interacting with transcripts from eval runs.

- Establish internal patterns and conventions for building new types of evaluations within the Inspect framework.

- Optimize the CI pipeline to reduce execution time and eliminate flaky tests.

ABOUT THE TEAM

The current evals team consists of Mikita Balesni, Jérémy Scheurer, Alex Meinke, Rusheb Shah, Bronson Schoen, Andrei Matveiakin, Felix Hofstätter, and Axel Højmark. MariusHobbhahn manages and advises the team, though team members lead individual projects. You would work closely with Rusheb and Andrei, who are the full-time software engineers on the evals team, but you would also interact a lot with everyone else. You can find our full team here.


EVALS TEAM WORK. The evals team focuses on the following efforts:
  • We have recently switched to Inspect as our primary evals framework. If you want to prepare for the SWE role, we recommend playing around with Inspect.
  • Conceptual work on safety cases for scheming, for example, our work on evaluation-based safety cases for scheming
  • Building evaluations for scheming-related properties, such as situational awareness or deceptive reasoning.
  • Conducting evaluations on frontier models and publishing the results either to the general public or a target audience such as AI developers or governments, for example, our work in OpenAI’s o1-preview system card.
  • Creating model organisms and demonstrations of behavior related to deceptive alignment, e.g. exploring the influence of goal-directedness on scheming.
  • Designing and evaluating AI control protocols. We have not started these efforts yet but intend to work on them starting Q2 2025.
LOGISTICS
  • Start Date: Target of 2-3 months after the first interview.
  • Time Allocation: Full-time.
  • Location: The office is in London, and the building is shared with the London Initiative for Safe AI (LISA) offices. This is an in-person role. In rare situations, we may consider partially remote arrangements on a case-by-case basis.
  • Work Visas: We can sponsor UK visas
BENEFITS
  • Salary: a competitive UK-based salary.
  • Flexible work hours and schedule.
  • Unlimited vacation.
  • Unlimited sick leave.
  • Lunch, dinner, and snacks are provided for all employees on workdays.
  • Paid work trips, including staff retreats, business trips, and relevant conferences.
  • A yearly $1,000 (USD) professional development budget.

Equality Statement: Apollo Research is an Equal Opportunity Employer. We value diversity and are committed to providing equal opportunities to all, regardless of age, disability, gender reassignment, marriage and civil partnership, pregnancy and maternity, race, religion or belief, sex, or sexual orientation.

How to apply:Please complete the application form with your CV. The provision of a cover letter is optional but not necessary. Please also feel free to share links to relevant work samples.

About the interview process: Our multi-stage process includes a screening interview, a take-home test (approx. 2 hours), 3 technical interviews, and a final interview with Marius (CEO). The technical interviews will be closely related to tasks the candidate would do on the job. There are no leetcode-style general coding interviews. If you want to prepare for the interviews, we suggest working on hands-on LLM evals projects (e.g. as suggested in our starter guide), such as building LM agent evaluations in Inspect.

Applications deadline: We review applications on a rolling basis and encourage early submissions.

* This role is supported byAI Futures Grants, a UK Government program designed to help the next generation of AI leaders meet the costs of relocating to the UK. AI Futures Grants provide financial support to reimburse relocation costs such as work visa fees, immigration health surcharge and travel/subsistence expenses. Successful candidates for this role may beable to get up to £10,000 to meet associated relocation costs, subject to terms and conditions.

Thank you very much for applying to Apollo Research.

  • Previous
  • 1
  • ...
  • 115
  • 116
  • 117
  • ...
  • 134
  • Next

* The salary benchmark is based on the target salaries of market leaders in their relevant sectors. It is intended to serve as a guide to help Premium Members assess open positions and to help in salary negotiations. The salary benchmark is not provided directly by the company, which could be significantly higher or lower.

Job Search and Career Advice Platform
Land a better
job faster
Follow us
JobLeads Youtube ProfileJobLeads Linkedin ProfileJobLeads Instagram ProfileJobLeads Facebook ProfileJobLeads Twitter AccountJobLeads Xing Profile
Company
  • Customer reviews
  • Careers at JobLeads
  • Site notice
Services
  • Free resume review
  • Job search
  • Headhunter matching
  • Career advice
  • JobLeads MasterClass
  • Browse jobs
Free resources
  • Predictions for 2024
  • 5 Stages of a Successful Job Search
  • 8 Common Job Search Mistakes
  • How Long should My Resume Be?
Support
  • Help
  • Partner integration
  • ATS Partners
  • Privacy Policy
  • Terms of Use

© JobLeads 2007 - 2025 | All rights reserved