Job Search and Career Advice Platform
  • Jobs
  • Headhunters
  • Free resume review
  • About Us
EN
765

Jobs in Surabaya, South Africa

Fine Tuning/Post Training Data Scientist - RL (GRPO, PPO, RLHF)

Binance

Daerah Khusus Ibukota Jakarta
Remote
IDR 250,000,000 - 350,000,000
10 days ago
I want to receive the latest job alerts for jobs in Surabaya

Enterprise Sales Executive

PT ODG Indonesia

Jakarta Timur
Remote
IDR 200,000,000 - 300,000,000
10 days ago

Affiliate Manager, SEA market

Medium

Kota Yogyakarta
Remote
IDR 100,000,000 - 200,000,000
11 days ago

Software Engineer (Cloud-Focused) - House Interactions (Ukraine)

DoiT International

Indonesia
Remote
IDR 1,327,140,000 - 1,990,711,000
11 days ago

Software Engineer (Cloud-Focused) - House Interactions

DoiT International

Indonesia
Remote
IDR 1,327,140,000 - 1,990,711,000
11 days ago
discover more jobs illustrationDiscover more opportunities than anywhere else. Find more jobs now

Senior Frontend Engineer - Monetization

Toggl Inc

Indonesia
Remote
IDR 1,407,908,000
11 days ago

Finance and Purchasing Officer

RecruitGo

Indonesia
Remote
IDR 100,000,000 - 200,000,000
11 days ago

Senior Product Manager, Secret Detection

GitLab

Indonesia
Remote
IDR 300,000,000 - 400,000,000
11 days ago
HeadhuntersConnect with headhunters to apply for similar jobs

Payroll Specialist Lead - Mexico

Remote

Indonesia
Remote
IDR 690,112,000 - 776,377,000
11 days ago

Country Growth Manager (Indonesia)

Hostinger

Provinsi Bali
Remote
IDR 100,000,000 - 200,000,000
11 days ago

Senior Account Executive, Global Payroll - EMEA

Remote

Indonesia
Remote
IDR 1,547,777,000 - 4,354,679,000
11 days ago

Sales Executive

Ahti Interiors

Indonesia
Remote
Confidential
11 days ago

Frontend Engineer - House Interactions

DoiT International

Indonesia
Remote
IDR 1,161,247,000 - 1,658,926,000
11 days ago

Country Manager

CDNetworks

Daerah Khusus Ibukota Jakarta
Remote
IDR 829,462,000 - 1,161,248,000
11 days ago

Software Engineer (GenAI-Focused) - House Interactions

DoiT International

Indonesia
Remote
IDR 1,327,140,000 - 1,990,711,000
11 days ago

Software Engineer (GenAI-Focused) - House Interactions, Ukraine

DoiT International

Indonesia
Remote
IDR 1,327,140,000 - 1,990,711,000
11 days ago

Full Stack Engineer AI (Remote)

Bjak

Daerah Khusus Ibukota Jakarta
Remote
IDR 995,355,000 - 1,327,141,000
11 days ago

Outbound Sales Development Representative - UKI

Remote

Indonesia
Remote
IDR 522,561,000 - 1,177,008,000
11 days ago

Webtoon Designer

Kong Vector

Kota Yogyakarta
Remote
IDR 100,000,000 - 200,000,000
11 days ago

Payroll Lead - Mexico

Remote

Indonesia
Remote
IDR 844,392,000 - 950,565,000
11 days ago

Frontend Engineer - House Interactions (Ukraine)

DoiT International

Indonesia
Remote
IDR 771,456,000 - 1,350,049,000
11 days ago

Public Sector Strategic Account Executive - SLED, Southeast

GitLab

Indonesia
Remote
IDR 1,328,903,000 - 1,993,356,000
12 days ago

Customer Success Specialist - APAC

InEvent, Inc.

None
Remote
IDR 664,451,000 - 996,678,000
12 days ago

Product Marketing Manager, Swine

Stryker Corporation

Kota Medan ᯔᯩᯑᯉ᯲
Remote
Confidential
12 days ago

Video creator for social media

Brand Factory

Jakarta Utara
Remote
IDR 100,000,000 - 200,000,000
12 days ago
Fine Tuning/Post Training Data Scientist - RL (GRPO, PPO, RLHF)
Binance
Daerah Khusus Ibukota Jakarta
Remote
IDR 250.000.000 - 350.000.000
Full time
10 days ago

Job summary

A leading blockchain ecosystem is seeking an experienced professional to develop and optimize Reinforcement Learning models for enterprise applications. You will research advanced algorithms, collaborate with cross-functional teams, and implement training pipelines. Candidates should have a Master's degree in a relevant field and 5+ years of experience in RL optimization. This role offers competitive salary and work-from-home arrangements.

Benefits

Competitive salary
Work-from-home arrangement
Opportunities for career growth

Qualifications

  • 5+ years of hands‑on experience in RL and LLM/VLM/Agentic AI optimization.
  • Experience with large-scale distributed training and optimization.
  • Self‑driven and ownership mindset.

Responsibilities

  • Research and develop state-of-the-art RL algorithms.
  • Design and implement RL training pipelines.
  • Apply Reinforcement Learning methods for reasoning and planning.

Skills

Reinforcement Learning optimization
Strong coding skills in Python
Problem-solving skills
Excellent communication

Education

Master’s Degree in Computer Science or related fields

Tools

ML frameworks
RL libraries
Job description

Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

About the Role

You will develop and optimize Reinforcement Learning (RL) models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning.

You will explore and evaluate advanced Algorithms including PPO, GRPO, DPO, RLHF, RLAIF, and Agentic RL to enhance the capabilities of LLMs, VLMs, and Agentic AI at Binance. The role requires a strong theoretical foundation in RL—covering policy optimization, reward modeling, and planning—paired with the Engineering skills to build scalable production systems.

You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking. Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.

Responsibilities
  • Research and develop state-of-the-art RL algorithms, focusing on large model optimization and alignment techniques.
  • Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
  • Apply Reinforcement Learning methods to enhance LLM/VLM/Agentic AI capabilities in reasoning, planning, and autonomous decision‑making.
  • Collaborate with Engineers and researchers to integrate RL solutions into enterprise AI platforms.
  • Monitor model performance in production and continuously improve through iterative training and fine‑tuning.
Requirements
  • Master’s Degree in Computer Science, Applied Mathematics, Machine Learning, or related fields.
  • 5+ years of hands‑on experience in RL and [either 1: LLM/VLM/Agentic AI] optimization.
  • Strong coding skills in Python, with experience in ML frameworks and RL libraries.
  • Experience with large-scale distributed training and optimization.
  • Self‑driven, ownership mindset, and strong problem‑solving skills. Excellent communication skills for cross‑functional collaboration.
Why Binance

• Shape the future with the world’s leading blockchain ecosystem

• Collaborate with world-class talent in a user‑centric global organization with a flat structure

• Tackle unique, fast‑paced projects with autonomy in an innovative environment

• Thrive in a results‑driven workplace with opportunities for career growth and continuous learning

• Competitive salary and company benefits

• Work‑from‑home arrangement (the arrangement may vary depending on the work nature of the business team)

Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.

By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.

  • 1
  • ...
  • 11
  • 12
  • 13
  • ...
  • 31

* The salary benchmark is based on the target salaries of market leaders in their relevant sectors. It is intended to serve as a guide to help Premium Members assess open positions and to help in salary negotiations. The salary benchmark is not provided directly by the company, which could be significantly higher or lower.

Job Search and Career Advice Platform

Empoweringjob seekers

Tools
  • Jobs
  • Resume review
  • Headhunters
  • Browse jobs
Company
  • About us
  • Careers at JobLeads
  • Site notice
  • Reviews
Support
  • Help
  • Partner integration
  • ATS Partners
Social
  • YouTube
  • LinkedIn
  • Instagram
  • Facebook
  • Privacy Policy
  • Terms of Use

© JobLeads 2007 - 2025 | All rights reserved