Job Url: https://www.remoterocketship.com/company/binance/jobs/data-scientist-reinforcement-learning-united-states-remote

Job Description: 
Binance
Website
LinkedIn
All Job Openings
Binance is the world's leading cryptocurrency exchange, serving over 235 million registered users across more than 180 countries. The platform offers a wide array of services, including the trading of over 350 cryptocurrencies in Spot, Margin, and Futures markets. 

Users can also buy and sell crypto via Binance P2P, earn interest through Binance Earn, and engage in NFT trading on the Binance NFT marketplace. Binance provides low transaction fees and diverse payment options, making it a preferred choice for cryptocurrency enthusiasts worldwide. 

Digital Assets Exchange • Blockchain • Cryptocurrency Exchange • Bitcoin • Fintech


1001 - 5000 employees


Founded 2017

₿ Crypto

💳 Fintech

💰 Initial Coin Offering on 2020-12

Data Scientist, Reinforcement Learning
Yesterday

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

📊 Data Scientist

Apply Now


Receive Emails with Similar Jobs
Report problem
📋 Description
• Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users.
• You will develop and optimize RL models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning.
• You will explore and evaluate advanced algorithms including PPO, GRPO, DPO, RLHF, RLAIF, and Agentic RL to enhance the capabilities of LLMs, VLMs, and Agentic AI at Binance.
• The role requires a strong theoretical foundation in RL—covering policy optimization, reward modeling, and planning—paired with the engineering skills to build scalable production systems.
• You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking.
• Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.

🎯 Requirements
• Master’s degree in Computer Science, Applied Mathematics, Machine Learning, or related fields.
• 3+ years of hands-on experience in RL or LLM/VLM/Agentic AI optimization.
• Strong coding skills in Python, with experience in ML frameworks and RL libraries.
• Experience with large-scale distributed training and optimization.
• Self-driven, ownership mindset, and strong problem-solving skills. Excellent communication skills for cross-functional collaboration.

🏖️ Benefits
• Competitive salary and company benefits