4 11 7

Ruize Zhang

Ruize-Zhang

zrz-sh

AI & ML interests

Interested in RL

Recent Activity

liked a model about 1 month ago

MiniMaxAI/MiniMax-M2.7

new activity 3 months ago

RLinf/WideSeek-R1-test-data:Update README.md

updated a dataset 3 months ago

RLinf/Wiki-2018-Corpus

View all activity

Organizations

liked a model about 1 month ago

MiniMaxAI/MiniMax-M2.7

Text Generation • 229B • Updated Apr 20 • 1.24M • • 1.16k

New activity in RLinf/WideSeek-R1-test-data 3 months ago

Update README.md

#3 opened 3 months ago by

xzxuan

updated a dataset 3 months ago

RLinf/Wiki-2018-Corpus

Updated Mar 13 • 1.9k

upvoted 2 papers 4 months ago

RLinf-USER: A Unified and Extensible System for Real-World Online Policy Learning in Embodied AI

Paper • 2602.07837 • Published Feb 8 • 57

RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation

Paper • 2509.15965 • Published Sep 19, 2025 • 19

updated a dataset 4 months ago

RLinf/WideSeek-R1-test-data

Viewer • Updated Mar 13 • 200 • 79

published a dataset 4 months ago

RLinf/WideSeek-R1-test-data

Viewer • Updated Mar 13 • 200 • 79

New activity in RLinf/WideSeek-R1-train-data 4 months ago

Add task categories and improve metadata

#1 opened 4 months ago by

nielsr

New activity in RLinf/WideSeek-R1-4b 4 months ago

Add library_name, pipeline_tag, and arxiv metadata

#1 opened 4 months ago by

nielsr

authored a paper 4 months ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published Feb 4 • 100

upvoted a paper 4 months ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published Feb 4 • 100

updated a collection 4 months ago

WideSeek-R1

Collection

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning • 5 items • Updated Mar 13

updated a dataset 4 months ago

RLinf/WideSeek-R1-train-data

Preview • Updated Mar 13 • 182 • 2

updated a model 4 months ago

RLinf/WideSeek-R1-4b

Text Generation • 4B • Updated Mar 13 • 28 • • 5

liked a dataset 5 months ago

inclusionAI/ASearcher-Local-Knowledge

Viewer • Updated Aug 6, 2025 • 45.2M • 6.1k • 8

liked 2 models 6 months ago

changyeon/pi0_robocasa_100demos_base_pytorch

4B • Updated Nov 17, 2025 • 9 • 1

youliangtan/gr00t-n1.5-robocasa-tabletop-posttrain

3B • Updated Sep 16, 2025 • 34 • 2

upvoted a paper 6 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 110

liked a dataset 7 months ago

gaia-benchmark/GAIA

Viewer • Updated Oct 28, 2025 • 932 • 47k • 676

upvoted a paper 7 months ago

π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

Paper • 2510.25889 • Published Oct 29, 2025 • 66