13 2

Lei, Ding

orlando23

https://llv22.github.io/orlando.github.io/

llv22

AI & ML interests

Machine Learning, NLP, dialog system

Recent Activity

upvoted an article 20 days ago

A Guide to Hugging Face’s Papers Page

updated a collection 3 months ago

upvoted a paper 3 months ago

Complementary Reinforcement Learning

View all activity

Organizations

Collections 2

models 3

datasets 5

orlando23/AgentTraj-L_forward

Viewer • Updated Aug 26, 2024 • 14.5k • 38

orlando23/AgentEval_forward

Preview • Updated Aug 26, 2024 • 19

orlando23/screendata

Updated Jul 1, 2024 • 44

orlando23/mobile_pc_web_osworld

Updated Jul 1, 2024 • 24

orlando23/failed_agent_trajectory

Viewer • Updated Jul 1, 2024 • 288 • 294

Lei, Ding

AI & ML interests

Recent Activity

Organizations

Collections 2

ManCAR: Manifold-Constrained Latent Reasoning with Adaptive Test-Time Computation for Sequential Recommendation

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use

π-StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs

ManCAR: Manifold-Constrained Latent Reasoning with Adaptive Test-Time Computation for Sequential Recommendation

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use

π-StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs

models 3

orlando23/Qwen3-4B-Instruct-2507

orlando23/Qwen3-4B-Thinking-2507

orlando23/vit-base-patch16-224-in21k-finetuned-lora-food101

datasets 5

orlando23/AgentTraj-L_forward

orlando23/AgentEval_forward

orlando23/screendata

orlando23/mobile_pc_web_osworld

orlando23/failed_agent_trajectory

Lei, Ding

AI & ML interests

Recent Activity

Organizations

Collections 2

models 3 Sort: Recently updated

datasets 5 Sort: Recently updated

models 3

datasets 5