TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling Paper • 2508.17445 • Published Aug 24, 2025 • 80
samuelcardillo/Qwen3-Coder-Next-Opus-4.6-Reasoning-Distilled 6.21M • Updated 20 days ago • 1.58k • 10
HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive Image-Text-to-Text • 35B • Updated 18 days ago • 1.37M • 1.36k