PaddlePaddle/PaddleOCR-VL-1.6 Image-Text-to-Text • 1.0B • Updated about 12 hours ago • 1.17k • 98
Enabling Versatile Controls for Video Diffusion Models Paper • 2503.16983 • Published Mar 21, 2025 • 15
PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks Paper • 2503.04065 • Published Mar 6, 2025
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published Sep 2, 2025 • 238
view article Article From GRPO to DAPO and GSPO: What, Why, and How NormalUhr • Aug 9, 2025 • 121
baidu/ERNIE-4.5-VL-424B-A47B-Base-Paddle Image-Text-to-Text • 424B • Updated Aug 19, 2025 • 51 • 68