MSRL Breaking the SFT Plateau: Multimodal Structured Reinforcement Learning for Chart-to-Code Generation DocTron/MSRL-SFT 8B โข Updated Aug 26, 2025 โข 6 DocTron/MSRL 8B โข Updated Aug 26, 2025 โข 7
Chart-R1 Chart-R1: Chain-of-Thought Supervision and Reinforcement for Advanced Chart Reasoner DocTron/Chart-R1 8B โข Updated Jul 25, 2025 โข 3 โข 2 DocTron/Chart-COT 8B โข Updated Jul 25, 2025 โข 1
MSRL Breaking the SFT Plateau: Multimodal Structured Reinforcement Learning for Chart-to-Code Generation DocTron/MSRL-SFT 8B โข Updated Aug 26, 2025 โข 6 DocTron/MSRL 8B โข Updated Aug 26, 2025 โข 7
Chart-R1 Chart-R1: Chain-of-Thought Supervision and Reinforcement for Advanced Chart Reasoner DocTron/Chart-R1 8B โข Updated Jul 25, 2025 โข 3 โข 2 DocTron/Chart-COT 8B โข Updated Jul 25, 2025 โข 1