CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning Paper β’ 2512.02551 β’ Published 6 days ago β’ 11
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper β’ 2511.18538 β’ Published 15 days ago β’ 242
Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch Paper β’ 2512.02395 β’ Published 6 days ago β’ 43
FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions Paper β’ 2509.17177 β’ Published Sep 21 β’ 13
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper β’ 2511.06221 β’ Published 29 days ago β’ 128
Emu3.5 Collection Native Multimodal Models are World Learners π β’ 4 items β’ Updated 25 days ago β’ 71
Emu3.5: Native Multimodal Models are World Learners Paper β’ 2510.26583 β’ Published Oct 30 β’ 106
Uniform Discrete Diffusion with Metric Path for Video Generation Paper β’ 2510.24717 β’ Published Oct 28 β’ 39
Reasoning Efficiency Research Collection Ultra-efficient reasoning model! SOTA Accuracy / CoT Length trade-offs β’ 3 items β’ Updated 4 days ago β’ 10
view article Article `LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot` +9 Sep 16 β’ 47
Glyph: Scaling Context Windows via Visual-Text Compression Paper β’ 2510.17800 β’ Published Oct 20 β’ 67
CommonForms: A Large, Diverse Dataset for Form Field Detection Paper β’ 2509.16506 β’ Published Sep 20 β’ 19
The Ultimate Collection of Code Classifiers Collection π₯ 15 classifiers, 124M parameters, one per programming languageβ for assessing the educational value of GitHub code β’ 15 items β’ Updated May 5 β’ 15
EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling Paper β’ 2509.23909 β’ Published Sep 28 β’ 31
DataDecide Collection A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. β’ 358 items β’ Updated 9 days ago β’ 21
MolmoAct Collection All models for the MolmoAct (Multimodal Open Language Model for Action) release. β’ 10 items β’ Updated 9 days ago β’ 31
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources Paper β’ 2509.25531 β’ Published Sep 29 β’ 7