Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis Paper β’ 2601.14253 β’ Published 9 days ago β’ 9
V-DPM: 4D Video Reconstruction with Dynamic Point Maps Paper β’ 2601.09499 β’ Published 15 days ago β’ 9
UM-Text: A Unified Multimodal Model for Image Understanding Paper β’ 2601.08321 β’ Published 16 days ago β’ 8
ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation Paper β’ 2601.03955 β’ Published 22 days ago β’ 3
FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation Paper β’ 2512.24724 β’ Published 29 days ago β’ 7
Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow Paper β’ 2512.24766 β’ Published 29 days ago β’ 9
What matters for Representation Alignment: Global Information or Spatial Structure? Paper β’ 2512.10794 β’ Published Dec 11, 2025 β’ 9
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models Paper β’ 2512.07843 β’ Published Nov 24, 2025 β’ 22
Runtime error MCP Featured 142 LongCat Image Edit π 142 Generate or edit images using text prompts
Runtime error MCP Featured 142 LongCat Image Edit π 142 Generate or edit images using text prompts
Running on Zero Featured 169 VibeVoice-Realtime-0.5B π¨ 169 Generate natural-sounding speech from text
Running on Zero Featured 169 VibeVoice-Realtime-0.5B π¨ 169 Generate natural-sounding speech from text