PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation Paper • 2512.04025 • Published 6 days ago • 2 • 2
PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation Paper • 2512.04025 • Published 6 days ago • 2
BlockVid: Block Diffusion for High-Quality and Consistent Minute-Long Video Generation Paper • 2511.22973 • Published 11 days ago • 2
Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation Paper • 2511.20714 • Published 15 days ago • 45
VolSplat: Rethinking Feed-Forward 3D Gaussian Splatting with Voxel-Aligned Prediction Paper • 2509.19297 • Published Sep 23 • 24
Neighboring Autoregressive Modeling for Efficient Visual Generation Paper • 2503.10696 • Published Mar 12 • 8
Neighboring Autoregressive Modeling for Efficient Visual Generation Paper • 2503.10696 • Published Mar 12 • 8
ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality Paper • 2412.04062 • Published Dec 5, 2024 • 9
LongVLM: Efficient Long Video Understanding via Large Language Models Paper • 2404.03384 • Published Apr 4, 2024
MiniCache: KV Cache Compression in Depth Dimension for Large Language Models Paper • 2405.14366 • Published May 23, 2024 • 3
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI Paper • 2408.03361 • Published Aug 6, 2024 • 85
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI Paper • 2408.03361 • Published Aug 6, 2024 • 85
MiniCache: KV Cache Compression in Depth Dimension for Large Language Models Paper • 2405.14366 • Published May 23, 2024 • 3
Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM Paper • 2403.07487 • Published Mar 12, 2024 • 17