view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 Jul 5, 2024 • 307
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10, 2025 • 190
VN-MTEB: Vietnamese Massive Text Embedding Benchmark Paper • 2507.21500 • Published Jul 29, 2025 • 1
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 130
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation Paper • 2404.19427 • Published Apr 30, 2024 • 74