SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks Paper • 2605.31433 • Published 3 days ago • 9
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper • 2412.04626 • Published Dec 5, 2024 • 14
Chitrarth: Bridging Vision and Language for a Billion People Paper • 2502.15392 • Published Feb 21, 2025
LitLLMs, LLMs for Literature Review: Are we there yet? Paper • 2412.15249 • Published Dec 15, 2024 • 2
IndicVisionBench: Benchmarking Cultural and Multilingual Understanding in VLMs Paper • 2511.04727 • Published Nov 6, 2025
VoiceAgentBench: Are Voice Assistants ready for agentic tasks? Paper • 2510.07978 • Published Oct 9, 2025
Seeing Straight: Document Orientation Detection for Efficient OCR Paper • 2511.04161 • Published Nov 6, 2025
Designing Production-Scale OCR for India: Multilingual and Domain-Specific Systems Paper • 2602.16430 • Published Feb 18
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation Paper • 2502.20420 • Published Feb 27, 2025
EvoClaw: Evaluating AI Agents on Continuous Software Evolution Paper • 2603.13428 • Published Mar 13 • 21
LLM2Vec-Gen: Generative Embeddings from Large Language Models Paper • 2603.10913 • Published Mar 11 • 44
Good news! Ulysses Sequence Parallelism from the Snowflake AI Research and DeepSpeed teams has been integrated into HuggingFace Trainer, Accelerate and TRL. For extensive details, please see this writeup: https://huggingface.co/blog/ulysses-sp. Thanks a lot to Kashif Rasul for helping make it happen, and to the others on the HF team who helped with the integration.