MoshiRAG Release Collection Candle & PyTorch model checkpoints released as part of the MoshiRAG release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi-rag • 2 items • Updated 2 days ago • 1
Llama-Mimi Collection Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens • 3 items • Updated Sep 19, 2025 • 1
Jagle Collection Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision–Language Models • 5 items • Updated 20 days ago • 1
Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision-Language Models Paper • 2604.02048 • Published about 1 month ago • 1
JAMMEval: A Refined Collection of Japanese Benchmarks for Reliable VLM Evaluation Paper • 2604.00909 • Published Apr 1 • 1
Nemotron-Terminal Collection We are releasing Nemotron-Terminal models and training datasets. • 5 items • Updated 12 days ago • 34
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces Paper • 2601.11868 • Published Jan 17 • 36
WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models Paper • 2510.22276 • Published Oct 25, 2025 • 3
WAON Collection WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models • 4 items • Updated Mar 2 • 2
Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens Paper • 2509.14882 • Published Sep 18, 2025 • 2