nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated 12 days ago • 258k • 320
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob Viewer • Updated Jan 15 • 435k • 1.33k • 58
KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs Paper • 2601.01046 • Published Jan 3 • 14
MediaTek-Research/Breeze-ASR-25 Automatic Speech Recognition • 2B • Updated Jul 8, 2025 • 7.19k • 114
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models Paper • 2511.11007 • Published Nov 14, 2025 • 15
view article Article We’re open-sourcing our text-to-image model and the process behind it Nov 12, 2025 • 96