Delphi Collection Marin's first open scaling suite. 88 base models, 3e18 β 1e23 FLOPs. https://openathena.ai/blog/delphi β’ 89 items β’ Updated 4 days ago β’ 8
Running 107 Unlocking On-Policy Distillation for Any Model Family π 107 Visualize on-policy distillation for any model family
Running Featured 85 Distilling 100B+ Models 40x Faster with TRL π 85 TRL distillation for 100B+ teachers, 40x faster
MedPsy Collection SOTA Medical and Healthcare text-only Small Language Models for Edge deployment β’ 4 items β’ Updated 16 days ago β’ 3
view article Article QVAC MedPsy: State-of-the-Art Medical and Healthcare Language Models for Edge Devices qvac β’ 16 days ago β’ 17
Running 17 Qwen-Scope: Decoding Intelligence, Unleashing Potential π¬ 17 Live SAE feature steering for Qwen3-1.7B
view article Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare +1 aaditya, pminervini, clefourrier β’ Apr 19, 2024 β’ 199
Learning, Fast and Slow: Towards LLMs That Adapt Continually Paper β’ 2605.12484 β’ Published 11 days ago β’ 17
Running on CPU Upgrade Featured 386 ML Intern π€ 386 Run an interactive ML assistant directly in your browser
OpenEnv India Hackathon top 100 Collection Top 100 Space submissions from the OpenEnv India Hackathon. β’ 98 items β’ Updated 10 days ago β’ 7
view article Article Mixture of Experts (MoEs) in Transformers +5 ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap β’ Feb 26 β’ 161