RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic Text Generation β’ 8B β’ Updated 12 days ago β’ 26.8k β’ 9
view article Article Building Tensors from Scratch in Rust (Part 1.2): View Operations Jun 18, 2025 β’ 4
Running 595 Scaling test-time compute π 595 Run advanced search strategies to boost LLM problem solving
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B Text Generation β’ 8B β’ Updated May 29, 2025 β’ 151k β’ β’ 1.04k
Search-R1 Collection Preliminary checkpoints with outcome-only RL. β’ 15 items β’ Updated Aug 12, 2025 β’ 17
Skywork/Skywork-Reward-Llama-3.1-8B-v0.2 Text Classification β’ 8B β’ Updated Oct 25, 2024 β’ 54.5k β’ 42
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper β’ 2502.11089 β’ Published Feb 16, 2025 β’ 169
meta-llama/Llama-3.3-70B-Instruct Text Generation β’ 71B β’ Updated Dec 21, 2024 β’ 411k β’ β’ 2.68k