HierSVA: A Data Synthesis Pipeline, Dataset, and Benchmark for LLM-Driven Hierarchical Hardware Formal Verification Paper • 2606.13706 • Published 10 days ago
Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regression Paper • 2510.01450 • Published Oct 1, 2025 • 2
Parallax: Parameterized Local Linear Attention for Language Modeling Paper • 2605.29157 • Published 23 days ago • 11
ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning Paper • 2603.10160 • Published Mar 10 • 26
HLE Leaderboard for Agents with Tools 🥇 Humanity's Last Exam Leaderboard for LLM Agents with Tools Space • Running • Agents • 5
Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs Paper • 2503.06342 • Published Mar 8, 2025 • 1
SeerAttention/SeerAttention-Llama-3.1-8B-AttnGates Text Generation • Updated Mar 3, 2025 • 559 • 4
EN-T: Optimizing Tensor Computing Engines Performance via Encoder-Based Methodology Paper • 2404.11887 • Published Apr 18, 2024
LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration Paper • 2408.06003 • Published Aug 12, 2024