TorchAO: PyTorch-Native Training-to-Serving Model Optimization Paper β’ 2507.16099 β’ Published Jul 21, 2025 β’ 7
Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs Paper β’ 2410.08806 β’ Published Oct 11, 2024 β’ 1
Compiler generated feedback for Large Language Models Paper β’ 2403.14714 β’ Published Mar 18, 2024 β’ 7
Priority Sampling of Large Language Models for Compilers Paper β’ 2402.18734 β’ Published Feb 28, 2024 β’ 19
Large Language Models for Compiler Optimization Paper β’ 2309.07062 β’ Published Sep 11, 2023 β’ 24
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine Paper β’ 2206.10558 β’ Published Jun 21, 2022 β’ 2
view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 Aug 8, 2025 β’ 92
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper β’ 2507.05687 β’ Published Jul 8, 2025 β’ 30
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels Aug 18, 2025 β’ 91