view article Article We Got Claude to Build CUDA Kernels and teach open models! +2 18 days ago β’ 138
Running 3.69k The Ultra-Scale Playbook π 3.69k The ultimate guide to training LLM on large GPU Clusters