Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs Paper • 2510.11062 • Published Oct 13 • 28
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning Paper • 2505.24726 • Published May 30 • 277
view article Article Building Your Own AI Document Dream Team: A Generic Multi-Agent System Apr 8 • 6
view article Article Fine-Tune Meta Llama 3.2-Vision-Instruct Multimodal LLM on Intel Accelerators Jan 28 • 8
view article Article Model Card Generator Interface: Crafting Clear Insights into AI Models Sep 27, 2024 • 4
view article Article Fine Tuning a LLM Using Kubernetes with Intel® Gaudi® Accelerator Sep 9, 2024 • 7
view article Article Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging Aug 19, 2024 • 79
view article Article Fine Tuning a LLM Using Kubernetes with Intel® Xeon® Scalable Processors Apr 24, 2024 • 7