Reinforcing Few-step Generators via Reward-Tilted Distribution Matching Paper • 2605.26108 • Published 5 days ago • 4 • 4
Reinforcing Few-step Generators via Reward-Tilted Distribution Matching Paper • 2605.26108 • Published 5 days ago • 4
Reinforcing Few-step Generators via Reward-Tilted Distribution Matching Paper • 2605.26108 • Published 5 days ago • 4
RTDMD Collection Reinforcing Few-step Generators via Reward-Tilted Distribution Matching • 3 items • Updated 4 days ago • 2
RTDMD Collection Reinforcing Few-step Generators via Reward-Tilted Distribution Matching • 3 items • Updated 4 days ago • 2
Reinforcing Few-step Generators via Reward-Tilted Distribution Matching Paper • 2605.26108 • Published 5 days ago • 4
RTDMD Collection Reinforcing Few-step Generators via Reward-Tilted Distribution Matching • 3 items • Updated 4 days ago • 2
GenRL Collection Model collections trained with our framework: https://github.com/ModelTC/GenRL • 3 items • Updated 2 days ago • 3
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models Paper • 2602.03392 • Published Feb 3 • 59
Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention Paper • 2602.04789 • Published Feb 4 • 4
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit Paper • 2405.06001 • Published May 9, 2024
Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing Paper • 2602.02159 • Published Feb 2 • 2