Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published 16 days ago • 40
Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Feb 11, 2025 • 119
MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models Paper • 2401.16745 • Published Jan 30, 2024 • 1
dealignai/Gemma-4-31B-JANG_4M-CRACK Image-Text-to-Text • 6B • Updated about 7 hours ago • 143k • 1.25k
HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive Image-Text-to-Text • 35B • Updated 12 days ago • 1.22M • 1.32k
BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs Paper • 2604.02045 • Published 15 days ago • 33
BidirLM-Embedding Collection BidirLM is a family of 5 frontier bidirectional encoders, including an omnimodal variant at 2.5B. • 6 items • Updated 10 days ago • 5
Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face Feb 11, 2025 • 117