AI & ML interests
None yet
Organizations
kangdawei/MMR-Sigmoid-DAPO-7B
Text Generation
• 8B • Updated • 421
kangdawei/MMR-Sigmoid-DR-GRPO-8B
Text Generation
• 8B • Updated kangdawei/MMR-Sigmoid-DAPO-8B
Text Generation
• 8B • Updated • 6
kangdawei/MMR-Sigmoid-DAPO
Text Generation
• 2B • Updated • 6
kangdawei/MMR-Sigmoid-GRPO-8B
Text Generation
• 8B • Updated • 1
kangdawei/MMR-Sigmoid-GRPO-7B
Text Generation
• 8B • Updated • 3
kangdawei/MMR-Sigmoid-DR-GRPO-7B
Text Generation
• 8B • Updated Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 2
• 1
Text Generation
• 8B • Updated • 4
• 1
Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated • 2
Text Generation
• 2B • Updated • 1
Text Generation
• 8B • Updated • 4
Text Generation
• 2B • Updated • 5
Text Generation
• 2B • Updated • 1
Text Generation
• 8B • Updated Text Generation
• 8B • Updated kangdawei/Open-RS-DR_GRPO-8B
Text Generation
• 8B • Updated • 1
Text Generation
• 8B • Updated Text Generation
• 8B • Updated Text Generation
• 8B • Updated kangdawei/Open-RS-DR_GRPO-7B
Text Generation
• 8B • Updated • 1
Text Generation
• 8B • Updated Text Generation
• 8B • Updated • 1
Text Generation
• 8B • Updated Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated kangdawei/MMR-DR_GRPO-lambda-0.9
Text Generation
• 2B • Updated kangdawei/MMR-DR_GRPO-lambda-0.8
Text Generation
• 2B • Updated