Aletheia-Bench/DPO-Think-14B
Text Generation
• 15B • Updated • 4
• 1
Aletheia-Bench/DPO-Think-1.5B
Text Generation
• 2B • Updated • 5
Aletheia-Bench/BatchOnline-GRPO-7B
Text Generation
• 8B • Updated • 1
• 1
Aletheia-Bench/BatchOnline-GRPO-14B
Text Generation
• 15B • Updated • 4
• 1
Aletheia-Bench/BatchOnline-GRPO-1.5B
Text Generation
• 2B • Updated • 4
Aletheia-Bench/GRPO-Think-14B-8k
Text Generation
• 15B • Updated • 1
• 1
Aletheia-Bench/GRPO-Think-7B-8k
Text Generation
• 8B • Updated • 1
Aletheia-Bench/GRPO-Think-14B-4k
Text Generation
• 15B • Updated
Aletheia-Bench/GRPO-Think-1.5B-8k
Text Generation
• 2B • Updated • 1
Aletheia-Bench/GRPO-Think-7B-4k
Text Generation
• 8B • Updated • 2
Aletheia-Bench/GRPO-Think-1.5B-4k
Text Generation
• 2B • Updated • 1
• 15B • Updated
Aletheia-Bench/GRPO-Instruct-14B
Text Generation
• 15B • Updated • 1
Aletheia-Bench/GRPO-Instruct-1.5B
Text Generation
• 2B • Updated • 2
Aletheia-Bench/GRPO-Instruct-7B
Text Generation
• 8B • Updated • 1
2B • Updated • 1
Aletheia-Bench/DPO-Think-7B
Text Generation
• 8B • Updated • 2
Aletheia-Bench/GRPO-Think-14B-16k
Text Generation
• 15B • Updated • 2
Aletheia-Bench/GRPO-Think-1.5B-16k
Text Generation
• 2B • Updated • 4
Aletheia-Bench/GRPO-Think-7B-16k
Text Generation
• 8B • Updated • 2