I like how explanation for GRPO is just a giant formula wth no explanation of what it does
Ivan Nikishev
dpe1
AI & ML interests
he he he
Recent Activity
new activity 1 day ago
HuggingFaceTB/nanowhale-100m:Nice new activity 14 days ago
arnir0/Tiny-LLM:tiny-llm