OpenThinkerAgent-32B SFT data-scaling ladder (models + matching datasets, 316->100K) plus TaskTrove & AgentTrove sources.
-
open-thoughts/OpenThinkerAgent-32B-SFT-316
Text Generation • 677k • Updated • 12 -
open-thoughts/OpenThinkerAgent-32B-SFT-1K
Text Generation • 677k • Updated • 11 -
open-thoughts/OpenThinkerAgent-32B-SFT-3.16K
Text Generation • 677k • Updated • 16 -
open-thoughts/OpenThinkerAgent-32B-SFT-10K
Text Generation • 677k • Updated • 30