trackld/Math12K_low_1.5B_lr1.25e-6_bs1_gas_1_2GPU Text Generation • 2B • Updated about 17 hours ago • 5
trackld/Math12K_high_1.5B_lr1.25e-6_bs1_gas_1_2GPU Text Generation • 2B • Updated about 17 hours ago • 7
trackld/Math12K_high_1.5B_lr1.25e-6_bs1_gas_1_2GPU Text Generation • 2B • Updated about 17 hours ago • 7
trackld/Math12K_low_1.5B_lr1.25e-6_bs1_gas_1_2GPU Text Generation • 2B • Updated about 17 hours ago • 5
trackld/Math12K_low_3B_lr1.25e-6_bs1_gas_1_2GPU Text Generation • 242k • Updated about 17 hours ago • 7
trackld/Math12K_high_3B_lr1.25e-6_bs1_gas_1_2GPU Text Generation • 242k • Updated about 17 hours ago • 6
trackld/Math12K_low_3B_lr1.25e-6_bs1_gas_1_2GPU Text Generation • 242k • Updated about 17 hours ago • 7
trackld/Math12K_high_3B_lr1.25e-6_bs1_gas_1_2GPU Text Generation • 242k • Updated about 17 hours ago • 6