gmongaras/medium_8192sl_gpu_64bs__squared__sm_norm__A_mask_type_neg_softplus__in_conv_k_2__att2 3B • Updated 18 days ago • 28 • 1
2Mamba2Furious: Linear in Complexity, Competitive in Accuracy Paper • 2602.17363 • Published 28 days ago • 8 • 4
2Mamba2Furious: Linear in Complexity... Collection Pretrained models for the paper 2Mamba2Furious: Linear in Complexity, Competitive in Accuracy (https://arxiv.org/abs/2602.17363) • 4 items • Updated 25 days ago • 1
2Mamba2Furious: Linear in Complexity, Competitive in Accuracy Paper • 2602.17363 • Published 28 days ago • 8