Tong Zhu

Spico
https://Spico197.github.io
  • TongZhu197
  • Spico197

AI & ML interests

Information Extraction, Mixture-of-Experts, LLM

Recent Activity

upvoted an article 12 days ago
Your MoE Model Does Not Have to Select Fixed Number of Experts
published an article 12 days ago
Your MoE Model Does Not Have to Select Fixed Number of Experts
upvoted an article 27 days ago
Transformers v5: Simple model definitions powering the AI ecosystem

Organizations

  • SUDA-HUAWEI Joint Project
  • REx Team in Soochow University
  • LLaMA-MoE
  • MoE-Dynamic-Routing

Spico's models (7)

Spico/LLaMA-MoE-v1-2_8-UniformSFT

Text Generation • 7B • Updated Feb 28, 2024 • 3

Spico/LLaMA-MoE-v1-2_8-DynamicSFT

Text Generation • 7B • Updated Feb 28, 2024

Spico/sheared-llama-2.7b-deita-6k-sft

Text Generation • 3B • Updated Feb 25, 2024 • 1

Spico/internlm2-7b-hf-llama

Text Generation • Updated Feb 23, 2024 • 1

Spico/mirror-chinese-mrcqa-alpha

Updated Dec 4, 2023

Spico/Humback-Myx

Text Generation • Updated Aug 19, 2023 • 8 • 3

Spico/Humback-M0

Text Generation • Updated Aug 18, 2023 • 4 • 3