Fugaku-LLM
community
Followers: 83
AI & ML interests
None defined yet.
Recent Activity
Taishi-N324 authored a paper 2 days ago: On the Optimal Reasoning Length for RL-Trained Language Models
Taishi-N324 authored a paper 4 months ago: MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources
Taishi-N324 authored a paper 6 months ago: Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
Team members: 10
models: 3
Fugaku-LLM/Fugaku-LLM-13B • Text Generation • Updated Jan 10, 2025 • 130
Fugaku-LLM/Fugaku-LLM-13B-instruct-gguf • 13B • Updated May 9, 2024 • 29 • 41
Fugaku-LLM/Fugaku-LLM-13B-instruct • Text Generation • 13B • Updated May 9, 2024 • 117 • 28
datasets: 0
None public yet