Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
254.6
TFLOPS
21
GadflyII
GadflyII
Follow
focuzz8's profile picture
pramjana's profile picture
AlexGS74's profile picture
27 followers
·
1 following
AI & ML interests
None yet
Recent Activity
new
activity
23 days ago
GadflyII/GLM-4.7-Flash-MTP-NVFP4:
SGLang and MTP
new
activity
about 1 month ago
GadflyII/Qwen3-Coder-Next-NVFP4:
Model requests?
new
activity
about 1 month ago
GadflyII/Qwen3-Coder-Next-NVFP4:
Why Your NVFP4 Model Is Slower Than FP8 on the GB10 (NVIDIA Spark) — And How to Fix It
View all activity
Organizations
GadflyII
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
GadflyII/GLM-4.7-Flash-MTP-NVFP4
23 days ago
SGLang and MTP
1
#2 opened about 1 month ago by
Michalea
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
about 1 month ago
Model requests?
12
#4 opened about 2 months ago by
pathosethoslogos
Why Your NVFP4 Model Is Slower Than FP8 on the GB10 (NVIDIA Spark) — And How to Fix It
🤯
👍
4
3
#5 opened about 1 month ago by
scottgl
New activity in
GadflyII/GLM-4.6V-NVFP4
about 1 month ago
Fails on a single DGX spark with errors below
1
#2 opened about 1 month ago by
Adrian1234
New activity in
GadflyII/GLM-4.7-Flash-MXFP4
about 2 months ago
Update MXFP4 format to compressed-tensors
1
#3 opened about 2 months ago by
mgoin
New activity in
lukealonso/MiniMax-M2.5-NVFP4
about 2 months ago
Here's the vLLM recipe I'm using with 2x RTX Pro 6000
👍
3
17
#1 opened about 2 months ago by
zenmagnets
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
about 2 months ago
MMLU PRO Benchmark
3
#3 opened about 2 months ago by
sevapru
vLLM 0.16?
1
#2 opened about 2 months ago by
MMaxHugg
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
2 months ago
Memory
1
#1 opened 2 months ago by
struxx
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
2 months ago
confused response
7
#8 opened 2 months ago by
jiangyizhi
updated
a model
2 months ago
GadflyII/Qwen3-Coder-Next-NVFP4
Text Generation
•
Updated
Feb 4
•
411k
•
39
published
a model
2 months ago
GadflyII/Qwen3-Coder-Next-NVFP4
Text Generation
•
Updated
Feb 4
•
411k
•
39
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
2 months ago
MTP quality, 47 layer
3
#7 opened 2 months ago by
Michalea
updated
a model
2 months ago
GadflyII/GLM-4.7-Flash-MTP-NVFP4
Text Generation
•
19B
•
Updated
Feb 2
•
633
•
5
New activity in
GadflyII/GLM-4.7-Flash-MTP-NVFP4
2 months ago
Upload folder using huggingface_hub
#1 opened 2 months ago by
GadflyII
published
a model
2 months ago
GadflyII/GLM-4.7-Flash-MTP-NVFP4
Text Generation
•
19B
•
Updated
Feb 2
•
633
•
5
New activity in
GadflyII/GLM-4.6V-NVFP4
2 months ago
Well done nvfp4 quant
2
#1 opened 2 months ago by
josephbreda
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
2 months ago
Can't deploy by vllm 0.14.1 + transformers
8
#6 opened 2 months ago by
Butterfly-314
New activity in
GadflyII/GLM-4.7-Flash-MXFP4
2 months ago
can not run
4
#1 opened 2 months ago by
aliez-ren
updated
a model
2 months ago
GadflyII/MiniMax-M2.1-NVFP4
Text Generation
•
Updated
Jan 26
•
65
•
6
Load more