In a Training Loop 🔄
85.2
TFLOPS
222
94
229
Nicholas Broad
nbroad
120 followers · 83 following
nbroad1881
nbroad1881
nicholas-m-broad
nbroad.bsky.social
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 9 hours ago
nbroad/hf-inference-providers-data
liked
a dataset
about 12 hours ago
ianncity/KIMI-K2.5-1000000x
liked
a model
7 days ago
netflix/void-model
Organizations
nbroad's activity
New activity in
mistralai/Voxtral-4B-TTS-2603
7 days ago
Why not open source 😐
👍
11
16
#16 opened 14 days ago by
Deathgod7890
New activity in
mistralai/Ministral-3-14B-Instruct-2512
about 2 months ago
Add pipeline tag so it can be used in HF inference providers
#22 opened about 2 months ago by
nbroad
New activity in
Qwen/Qwen3-Next-80B-A3B-Instruct-FP8
3 months ago
experiment
#10 opened 3 months ago by
nbroad
New activity in
skt/A.X-K1
3 months ago
Update generation_config.json
❤️
👍
2
1
#5 opened 3 months ago by
nbroad
New activity in
lmsys/SGLang-EAGLE3-Qwen3-Next-80B-A3B-Instruct-FP8-SpecForge-Meituan
3 months ago
What version of SGLang is required for the current model?
4
#1 opened 3 months ago by
zephyrrrr
New activity in
Qwen/Qwen3-Coder-480B-A35B-Instruct
7 months ago
Update chat_template and tool_parser
3
#27 opened 8 months ago by
rshcao
New activity in
mistralai/Mistral-Small-3.2-24B-Instruct-2506
8 months ago
add-chat-template
10
#27 opened 9 months ago by
baseten-admin
--limit_mm_per_prompt 'image=10' is not a valid field
2
#30 opened 8 months ago by
jtvino
New activity in
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
8 months ago
Update chat_template.jinja to match tokenizer_config.json
3
#5 opened 8 months ago by
nbroad
New activity in
nbroad/longformer-base-health-fact
8 months ago
test
#4 opened 8 months ago by
nbroad
New activity in
Alibaba-NLP/gte-multilingual-base
9 months ago
HERE IS HOW YOU USE THIS WITH TEI OR INFERENCE ENDPOINTS
15
#7 opened over 1 year ago by
nbroad
New activity in
Qwen/Qwen3-235B-A22B-FP8
10 months ago
yarn scale to 122k context length
#5 opened 10 months ago by
nbroad
New activity in
Qwen/Qwen3-235B-A22B
10 months ago
yarn scale to 122,880 context length
#41 opened 10 months ago by
nbroad
New activity in
rica40325/10_14dpo
11 months ago
Add readme with pipeline_tag
#1 opened 11 months ago by
nbroad
New activity in
BAAI/bge-reranker-v2-m3
11 months ago
Is bge-reranker-v2-m3 pointwise, listwise, or pairwise methods?
3
#31 opened over 1 year ago by
Rebecca19990101
New activity in
meta-llama/Llama-3.3-70B-Instruct
12 months ago
Multiple Tool Calls?
1
#111 opened 12 months ago by
nbroad
New activity in
LGAI-EXAONE/EXAONE-Deep-2.4B
about 1 year ago
Chat template difference with 32b
3
#2 opened about 1 year ago by
nbroad
New activity in
ibm-research/re2g-reranker-nq
about 1 year ago
HOW TO USE WITH TEI
1
#3 opened about 2 years ago by
nbroad
New activity in
huggingface/brand-assets
over 1 year ago
Request for social media icon vector
3
#4 opened over 1 year ago by
umarbutler
New activity in
Qwen/Qwen2.5-Math-7B-Instruct
over 1 year ago
How to prevent degenerate output?
2
#2 opened over 1 year ago by
nbroad