Alex Steiner

einsteiner1983 · pst2154

AI & ML interests

Data Science

Organizations

New activity in lukealonso/MiniMax-M2.5-NVFP4 4 months ago

KeyError: '110.w1.input_scale' with TRT

#3 opened 4 months ago by

New activity in galileo-ai/agent-leaderboard 8 months ago

https://huggingface.co/nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8

#7 opened 10 months ago by

New activity in google/gemma-3-27b-it-qat-q4_0-gguf 10 months ago

Where is the config.json?

#17 opened 10 months ago by

New activity in moonshotai/Kimi-K2-Instruct 11 months ago

Run 1T-param on A100/H100(80G)x8 using FP4

#9 opened 11 months ago by

New activity in nvidia/Llama-3_3-Nemotron-Super-49B-v1 about 1 year ago

How to combine `thinking on/off` prompt with existing system prompt.

#8 opened about 1 year ago by

New activity in microsoft/Phi-3-mini-128k-instruct almost 2 years ago

When input tokens < 4096 but total input+output tokens > 4096, the model produces poor output

#85 opened almost 2 years ago by