deepseek-ai/DeepSeek-R1-0528
Text Generation
•
685B
•
Updated
•
421k
•
•
2.39k
Models that I personally recommend.
Note My recommendation for the big-several-hundred-B MoE size class. I run it in non-thinking mode with assistant prefill. It's a bit too slow to run in thinking mode locally on CPU+GPU or CPU.
Note Focusing on applying enough heat and pressure to dry, assistant-tuned models until they turn into creative writing gems!
Note Focusing on applying enough heat and pressure to dry, assistant-tuned models until they turn into creative writing gems!
Note Focusing on applying enough heat and pressure to dry, assistant-tuned models until they turn into creative writing gems!