reasoning token
#4
by
lssj14
- opened
What is the reasoning token of granite model?
For example it is set as , in Qwen and DeepSeek.
Hi @lssj14 , thanks for your interest in the Granite family. A couple of notes about your question:
- This model is an accelerator (speculative decoder) for the Granite Code 20b model which does not support thinking
- None of the Granite Code family (Granite 2.x) support thinking
- In the Granite 3.x family, thinking was introduced in 3.2 (eg granite 3.2 2B)
- There is no single token for the 3.x family that stimulates thinking. Instead, you can enable/disable it via the
thinking=Trueflag to theapply_chat_templatefunction intransformers. Underneath, this translates to adding a section of prewritten system prompt to the user's system prompt, so if using the model behind a hosted chat API endpoint that does not support thethinkingargument (eg an OpenAI REST API), you can enable thinking by adding the respective piece of system prompt to your system prompt.
- NOTE: The system prompt snippet is different for 3.2 vs 3.3, so make sure to check the appropriate chat template to find the right snippet
lssj14
changed discussion status to
closed