feihu.hf
commited on
Commit
·
5576448
1
Parent(s):
0ac4048
update README
Browse files
README.md
CHANGED
|
@@ -227,7 +227,7 @@ For full technical details, see the [Qwen2.5-1M Technical Report](https://arxiv.
|
|
| 227 |
|
| 228 |
#### Step 1: Update Configuration File
|
| 229 |
|
| 230 |
-
|
| 231 |
|
| 232 |
```bash
|
| 233 |
export MODELNAME=Qwen3-30B-A3B-Thinking-2507
|
|
|
|
| 227 |
|
| 228 |
#### Step 1: Update Configuration File
|
| 229 |
|
| 230 |
+
Download the model and replace the content of your `config.json` with `config_1m.json`, which includes the config for length extrapolation and sparse attention.
|
| 231 |
|
| 232 |
```bash
|
| 233 |
export MODELNAME=Qwen3-30B-A3B-Thinking-2507
|