metadata
license: apache-2.0
base_model:
- meta-llama/Llama-3.1-8B-Instruct
Accuracy
| Task | Context Length | meta-llama/ Llama-3.1-8B-Instruct |
Llama-3.1-8B-Instruct- FP8-dynamic- QKV-Cache-FP8- Per-Head |
Llama-3.1-8B-Instruct- FP8-dynamic- QKV-Cache-FP8- Per-Tensor |
Llama-3.1-8B-Instruct- QKV-Cache-FP8- Per-Head |
Llama-3.1-8B-Instruct- QKV-Cache-FP8- Per-Tensor |
|---|---|---|---|---|---|---|
| NIAH Single 2 |
4096 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
| 16384 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | |
| 32768 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | |
| 65536 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | |
| 131072 | 99.2 | 99.6 | 99.4 | 99.4 | 99.0 |