Cannot summarize 8000 tokens

2
#22 opened 21 days ago by
kalle07

llama.cpp support

🚀 7
#21 opened 23 days ago by
ngxson

vLLM throws an error on startup

1
#20 opened 26 days ago by
qinghuiyyds

Update README.md

#17 opened 28 days ago by
byjiang1996

It runs well on Colab T4

10
#16 opened 30 days ago by
asdgad

4bit

1
#15 opened 30 days ago by
asdgad

run colab t4 but

5
#14 opened about 1 month ago by
asdgad

not run

👀 1
1
#13 opened about 1 month ago by
asdgad

Question regarding the FP8 version

1
#9 opened about 1 month ago by
thecr7guy

vLLM error

8
#8 opened about 1 month ago by
ccernat

It's really top_k = 2?

👍 2
1
#6 opened about 1 month ago by
CHNtentes

The demo script loads forever.

3
#1 opened about 1 month ago by
AliceThirty