Spaces:
Runtime error
Runtime error
File size: 1,196 Bytes
8d88deb 0aca48e 52c6809 8d88deb 0aca48e 8d88deb 52c6809 8d88deb 0aca48e 8d88deb 5eb40a3 c53337b 98cad23 c53337b 5eb40a3 d424223 c53337b 98cad23 c53337b 98cad23 7379eee |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 |
---
title: Apertus on FastAPI
emoji: π
colorFrom: red
colorTo: gray
sdk: docker
app_port: 8000
fullWidth: false
suggested_storage: small
suggested_hardware: t4-medium
tags:
- apertus
license: apache-2.0
---
Apertus transformer on FastAPI
------------------------------
A FastAPI-based Python application that provides an API to interface with the Apertus LLM from the Swiss AI Initiative.
The [OpenAI-compatible API](https://medium.com/data-science/how-to-build-an-openai-compatible-api-87c8edea2f06)
is inspired by [openai-compatible-fastapi](https://github.com/ritun16/openai-compatible-fastapi).
The goal is to use the [Apertus Format API](https://github.com/swiss-ai/apertus_format) in a lightweight service with an efficient, paged attention mechanism.
If you're debugging locally, it may help you to use [tiny-random/apertus](https://huggingface.co/tiny-random/apertus) instead of the full model.
## Terms of use
For more information on Apertus, go to https://huggingface.co/swiss-ai
This is an open source project under the [Apache 2.0 License](LICENSE)
If you have further suggestions please leave them on [Codeberg Issues](https://codeberg.org/loleg/fastapi-apertus/issues)
|