File size: 1,196 Bytes
8d88deb
 
0aca48e
 
52c6809
8d88deb
0aca48e
8d88deb
 
52c6809
8d88deb
0aca48e
 
8d88deb
 
 
5eb40a3
 
 
 
 
c53337b
 
98cad23
c53337b
5eb40a3
d424223
 
 
 
c53337b
98cad23
c53337b
98cad23
7379eee
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
---
title: Apertus on FastAPI
emoji: πŸŒ–
colorFrom: red
colorTo: gray
sdk: docker
app_port: 8000
fullWidth: false
suggested_storage: small
suggested_hardware: t4-medium
tags:
- apertus
license: apache-2.0
---


Apertus transformer on FastAPI
------------------------------

A FastAPI-based Python application that provides an API to interface with the Apertus LLM from the Swiss AI Initiative.

The [OpenAI-compatible API](https://medium.com/data-science/how-to-build-an-openai-compatible-api-87c8edea2f06)
is inspired by [openai-compatible-fastapi](https://github.com/ritun16/openai-compatible-fastapi).

The goal is to use the [Apertus Format API](https://github.com/swiss-ai/apertus_format) in a lightweight service with an efficient, paged attention mechanism.

If you're debugging locally, it may help you to use [tiny-random/apertus](https://huggingface.co/tiny-random/apertus) instead of the full model.

## Terms of use

For more information on Apertus, go to https://huggingface.co/swiss-ai

This is an open source project under the [Apache 2.0 License](LICENSE)

If you have further suggestions please leave them on [Codeberg Issues](https://codeberg.org/loleg/fastapi-apertus/issues)