Instructions to use FreedomIntelligence/Apollo-0.5B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use FreedomIntelligence/Apollo-0.5B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="FreedomIntelligence/Apollo-0.5B") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("FreedomIntelligence/Apollo-0.5B") model = AutoModelForCausalLM.from_pretrained("FreedomIntelligence/Apollo-0.5B") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use FreedomIntelligence/Apollo-0.5B with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "FreedomIntelligence/Apollo-0.5B" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "FreedomIntelligence/Apollo-0.5B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/FreedomIntelligence/Apollo-0.5B
- SGLang
How to use FreedomIntelligence/Apollo-0.5B with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "FreedomIntelligence/Apollo-0.5B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "FreedomIntelligence/Apollo-0.5B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "FreedomIntelligence/Apollo-0.5B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "FreedomIntelligence/Apollo-0.5B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use FreedomIntelligence/Apollo-0.5B with Docker Model Runner:
docker model run hf.co/FreedomIntelligence/Apollo-0.5B
Commit History
Update README.md 563809e verified
Upload result.png 3b847a7 verified
Update README.md ed6f1dd verified
Upload result.png 8928296 verified
Upload README.md 71a127b verified
Update README.md 06a6932 verified
Update README.md 5828669 verified
Update README.md 843d9b1 verified
Nuo Chen commited on
Update README.md 58c9f59 verified
Nuo Chen commited on
Update config.json 7c51fb2 verified
Nuo Chen commited on
Update README.md 6a37d14 verified
Upload tokenizer da0cc81 verified
Nuo Chen commited on
Update README.md 536fb42 verified
Update README.md de41384 verified
Upload dataset.png 07d1f9b verified
Upload result.png 8ed45ff verified
Update README.md 2a71ddf verified
Update README.md 91303f9 verified
Upload Qwen2ForCausalLM c76040b verified
Nuo Chen commited on
Upload Qwen2ForCausalLM 9962346 verified
Nuo Chen commited on
Update README.md 0432d4a verified
Update README.md cd25a1a verified
Update README.md 99614b7 verified
Update config.json 5b21000 verified
Update README.md bdabf65 verified
Update README.md 801d4f4 verified
Update README.md 9981482 verified
Upload result.png 65b96eb verified
Upload apollo_medium_final.png 20bb405 verified
Create assets/logo f3636b5 verified
Update README.md fbe068d verified
Update README.md c9952fd verified
Delete README .md 263c152 verified
Upload README .md bfb9a90 verified
Upload Qwen2ForCausalLM f5171f6 verified
Nuo Chen commited on