Atlas Inference

community

AI & ML interests

We built Atlas, a pure Rust LLM inference engine with custom CUDA kernels for the NVIDIA DGX Spark GB10, and we are gearing up for an open source release. 102 tok/s on Qwen3.5-35B 3x over vLLM on the same hardware, 2GB binary, 2 minute cold start (at LEAST 10x smaller and faster)

Recent Activity

nologik updated a Space 22 days ago

Atlas-Inference/README

AzeezIsh published a Space 3 months ago

Atlas-Inference/README

View all activity

Atlas-Inference 's datasets

None public yet

AI & ML interests

Recent Activity

Team members 2

Atlas-Inference 's datasets