Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
47.1
TFLOPS
15
1
7
Jeremy Haschal
JermemyHaschal
Follow
Mi6paulino's profile picture
1 follower
Β·
5 following
AI & ML interests
None yet
Recent Activity
reacted
to
OzTianlu
's
post
with π€
about 12 hours ago
O(1) inference is the foundational design of Spartacus-1B-Instruct π‘οΈ ! https://huggingface.co/NoesisLab/Spartacus-1B-Instruct We have successfully replaced the KV-cache bottleneck inherent in Softmax Attention with Causal Monoid State Compression. By defining the causal history as a monoid recurrence, , the entire prefix is lossily compressed into a fixed-size state matrix per head. The technical core of this architecture relies on the associativity of the monoid operator: Training: parallel prefix scan using Triton-accelerated JIT kernels to compute all prefix states simultaneously. Inference: True sequential updates. Memory and time complexity per token are decoupled from sequence length. Explicit Causality: We discard RoPE and attention masks. Causality is a first-class citizen, explicitly modeled through learned, content-dependent decay gates. Current zero-shot benchmarks demonstrate that Spartacus-1B-Instruct (1.3B) is already outperforming established sub-quadratic models like Mamba-1.4B and RWKV-6-1.6B on ARC-Challenge (0.3063). Recent integration of structured Chain-of-Thought (CoT) data has further pushed reasoning accuracy to 75%. The "Spartacus" era is about scaling intelligence, not the memory wall βΎοΈ.
new
activity
6 days ago
TheDrummer/Rocinante-X-12B-v1-GGUF:
Comparison with Rivermind-Lux-12B-v1b?
reacted
to
Reubencf
's
post
with π₯
24 days ago
π’ New release! World_events Dataset now available featuring global events spanning 2023 through 2025 π https://huggingface.co/collections/Reubencf/world-events π 2026 dataset dropping soon
View all activity
Organizations
None yet
models
2
Sort:Β Recently updated
JermemyHaschal/llama-joycaption-beta-one-hf-llava-gguf
8B
β’
Updated
Aug 14, 2025
β’
28
JermemyHaschal/Phigments12-Q6_K-GGUF
3B
β’
Updated
Apr 22, 2024
β’
4
datasets
0
None public yet