Models: H
Collection
Attention-only transformers, sweep over number of heads (for fixed head dimension) • 7 items • Updated
YAML Metadata Warning: empty or missing yaml metadata in repo card
Check out the documentation for more information.