Papers
arxiv:2606.03645

The Shape of Addition: Geometric Structures of Arithmetic in Large Language Models

Published on May 29
· Submitted by
Liuyuan Wen
on Jun 5
Authors:
,
,
,

Abstract

Large language models show arithmetic fragility due to geometric structures in residual streams, where neural noise causes quantization failures that can be detected and corrected through geometric analysis.

Large Language Models exhibit paradoxical fragility in fundamental arithmetic, implying a disconnect between internal computation and discrete output. By analyzing the residual stream geometry during multi-operand addition, we identify the Iso-Raw-Sum Trajectory (IRST), a geometric structure where representations are anchored by semantic digits and modulated by continuous carry fibers. We propose the Noisy Quantization Model to explain this geometry, framing arithmetic errors as Geometric Slippages caused by internal neural noise pushing a continuous, latent Carry Potential across quantization thresholds. This geometric framework further elucidates Probe Versatility, explaining how lightweight probes can disentangle coexisting latent signals (such as ground truth versus hallucination) from a single activation vector. Finally, we validate these insights through a geometric consistency check method that effectively detects and corrects these quantization failures during inference. Our code is available at https://github.com/RL-MIND/Shape-of-Addition.

Community

Paper author Paper submitter

We introduce The Shape of Addition, a mechanistic interpretability study of why LLMs can still fail at basic multi-operand addition.

By probing residual-stream activations at each generated digit, we find that arithmetic states are organized into Iso-Raw-Sum Trajectories (IRSTs): continuous raw-sum fibers passing through digit basins and further stratified by carry states. This geometry explains common off-by-one arithmetic errors as geometric slippages, where noisy latent carry representations cross quantization thresholds before discrete token output.

We further propose a Noisy Quantization Model to characterize these failures, and validate the framework with a dual-stream consistency check that can detect and correct some quantization errors during inference. The results suggest that LLMs may internally retain correct arithmetic components even when the final token prediction is wrong.

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2606.03645
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2606.03645 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2606.03645 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2606.03645 in a Space README.md to link it from this page.

Collections including this paper 1