On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking Paper • 2602.16849 • Published 4 days ago • 6
Muon Outperforms Adam in Tail-End Associative Memory Learning Paper • 2509.26030 • Published Sep 30, 2025 • 20
Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders Paper • 2506.14002 • Published Jun 16, 2025 • 5