Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper β’ 2502.05171 β’ Published Feb 7, 2025 β’ 154
Transformers Can Do Arithmetic with the Right Embeddings Paper β’ 2405.17399 β’ Published May 27, 2024 β’ 54
Bring Your Own Data! Self-Supervised Evaluation for Large Language Models Paper β’ 2306.13651 β’ Published Jun 23, 2023 β’ 16
On the Reliability of Watermarks for Large Language Models Paper β’ 2306.04634 β’ Published Jun 7, 2023 β’ 6
Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust Paper β’ 2305.20030 β’ Published May 31, 2023 β’ 9