Tele-AI/TELEVAL
Viewer • Updated • 39.8k • 301 • 1
None defined yet.
Swift-SVD: Theoretical Optimality Meets Practical Efficiency in Low-Rank LLM Compression
KV-CoRE: Benchmarking Data-Dependent Low-Rank Compressibility of KV-Caches in LLMs