Mano: Restriking Manifold Optimization for LLM Training
Paper • 2601.23000 • Published • 3
Optimizing Few-Step Generation with Adaptive Matching Distillation
PISA: Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers