DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off Paper • 2604.13902 • Published 14 days ago • 61
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 7 days ago • 234
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 27 days ago • 497
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published 21 days ago • 187
arithmetic-circuit-overloading/Llama-3.3-70B-Instruct-v2-3d-4M-400K-0.1-reverse-padzero-99-128D-3L-2H-512I Text Generation • 990k • Updated 24 days ago • 1.25k • 1