MultiWorld: Scalable Multi-Agent Multi-View Video World Models Paper • 2604.18564 • Published 10 days ago • 43
Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2604.12374 • Published 16 days ago • 36
Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems Paper • 2604.14228 • Published 16 days ago • 25
Youssofal/MiniMax-M2.7-Abliterated-Heretic-GGUF Text Generation • 229B • Updated 16 days ago • 6.6k • 39
How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data Paper • 2604.14164 • Published Mar 23 • 34
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 14 days ago • 66
view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective Jan 27 • 72