SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2, 2025 • 147
DeepSeek-V3.1 Collection DeepSeek's new 3.1 update to their V3 models! • 6 items • Updated 14 days ago • 8
Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Paper • 2506.09250 • Published Jun 10, 2025 • 27
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 67 items • Updated 8 days ago • 295
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper • 2505.02567 • Published May 5, 2025 • 80
Discovering symbolic expressions with parallelized tree search Paper • 2407.04405 • Published Jul 5, 2024 • 1
Models for Open Hands (Open Devin) Collection Models for Open Hands(Open Devin) trained on the Devinator Dataset. • 3 items • Updated Oct 25, 2024 • 3
LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context Paper • 2412.17596 • Published Dec 23, 2024 • 6
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 6 items • Updated Apr 14, 2025 • 16
view article Article 🐺🐦⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs Dec 4, 2024 • 80