RLFR: Extending Reinforcement Learning for LLMs with Flow Environment Paper • 2510.10201 • Published Oct 11, 2025 • 35
HuggingFaceTB/SmolVLM2-256M-Video-Instruct Image-Text-to-Text • 0.3B • Updated Apr 8, 2025 • 148k • 91
MME-SCI: A Comprehensive and Challenging Science Benchmark for Multimodal Large Language Models Paper • 2508.13938 • Published Aug 19, 2025 • 1
Skywork/Skywork-o1-Open-Llama-3.1-8B Text Generation • 8B • Updated Aug 29, 2025 • 215 • • 115
Internal Consistency and Self-Feedback in Large Language Models: A Survey Paper • 2407.14507 • Published Jul 19, 2024 • 46