Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale Paper • 2601.10338 • Published 1 day ago • 3
LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning Paper • 2601.10129 • Published 1 day ago • 4
HeartMuLa: A Family of Open Sourced Music Foundation Models Paper • 2601.10547 • Published 1 day ago • 8
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution Paper • 2601.10657 • Published 1 day ago • 10
FlowAct-R1: Towards Interactive Humanoid Video Generation Paper • 2601.10103 • Published 1 day ago • 14
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Paper • 2601.10611 • Published 1 day ago • 15
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders Paper • 2601.10332 • Published 1 day ago • 20
Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering Paper • 2601.10402 • Published 1 day ago • 24
Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning Paper • 2601.07641 • Published 4 days ago • 35
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning Paper • 2601.09667 • Published 2 days ago • 63
Urban Socio-Semantic Segmentation with Vision-Language Reasoning Paper • 2601.10477 • Published 1 day ago • 137
EpiCaR: Knowing What You Don't Know Matters for Better Reasoning in LLMs Paper • 2601.06786 • Published 6 days ago • 5
VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding Paper • 2601.07290 • Published 5 days ago • 6
JudgeRLVR: Judge First, Generate Second for Efficient Reasoning Paper • 2601.08468 • Published 3 days ago • 5
MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences Paper • 2601.06789 • Published 6 days ago • 73
e5-omni: Explicit Cross-modal Alignment for Omni-modal Embeddings Paper • 2601.03666 • Published 10 days ago • 3