PyVision-RL: Forging Open Agentic Vision Models via RL Paper • 2602.20739 • Published 5 days ago • 28
BitDance: Scaling Autoregressive Generative Models with Binary Tokens Paper • 2602.14041 • Published 14 days ago • 50
Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents Paper • 2602.16855 • Published 14 days ago • 46
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper • 2602.18422 • Published 9 days ago • 30
Unified Personalized Reward Model for Vision Generation Paper • 2602.02380 • Published 27 days ago • 20
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published 30 days ago • 204
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published Jan 14 • 33
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Paper • 2601.05432 • Published Jan 8 • 167
VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction Paper • 2601.05966 • Published Jan 9 • 23