CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery Paper • 2604.01658 • Published 5 days ago • 43
Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants Paper • 2604.00842 • Published 6 days ago • 10
Paper Reconstruction Evaluation: Evaluating Presentation and Hallucination in AI-written Papers Paper • 2604.01128 • Published 6 days ago • 12
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 18 days ago • 321
Story2Proposal: A Scaffold for Structured Scientific Paper Writing Paper • 2603.27065 • Published 10 days ago • 21
PRBench: End-to-end Paper Reproduction in Physics Research Paper • 2603.27646 • Published 9 days ago • 29
Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization Paper • 2603.28342 • Published 8 days ago • 25
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published 12 days ago • 125
T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search Paper • 2603.22341 • Published 17 days ago • 37
Effective Strategies for Asynchronous Software Engineering Agents Paper • 2603.21489 • Published 15 days ago • 6
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis Paper • 2603.20278 • Published 21 days ago • 92
Reasoning over mathematical objects: on-policy reward modeling and test time aggregation Paper • 2603.18886 • Published 19 days ago • 6
ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents Paper • 2603.18815 • Published 19 days ago • 14
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published 19 days ago • 66
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published 22 days ago • 153
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data Paper • 2603.09206 • Published 28 days ago • 52
Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams Paper • 2603.07392 • Published about 1 month ago • 18