Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR Paper • 2603.26246 • Published 6 days ago • 1 • 2
CutClaw: Agentic Hours-Long Video Editing via Music Synchronization Paper • 2603.29664 • Published 2 days ago • 39 • 2
Ghost-FWL: A Large-Scale Full-Waveform LiDAR Dataset for Ghost Detection and Removal Paper • 2603.28224 • Published 3 days ago • 3 • 2
AutoWeather4D: Autonomous Driving Video Weather Conversion via G-Buffer Dual-Pass Editing Paper • 2603.26546 • Published 6 days ago • 5 • 2
BizGenEval: A Systematic Benchmark for Commercial Visual Content Generation Paper • 2603.25732 • Published 7 days ago • 10 • 2
VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward Paper • 2603.26599 • Published 6 days ago • 45 • 2
FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration Paper • 2603.29557 • Published 2 days ago • 12 • 2
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published 3 days ago • 233 • 4
MPDiT: Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model Paper • 2603.26357 • Published 6 days ago • 1 • 2
TrajectoryMover: Generative Movement of Object Trajectories in Videos Paper • 2603.29092 • Published 3 days ago • 1 • 2
GEMS: Agent-Native Multimodal Generation with Memory and Skills Paper • 2603.28088 • Published 3 days ago • 69 • 3
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development Paper • 2603.27460 • Published 5 days ago • 51 • 3
The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning Paper • 2603.29025 • Published 3 days ago • 6 • 2
It Takes Two: A Duet of Periodicity and Directionality for Burst Flicker Removal Paper • 2603.22794 • Published 9 days ago • 1 • 2
PoseDreamer: Scalable and Photorealistic Human Data Generation Pipeline with Diffusion Models Paper • 2603.28763 • Published 3 days ago • 4 • 2
MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models Paper • 2603.28590 • Published 3 days ago • 17 • 2