Imagination Helps Visual Reasoning, But Not Yet in Latent Space • Paper • 2602.22766 • Published 5 days ago • 38
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer • Paper • 2412.13871 • Published Dec 18, 2024 • 18