WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models Paper • 2602.02537 • Published 12 days ago
Generative Frame Sampler for Long Video Understanding Paper • 2503.09146 • Published Mar 12, 2025
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated 13 days ago
Article: Kimi-VL-A3B-Thinking-2506: A Quick Navigation • Published Jun 21, 2025
ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks Paper • 2503.06885 • Published Mar 10, 2025
VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning? Paper • 2505.23359 • Published May 29, 2025
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning Paper • 2504.08837 • Published Apr 10, 2025
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14, 2025
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published Feb 20, 2025
VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation Paper • 2411.13281 • Published Nov 20, 2024
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper • 2501.12599 • Published Jan 22, 2025
MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale Paper • 2412.05237 • Published Dec 6, 2024
VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation Paper • 2412.00927 • Published Dec 1, 2024