chfeng

university

https://github.com/cfeng16

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

russwang authored a paper about 1 month ago

Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following

russwang authored a paper 2 months ago

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

russwang authored a paper 3 months ago

Agent Learning via Early Experience

View all activity

russwang

authored a paper about 1 month ago

Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following

Paper • 2511.21662 • Published Nov 26, 2025 • 11

russwang

authored a paper 2 months ago

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

Paper • 2511.01163 • Published Nov 3, 2025 • 31

russwang

authored 2 papers 3 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 270

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

Paper • 2509.23661 • Published Sep 28, 2025 • 47

russwang

authored 3 papers 4 months ago

What makes Reasoning Models Different? Follow the Reasoning Leader for Efficient Decoding

Paper • 2506.06998 • Published Jun 8, 2025 • 1

CaughtCheating: Is Your MLLM a Good Cheating Detective? Exploring the Boundary of Visual Perception and Reasoning

Paper • 2507.00045 • Published Jun 23, 2025 • 1

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 84

chfeng

authored a paper 7 months ago

ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs

Paper • 2506.10128 • Published Jun 11, 2025 • 22

russwang

authored 2 papers 7 months ago

ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs

Paper • 2506.10128 • Published Jun 11, 2025 • 22

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Paper • 2506.05523 • Published Jun 5, 2025 • 34

chfeng

authored 2 papers 9 months ago

This&That: Language-Gesture Controlled Video Generation for Robot Planning

Paper • 2407.05530 • Published Jul 8, 2024 • 4

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Paper • 2504.07934 • Published Apr 10, 2025 • 20

russwang

authored a paper 9 months ago

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Paper • 2504.07934 • Published Apr 10, 2025 • 20

chfeng

authored a paper 12 months ago

GPS as a Control Signal for Image Generation

Paper • 2501.12390 • Published Jan 21, 2025 • 15

russwang

authored 2 papers about 1 year ago

Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension

Paper • 2412.03704 • Published Dec 4, 2024 • 6

Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning

Paper • 2410.06508 • Published Oct 9, 2024 • 11

russwang

authored a paper over 1 year ago

LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3, 2024 • 37

chfeng

authored 3 papers almost 2 years ago

AVA-AVD: Audio-Visual Speaker Diarization in the Wild

Paper • 2111.14448 • Published Nov 29, 2021

Self-Supervised Video Forensics by Audio-Visual Anomaly Detection

Paper • 2301.01767 • Published Jan 4, 2023

Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs

Paper • 2309.03118 • Published Sep 6, 2023 • 2

AI & ML interests

Recent Activity

Team members 2

chfeng123's activity