Long Video Benchmark

non-profit

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

sy1998 authored a paper 5 days ago

Video-BrowseComp: Benchmarking Agentic Video Research on Open Web

yzwang authored a paper 25 days ago

OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation

yzwang authored a paper 25 days ago

AlignVid: Training-Free Attention Scaling for Semantic Fidelity in Text-Guided Image-to-Video Generation

View all activity

sy1998

authored a paper 5 days ago

Video-BrowseComp: Benchmarking Agentic Video Research on Open Web

Paper • 2512.23044 • Published 8 days ago • 9

yzwang

authored 2 papers 25 days ago

OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation

Paper • 2512.08294 • Published 28 days ago • 17

AlignVid: Training-Free Attention Scaling for Semantic Fidelity in Text-Guided Image-to-Video Generation

Paper • 2512.01334 • Published Dec 1, 2025

JUNJIE99

authored a paper about 2 months ago

MR$^2$-Bench: Going Beyond Matching to Reasoning in Multimodal Retrieval

Paper • 2509.26378 • Published Sep 30, 2025

yzwang

authored a paper 2 months ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 108

Shitao

authored a paper 7 months ago

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23, 2025 • 78

yzwang

authored a paper 7 months ago

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23, 2025 • 78

JUNJIE99

authored 4 papers 7 months ago

MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos

Paper • 2502.12558 • Published Feb 18, 2025

Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval

Paper • 2502.11431 • Published Feb 17, 2025

VideoDeepResearch: Long Video Understanding With Agentic Tool Using

Paper • 2506.10821 • Published Jun 12, 2025 • 19

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23, 2025 • 78

sy1998

authored a paper 7 months ago

EarthMind: Towards Multi-Granular and Multi-Sensor Earth Observation with Large Multimodal Models

Paper • 2506.01667 • Published Jun 2, 2025 • 21

yzwang

authored a paper 7 months ago

MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos

Paper • 2502.12558 • Published Feb 18, 2025

sy1998

authored 4 papers 8 months ago

Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding

Paper • 2503.18478 • Published Mar 24, 2025 • 1

yzwang

authored a paper 11 months ago

EVEv2: Improved Baselines for Encoder-Free Vision-Language Models

Paper • 2502.06788 • Published Feb 10, 2025 • 13

yzwang

authored a paper about 1 year ago

Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions

Paper • 2406.10638 • Published Jun 15, 2024

Shitao

authored a paper about 1 year ago

MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval

Paper • 2412.14475 • Published Dec 19, 2024 • 55

AI & ML interests

Recent Activity

Team members 4

LVBench's activity