5 5 4

Martin Ziqiao Ma PRO

marstin

http://www.ziqiaoma.com/

AI & ML interests

https://huggingface.co/Seed42Lab

Recent Activity

authored a paper 5 days ago

SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds

authored a paper 5 days ago

Next-Embedding Prediction Makes Strong Vision Learners

upvoted a paper 8 days ago

Next-Embedding Prediction Makes Strong Vision Learners

View all activity

Organizations

authored 2 papers 5 days ago

SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds

Paper • 2512.01078 • Published 27 days ago • 33

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published 9 days ago • 79

upvoted a paper 8 days ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published 9 days ago • 79

upvoted a paper 23 days ago

SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds

Paper • 2512.01078 • Published 27 days ago • 33

authored 4 papers about 2 months ago

AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies

Paper • 2508.08113 • Published Aug 11 • 11

From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens

Paper • 2510.02292 • Published Oct 2 • 1

Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry

Paper • 2510.25595 • Published Oct 29

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

Paper • 2511.01163 • Published Nov 3 • 31

upvoted a paper about 2 months ago

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

Paper • 2511.01163 • Published Nov 3 • 31

liked a dataset 3 months ago

cheryyunl/ROVER

Viewer • Updated Nov 8 • 1.31k • 161 • 8

updated a Space 3 months ago

VLM-Lens

👀

[EMNLP 2025 Demo] VLM-Lens: Extracting VLM representations

published a Space 3 months ago

VLM-Lens

👀

[EMNLP 2025 Demo] VLM-Lens: Extracting VLM representations

New activity in sled-umich/InfEdit 4 months ago

License?

#2 opened about 2 years ago by

cian0

What

#3 opened about 2 years ago by

NoenD

updated a model 5 months ago

sled-umich/groundhog-7b

Updated Jul 22

updated 3 datasets 6 months ago

authored a paper 6 months ago

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation

Paper • 2506.21876 • Published Jun 27 • 28

upvoted a paper 6 months ago

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Paper • 2506.18890 • Published Jun 23 • 6

Martin Ziqiao Ma PRO

AI & ML interests

Recent Activity

Organizations

marstin's activity

VLM-Lens

VLM-Lens

License?

What