VidEoMT: Your ViT is Secretly Also a Video Segmentation Model
Paper
• 2602.17807 • Published
• 6
UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision
MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian Splatting