V$^{2}$-SAM: Marrying SAM2 with Multi-Prompt Experts for Cross-View Object Correspondence Paper • 2511.20886 • Published Nov 25
Exo2EgoSyn: Unlocking Foundation Video Generation Models for Exocentric-to-Egocentric Video Synthesis Paper • 2511.20186 • Published Nov 25
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection Paper • 2504.04517 • Published Apr 6
PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval Paper • 2405.10160 • Published May 16, 2024 • 1
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval Paper • 2310.08276 • Published Oct 12, 2023
Control Copy-Paste: Controllable Diffusion-Based Augmentation Method for Remote Sensing Few-Shot Object Detection Paper • 2507.21816 • Published Jul 29
Semantic-Aware Ship Detection with Vision-Language Integration Paper • 2508.15930 • Published Aug 21
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community Paper • 2408.09110 • Published Aug 17, 2024 • 2
EarthSynth: Generating Informative Earth Observation with Diffusion Models Paper • 2505.12108 • Published May 17 • 2