microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated Dec 10, 2025 • 346k • 1.57k
llava-hf/llava-onevision-qwen2-72b-ov-hf Image-Text-to-Text • 73B • Updated Jun 18, 2025 • 2.21k • 10
facebook/metaclip-h14-fullcc2.5b Zero-Shot Image Classification • 1.0B • Updated Jan 11, 2024 • 14.9k • 49