ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 214k • 1.08k
openai/clip-vit-large-patch14 Zero-Shot Image Classification • 0.4B • Updated Sep 15, 2023 • 7.72M • 1.94k
HuggingFaceTB/SmolVLM2-2.2B-Instruct Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 108k • 296
google/vit-base-patch16-224-in21k Image Feature Extraction • 86.4M • Updated Feb 5, 2024 • 1.2M • 392
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 28 days ago • 214k • 1.56k
microsoft/Phi-3-vision-128k-instruct Text Generation • 4B • Updated 28 days ago • 28.5k • 969
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • 71B • Updated Apr 13, 2025 • 4.11k • • 2.06k
microsoft/table-transformer-structure-recognition-v1.1-all Object Detection • 28.8M • Updated Nov 18, 2023 • 363k • 78