TIPSv2 Collection TIPSv2 foundational vision-language models. Webpage: https://gdm-tipsv2.github.io/ • 9 items • Updated 14 days ago • 25
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 27 days ago • 880
Structured 3D Latents for Scalable and Versatile 3D Generation Paper • 2412.01506 • Published Dec 2, 2024 • 89
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10, 2025 • 207
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated Apr 10, 2025 • 113
view article Article ChatGPT-4o's Image Generation Capabilities and Its Wild Examples Apr 5, 2025 • 22
Canary ASR/AST Collection A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 6 items • Updated 8 days ago • 34
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 306
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Paper • 2403.05530 • Published Mar 8, 2024 • 64
Exploring Large Language Models' Cognitive Moral Development through Defining Issues Test Paper • 2309.13356 • Published Sep 23, 2023 • 38