Rethinking Video Generation Model for the Embodied World • Paper • 2601.15282 • Published 8 days ago • 42
FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning • Paper • 2601.11141 • Published 13 days ago • 20
Running on Zero • MCP • 1.87k • Z Image Turbo • Generate stunning AI images from text descriptions in seconds
Running on Zero • Featured • 1.23k • Qwen Image Multiple Angles 3D Camera • Adjust camera angles in images using 3D controls or sliders
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length • Paper • 2512.04677 • Published Dec 4, 2025 • 170
Transition Matching Distillation for Fast Video Generation • Paper • 2601.09881 • Published 14 days ago • 32
google/translategemma-4b-it • Image-Text-to-Text • 5B • Updated about 15 hours ago • 76.6k • 567
google/translategemma-12b-it • Image-Text-to-Text • 13B • Updated about 15 hours ago • 58.7k • 232
Article • Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR • 23 days ago • 71
V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties • Paper • 2512.11799 • Published Dec 12, 2025 • 30