Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
bezzam
's Collections
VibeVoice
Neural codecs
Omnilingual ASR (1,600+ Languages)
Multimodel audio
Speech recognition datasets
Text-to-speech datasets
DigiCam (CelebA)
DiffuserCam Mirflickr
Multimodel audio
updated
Dec 8, 2025
Upvote
-
facebook/seamless-m4t-v2-large
Automatic Speech Recognition
•
2B
•
Updated
Jan 4, 2024
•
73.7k
•
962
stepfun-ai/Step-Audio-2-mini
Any-to-Any
•
Updated
24 days ago
•
1.59k
•
253
bosonai/higgs-audio-v2-generation-3B-base
Text-to-Speech
•
6B
•
Updated
Jul 28, 2025
•
350k
•
660
Upvote
-
Share collection
View history
Collection guide
Browse collections