Multilingual DistilWhisper Collection Multilingual DistilWhisper improves ASR performance in target languages by adding lightweight CLSR modules on top of whisper-small. • 3 items • Updated Mar 18, 2024 • 6
Spire Collection Extending Tower to the speech modality. Spire models are multimodal LLMs capable of transcribing and translating English into 9 different languages. • 5 items • Updated Sep 30, 2025 • 3
From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLM Paper • 2503.10620 • Published Mar 13, 2025 • 7
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 16 items • Updated Dec 24, 2025 • 244
Article DEMO: French Spoken Language Understanding with the new speech resources from NAVER LABS Europe Aug 28, 2024 • 10
DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts Paper • 2311.01070 • Published Nov 2, 2023 • 3
mHuBERT-147 models Collection Compact yet powerful multilingual speech representation models based on the HuBERT architecture. • 3 items • Updated Jun 4, 2024 • 8