RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic Text Generation • 71B • Updated Oct 23, 2025 • 15.5k • 14
RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-quantized.w4a16 Text Generation • 2B • Updated Oct 22, 2025 • 225 • 3
RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-FP8-dynamic Text Generation • 9B • Updated Oct 14, 2025 • 1.49k • 2
RedHatAI/Voxtral-Mini-3B-2507-FP8-dynamic Automatic Speech Recognition • 5B • Updated Oct 13, 2025 • 310 • 9
RedHatAI/whisper-large-v3-turbo-quantized.w4a16 Automatic Speech Recognition • 0.2B • Updated Oct 13, 2025 • 224 • 6
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic Text Generation • 236B • Updated Oct 3, 2025 • 51 • 4
RedHatAI/Voxtral-Small-24B-2507-FP8-dynamic Automatic Speech Recognition • 24B • Updated Sep 26, 2025 • 20k
RedHatAI/granite-3.1-8b-instruct-quantized.w8a8 Text Generation • 8B • Updated Sep 25, 2025 • 375 • 2
RedHatAI/Apertus-70B-Instruct-2509-quantized.w4a16 Text Generation • 11B • Updated Sep 23, 2025 • 18 • 1
RedHatAI/Qwen2.5-Coder-14B-Instruct-FP8-dynamic Text Generation • 15B • Updated Sep 23, 2025 • 93 • 1
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16 Text Generation • 8B • Updated Sep 22, 2025 • 20.3k • 30
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8 Text Generation • 8B • Updated Sep 22, 2025 • 5.84k • 19