razhan/DOLMA-speech
Viewer • Updated • 41.7k • 137 • 1
How to use razhan/whisper-base-hac with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="razhan/whisper-base-hac") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("razhan/whisper-base-hac")
model = AutoModelForSpeechSeq2Seq.from_pretrained("razhan/whisper-base-hac")This model is a fine-tuned version of openai/whisper-base on the razhan/DOLMA-speech hawrami dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|---|---|---|---|---|---|
| 1.9468 | 1.0 | 27 | 0.9999 | 0.9633 | 0.3797 |
| 0.6132 | 2.0 | 54 | 0.4681 | 0.6078 | 0.1469 |
| 0.3976 | 3.0 | 81 | 0.3668 | 0.5161 | 0.1128 |
| 0.3485 | 4.0 | 108 | 0.3360 | 0.4889 | 0.1065 |
| 0.3292 | 5.0 | 135 | 0.3272 | 0.4792 | 0.1039 |
Base model
openai/whisper-base