Malasar ASR Resources
Collection
7 items • Updated • 1
How to use vrclc/Malasar_small_DTF with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="vrclc/Malasar_small_DTF") # Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM
processor = AutoProcessor.from_pretrained("vrclc/Malasar_small_DTF")
model = AutoModelForMultimodalLM.from_pretrained("vrclc/Malasar_small_DTF")This model is a fine-tuned version of openai/whisper-small on the Spoken Bible Corpus: Malasar dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.1612 | 11.3636 | 250 | 0.3185 | 64.0206 |
| 0.0114 | 22.7273 | 500 | 0.4682 | 65.7339 |
| 0.0016 | 34.0909 | 750 | 0.5380 | 59.9657 |
| 0.0004 | 45.4545 | 1000 | 0.5761 | 59.8515 |
| 0.0003 | 56.8182 | 1250 | 0.5969 | 59.6802 |
| 0.0002 | 68.1818 | 1500 | 0.6104 | 60.3655 |
| 0.0002 | 79.5455 | 1750 | 0.6181 | 60.1942 |
| 0.0002 | 90.9091 | 2000 | 0.6211 | 60.1371 |
@misc{multistage2024,
title={Multistage Fine-tuning Strategies for Automatic Speech Recognition in Low-resource Languages},
author={Leena G Pillai, Kavya Manohar, Basil K Raju, Elizabeth Sherly},
year={2024},
eprint={2411.04573},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2411.04573},
}
Base model
openai/whisper-small