Problems with timestamps

by jalberth - opened Feb 21, 2025

Feb 21, 2025

I can't get return_timestamps=True to save the transcription with timestamps. I'm using the simple code from the model card and saving the res variable as JSON.

d93

Feb 21, 2025

•

edited Feb 21, 2025

pipe = pipeline(
"automatic-speech-recognition",
model=model,
tokenizer=processor.tokenizer,
feature_extractor=processor.feature_extractor,
torch_dtype=torch_dtype,
device=device,
return_timestamps=True,
)

I tried with a short audio.mp3, so I had to decrease chunk_length_s to 10 in the call to pipe to get it to work.

jalberth

Feb 23, 2025

Perfect, now I got it to work. I wrongly put the argument in the generate_kwargs.

thx for the clarification.

Lauler changed discussion status to closed Feb 25, 2025

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment