You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have tried the following code according to README.md.
fromfaster_whisperimportWhisperModelimporttimemodel_size="./faster-distil-whisper-large-v2"model=WhisperModel(model_size, device="cuda", compute_type="float16")
t1=time.perf_counter()
segments, info=model.transcribe(
"............./../0.mp3",
# beam_size=5,language="zh",
condition_on_previous_text=False,
)
print(time.perf_counter() -t1)
print(
"Detected language '%s' with probability %f"% (info.language, info.language_probability)
)
foriinsegments:
print(i.text)
output is :
0.06374595290981233
Detected language 'zh' with probability 1.000000
to me, so I want to say that I want to say,
if you're if you're to my their research to try to...
Audio is in Chinese (madarain), I couldn't figure out why it outputs in English. Any help will be appreciated.
The text was updated successfully, but these errors were encountered:
I have tried the following code according to README.md.
output is :
Audio is in Chinese (madarain), I couldn't figure out why it outputs in English. Any help will be appreciated.
The text was updated successfully, but these errors were encountered: