Fail to transcribe in Chinese #808

mru4913 · 2024-04-25T09:55:00Z

I have tried the following code according to README.md.

from faster_whisper import WhisperModel
import time

model_size = "./faster-distil-whisper-large-v2"

model = WhisperModel(model_size, device="cuda", compute_type="float16")

t1 = time.perf_counter()
segments, info = model.transcribe(
    "............./../0.mp3",
    # beam_size=5,
    language="zh",
    condition_on_previous_text=False,
)
print(time.perf_counter() - t1)
print(
    "Detected language '%s' with probability %f"
    % (info.language, info.language_probability)
)
for i in segments:
    print(i.text)

output is :

0.06374595290981233
Detected language 'zh' with probability 1.000000
 to me, so I want to say that I want to say,
 if you're if you're to my their research to try to...

Audio is in Chinese (madarain), I couldn't figure out why it outputs in English. Any help will be appreciated.

Purfview · 2024-04-26T10:25:56Z

Distil models are English only, you need to use a multilanguage model.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fail to transcribe in Chinese #808

Fail to transcribe in Chinese #808

mru4913 commented Apr 25, 2024 •

edited

Purfview commented Apr 26, 2024 •

edited

Fail to transcribe in Chinese #808

Fail to transcribe in Chinese #808

Comments

mru4913 commented Apr 25, 2024 • edited

Purfview commented Apr 26, 2024 • edited

mru4913 commented Apr 25, 2024 •

edited

Purfview commented Apr 26, 2024 •

edited