AI Model Library

Sort options: Downloads, Favorites, Newest · Active filter: automatic-speech-recognition

whisperkit-coreml

automatic-speech-recognition
argmaxinc · argmaxinc/whisperkit-coreml

WhisperKit. Library: whisperkit. Tags: whisper, whisperkit, coreml, asr, quantized, automatic-speech-recognition.

Downloads: 10,910,125 · Favorites: 388

speaker-diarization-3.1

automatic-speech-recognition
pyannote · pyannote/speaker-diarization-3.1

Automatic speech recognition

Downloads: 10,249,640 · Favorites: 2,057

whisper-large-v3-turbo

automatic-speech-recognition
openai · openai/whisper-large-v3-turbo

Whisper is a state-of-the-art model for automatic speech recognition (ASR) and speech translation, proposed in the paper Robust Speech Recognition via Large-Scale Weak Supervision by Alec Radford et

Downloads: 6,876,575 · Favorites: 3,002

whisper-large-v3

automatic-speech-recognition
openai · openai/whisper-large-v3

Whisper is a state-of-the-art model for automatic speech recognition (ASR) and speech translation, proposed in the paper Robust Speech Recognition via Large-Scale Weak Supervision by Alec Radford et

Downloads: 4,998,671 · Favorites: 5,669

wav2vec2-large-xlsr-53-russian

automatic-speech-recognition
jonatasgrosman · jonatasgrosman/wav2vec2-large-xlsr-53-russian

Fine-tuned XLSR-53 large model for speech recognition in Russian

Downloads: 4,152,128 · Favorites: 74

voice-activity-detection

automatic-speech-recognition
pyannote · pyannote/voice-activity-detection

Automatic speech recognition

Downloads: 3,518,729 · Favorites: 233

mms-300m-1130-forced-aligner

automatic-speech-recognition
MahmoudAshraf · MahmoudAshraf/mms-300m-1130-forced-aligner

Forced Alignment with Hugging Face CTC Models: This Python package provides an efficient way to perform forced alignment between text and audio using Hugging Face's pretrained models. It also features

Downloads: 3,477,232 · Favorites: 87

wav2vec2-large-xlsr-53-portuguese

automatic-speech-recognition
jonatasgrosman · jonatasgrosman/wav2vec2-large-xlsr-53-portuguese

Fine-tuned XLSR-53 large model for speech recognition in Portuguese

Downloads: 3,458,442 · Favorites: 54

speaker-diarization-community-1

automatic-speech-recognition
pyannote · pyannote/speaker-diarization-community-1

Automatic speech recognition

Downloads: 2,856,155 · Favorites: 361

whisper-small

automatic-speech-recognition
openai · openai/whisper-small

Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Trained on 680k hours of labelled data, Whisper models demonstrate a strong ability to generalise to many

Downloads: 2,293,475 · Favorites: 556

Qwen3-ASR-1.7B

automatic-speech-recognition
Qwen · Qwen/Qwen3-ASR-1.7B

The Qwen3-ASR family includes Qwen3-ASR-1.7B and Qwen3-ASR-0.6B, which support language identification and ASR for 52 languages and dialects. Both leverage large-scale speech training data and the str

Downloads: 2,021,550 · Favorites: 793