mirror of https://github.com/modelscope/FunASR synced 2025-09-15 14:48:36 +08:00

游雁 298ddd13fb funasr2

2023-12-15 23:46:41 +08:00

Pretrained Models on Hugging Face

Model License

Apache License 2.0

Model Zoo

Here we provide several pretrained models trained on different datasets. Details of the models and datasets can be found on ModelScope.

Speech Recognition Models

Paraformer Models

Model Name	Language	Training Data	Vocab Size	Parameters	Offline/Online	Notes
Paraformer-large	CN & EN	Alibaba Speech Data (60,000 hours)	8404	220M	Offline	Input WAV duration <= 20 s
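
The Notes column above limits Paraformer-large input to 20 seconds of audio. As a minimal sketch, a pre-check on a PCM WAV file could look like the following. It uses only Python's standard `wave` module; the names `MAX_SECONDS`, `wav_duration_seconds`, and `fits_offline_limit` are hypothetical helpers for illustration and are not part of FunASR.

```python
import wave

# Assumption: the 20 s limit is taken from the Notes column of the table above.
MAX_SECONDS = 20.0


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        # Duration = number of audio frames / frames per second.
        return wav.getnframes() / wav.getframerate()


def fits_offline_limit(path, max_seconds=MAX_SECONDS):
    """Return True if the file does not exceed the given duration limit."""
    return wav_duration_seconds(path) <= max_seconds
```

Files that fail the check would need to be split into shorter segments before recognition, for example with a voice activity detection model.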

UniASR Models

Conformer Models

RNN-T Models

Multi-talker Speech Recognition Models

MFCCA Models

Voice Activity Detection Models

Model Name	Training Data	Parameters	Sampling Rate (Hz)	Notes
FSMN-VAD	Alibaba Speech Data (5,000 hours)	0.4M	16000

Punctuation Restoration Models

Model Name	Training Data	Parameters	Vocab Size	Offline/Online	Notes
CT-Transformer	Alibaba Text Data	70M	272727	Offline	offline punctuation model

Language Models

Speaker Verification Models

Speaker Diarization Models

Timestamp Prediction Models