mirror of
https://github.com/modelscope/FunASR
synced 2025-09-15 14:48:36 +08:00
14 KiB
14 KiB
Pretrained Models on Huggingface
Model License
- Apache License 2.0
Model Zoo
Here we provided several pretrained models on different datasets. The details of models and datasets can be found on ModelScope.
Speech Recognition Models
Paraformer Models
| Model Name | Language | Training Data | Vocab Size | Parameter | Offline/Online | Notes |
|---|---|---|---|---|---|---|
| Paraformer-large | CN & EN | Alibaba Speech Data (60000hours) | 8404 | 220M | Offline | Duration of input wav <= 20s |
UniASR Models
Conformer Models
RNN-T Models
Multi-talker Speech Recognition Models
MFCCA Models
Voice Activity Detection Models
| Model Name | Training Data | Parameters | Sampling Rate | Notes |
|---|---|---|---|---|
| FSMN-VAD | Alibaba Speech Data (5000hours) | 0.4M | 16000 |
Punctuation Restoration Models
| Model Name | Training Data | Parameters | Vocab Size | Offline/Online | Notes |
|---|---|---|---|---|---|
| CT-Transformer | Alibaba Text Data | 70M | 272727 | Offline | offline punctuation model |