mirror of
https://github.com/FunAudioLLM/SenseVoice.git
synced 2025-09-15 15:08:35 +08:00
update docs
This commit is contained in:
parent
950e7ac13b
commit
bbda25b472
@ -50,7 +50,7 @@ Model Zoo:
|
||||
We compared the performance of multilingual speech recognition between SenseVoice and Whisper on open-source benchmark datasets, including AISHELL-1, AISHELL-2, Wenetspeech, LibriSpeech, and Common Voice. n terms of Chinese and Cantonese recognition, the SenseVoice-Small model has advantages.
|
||||
|
||||
<div align="center">
|
||||
<img src="image/asr_results.png" width="1000" />
|
||||
<img src="image/asr_results1.png" width="500" /><img src="image/asr_results2.png" width="500" />
|
||||
</div>
|
||||
|
||||
## Speech Emotion Recognition
|
||||
|
||||
@ -50,7 +50,7 @@ SenseVoice是具有音频理解能力的音频基础模型,包括语音识别
|
||||
我们在开源基准数据集(包括 AISHELL-1、AISHELL-2、Wenetspeech、Librispeech和Common Voice)上比较了SenseVoice与Whisper的多语言语音识别性能和推理效率。在中文和粤语识别效果上,SenseVoice-Small模型具有明显的效果优势。
|
||||
|
||||
<div align="center">
|
||||
<img src="image/asr_results.png" width="1000" />
|
||||
<img src="image/asr_results1.png" width="500" /><img src="image/asr_results2.png" width="500" />
|
||||
</div>
|
||||
|
||||
## 情感识别
|
||||
|
||||
BIN
image/asr_results1.png
Normal file
BIN
image/asr_results1.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 80 KiB |
BIN
image/asr_results2.png
Normal file
BIN
image/asr_results2.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 74 KiB |
Loading…
Reference in New Issue
Block a user