Mirror of https://github.com/FunAudioLLM/SenseVoice.git (synced 2025-09-15 15:08:35 +08:00)
Commit e9b6e53f98 (parent 3fe73f18a6): sensevoice
@@ -39,10 +39,11 @@ Model Zoo:
<a name="What's News"></a>
# What's News 🔥
- 2024/7: The SenseVoice-Small voice understanding model is open-sourced, providing support for multilingual speech recognition, speech emotion recognition, and acoustic event detection capabilities in Mandarin, Cantonese, English, Japanese, and Korean.
- 2024/7: The [SenseVoice-Small](https://www.modelscope.cn/models/iic/SenseVoiceSmall) voice understanding model is open-sourced, providing support for multilingual speech recognition, speech emotion recognition, and acoustic event detection capabilities in Mandarin, Cantonese, English, Japanese, and Korean.
- 2024/7: CosyVoice targets natural speech generation with multilingual, timbre, and emotion control. See the [CosyVoice repo](https://github.com/FunAudioLLM/CosyVoice) and the [CosyVoice space](https://www.modelscope.cn/studios/iic/CosyVoice-300M).
- 2024/7: [FunASR](https://github.com/modelscope/FunASR) is a fundamental speech recognition toolkit that offers a variety of features, including speech recognition (ASR), Voice Activity Detection (VAD), Punctuation Restoration, Language Models, Speaker Verification, Speaker Diarization and multi-talker ASR.
<a name="Benchmarks"></a>
- <a name="Benchmarks"></a>
# Benchmarks 📝
## Multilingual Speech Recognition
@@ -38,8 +38,9 @@ SenseVoice is an audio foundation model with audio understanding capabilities, including speech recognition
<a name="最新动态"></a>
# Latest News 🔥
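- 2024/7: The SenseVoice-Small multilingual audio understanding model is open-sourced, supporting multilingual speech recognition, emotion recognition, and event detection in Mandarin, Cantonese, English, Japanese, and Korean.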
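- 2024/7: The [SenseVoice-Small](https://www.modelscope.cn/models/iic/SenseVoiceSmall) multilingual audio understanding model is open-sourced, supporting multilingual speech recognition, emotion recognition, and event detection in Mandarin, Cantonese, English, Japanese, and Korean.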
- 2024/7: CosyVoice targets natural speech generation with multilingual, timbre, and emotion control. See the [CosyVoice repo](https://github.com/FunAudioLLM/CosyVoice) and the [CosyVoice online demo](https://www.modelscope.cn/studios/iic/CosyVoice-300M).
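- 2024/7: [FunASR](https://github.com/modelscope/FunASR) is a fundamental speech recognition toolkit that offers a variety of features, including speech recognition (ASR), voice activity detection (VAD), punctuation restoration, language models, speaker verification, speaker diarization, and multi-talker ASR.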
<a name="Benchmarks"></a>
# Benchmarks 📝