diff --git a/README.md b/README.md index cb6094c0a..487cc5ed2 100644 --- a/README.md +++ b/README.md @@ -29,6 +29,7 @@ ## What's new: +- 2024/09/26: Offline File Transcription Service 4.6, Offline File Transcription Service of English 1.7,Real-time Transcription Service 1.11 released,fix memory leak & Support the SensevoiceSmall onnx model;File Transcription Service 2.0 GPU released, Fix GPU memory leak; ([docs](runtime/readme.md)); - 2024/09/25:keyword spotting models are new supported. Supports fine-tuning and inference for four models: [fsmn_kws](https://modelscope.cn/models/iic/speech_sanm_kws_phone-xiaoyun-commands-online), [fsmn_kws_mt](https://modelscope.cn/models/iic/speech_sanm_kws_phone-xiaoyun-commands-online), [sanm_kws](https://modelscope.cn/models/iic/speech_sanm_kws_phone-xiaoyun-commands-offline), [sanm_kws_streaming](https://modelscope.cn/models/iic/speech_sanm_kws_phone-xiaoyun-commands-online). - 2024/07/04:[SenseVoice](https://github.com/FunAudioLLM/SenseVoice) is a speech foundation model with multiple speech understanding capabilities, including ASR, LID, SER, and AED. - 2024/07/01: Offline File Transcription Service GPU 1.1 released, optimize BladeDISC model compatibility issues; ref to ([docs](runtime/readme.md)) diff --git a/README_zh.md b/README_zh.md index fc7584100..1f0eaa7ac 100644 --- a/README_zh.md +++ b/README_zh.md @@ -33,6 +33,7 @@ FunASR希望在语音识别的学术研究和工业应用之间架起一座桥 ## 最新动态 +- 2024/09/26: 中文离线文件转写服务 4.6、英文离线文件转写服务 1.7、中文实时语音听写服务 1.11 发布,修复ONNX内存泄漏、支持SensevoiceSmall onnx模型;中文离线文件转写服务GPU 2.0 发布,修复显存泄漏; 详细信息参阅([部署文档](runtime/readme_cn.md)) - 2024/09/25:新增语音唤醒模型,支持[fsmn_kws](https://modelscope.cn/models/iic/speech_sanm_kws_phone-xiaoyun-commands-online), [fsmn_kws_mt](https://modelscope.cn/models/iic/speech_sanm_kws_phone-xiaoyun-commands-online), [sanm_kws](https://modelscope.cn/models/iic/speech_sanm_kws_phone-xiaoyun-commands-offline), [sanm_kws_streaming](https://modelscope.cn/models/iic/speech_sanm_kws_phone-xiaoyun-commands-online) 4个模型的微调和推理。 - 2024/07/04:[SenseVoice](https://github.com/FunAudioLLM/SenseVoice) 是一个基础语音理解模型,具备多种语音理解能力,涵盖了自动语音识别(ASR)、语言识别(LID)、情感识别(SER)以及音频事件检测(AED)。 - 2024/07/01:中文离线文件转写服务GPU版本 1.1发布,优化bladedisc模型兼容性问题;详细信息参阅([部署文档](runtime/readme_cn.md)) diff --git a/runtime/docs/SDK_advanced_guide_offline.md b/runtime/docs/SDK_advanced_guide_offline.md index 799727f8f..f75d89916 100644 --- a/runtime/docs/SDK_advanced_guide_offline.md +++ b/runtime/docs/SDK_advanced_guide_offline.md @@ -12,6 +12,7 @@ This document serves as a development guide for the FunASR offline file transcri | TIME | INFO | IMAGE VERSION | IMAGE ID | |------------|----------------------------------------------------------------------------------------------------------------------------------|------------------------------|--------------| +| 2024.09.26 | Fix memory leak, Support the SensevoiceSmall onnx model | funasr-runtime-sdk-cpu-0.4.6 | 8651c6b8a1ae | | 2024.05.15 | Adapting to FunASR 1.0 model structure | funasr-runtime-sdk-cpu-0.4.5 | 058b9882ae67 | | 2024.03.05 | docker image supports ARM64 platform, update modelscope | funasr-runtime-sdk-cpu-0.4.4 | 2dc87b86dc49 | | 2024.01.25 | Optimized the VAD (Voice Activity Detection) data processing method, significantly reducing peak memory usage; memory leak optimization| funasr-runtime-sdk-cpu-0.4.2 | befdc7b179ed | @@ -35,9 +36,9 @@ If you do not have Docker installed, please refer to [Docker Installation](https ### Pulling and launching images Use the following command to pull and launch the Docker image for the FunASR runtime-SDK: ```shell -sudo docker pull registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.5 +sudo docker pull registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.6 -sudo docker run -p 10095:10095 -it --privileged=true -v /root:/workspace/models registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.5 +sudo docker run -p 10095:10095 -it --privileged=true -v /root:/workspace/models registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.6 ``` Introduction to command parameters: diff --git a/runtime/docs/SDK_advanced_guide_offline_en.md b/runtime/docs/SDK_advanced_guide_offline_en.md index 4f61416a3..3fef92888 100644 --- a/runtime/docs/SDK_advanced_guide_offline_en.md +++ b/runtime/docs/SDK_advanced_guide_offline_en.md @@ -6,6 +6,7 @@ This document serves as a development guide for the FunASR offline file transcri | TIME | INFO | IMAGE VERSION | IMAGE ID | |------------|-----------------------------------------|---------------------------------|--------------| +| 2024.09.26 | Fix memory leak | funasr-runtime-sdk-en-cpu-0.1.7 | f6c5a7b59eb6 | | 2024.05.15 | Adapting to FunASR 1.0 model structure | funasr-runtime-sdk-en-cpu-0.1.6 | 84d781d07997 | | 2024.03.05 | docker image supports ARM64 platform, update modelscope | funasr-runtime-sdk-en-cpu-0.1.5 | 7cca2abc5901 | | 2024.01.25 | Optimized the VAD (Voice Activity Detection) data processing method, significantly reducing peak memory usage; memory leak optimization| funasr-runtime-sdk-en-cpu-0.1.3 | c00f9ce7a195 | @@ -25,9 +26,9 @@ If you do not have Docker installed, please refer to [Docker Installation](https ### Pulling and launching images Use the following command to pull and launch the Docker image for the FunASR runtime-SDK: ```shell -sudo docker pull registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-en-cpu-0.1.6 +sudo docker pull registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-en-cpu-0.1.7 -sudo docker run -p 10097:10095 -it --privileged=true -v /root:/workspace/models registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-en-cpu-0.1.6 +sudo docker run -p 10097:10095 -it --privileged=true -v /root:/workspace/models registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-en-cpu-0.1.7 ``` Introduction to command parameters: ```text diff --git a/runtime/docs/SDK_advanced_guide_offline_en_zh.md b/runtime/docs/SDK_advanced_guide_offline_en_zh.md index 3d6534312..81f38875e 100644 --- a/runtime/docs/SDK_advanced_guide_offline_en_zh.md +++ b/runtime/docs/SDK_advanced_guide_offline_en_zh.md @@ -6,6 +6,7 @@ FunASR提供可一键本地或者云端服务器部署的英文离线文件转 | 时间 | 详情 | 镜像版本 | 镜像ID | |------------|---------------|---------------------------------|--------------| +| 2024.09.26 | 修复内存泄漏 | funasr-runtime-sdk-en-cpu-0.1.7 | f6c5a7b59eb6 | | 2024.05.15 | 适配FunASR 1.0模型结构 | funasr-runtime-sdk-en-cpu-0.1.6 | 84d781d07997 | | 2024.03.05 | docker镜像支持arm64平台,升级modelscope版本 | funasr-runtime-sdk-en-cpu-0.1.5 | 7cca2abc5901 | | 2024.01.25 | 优化vad数据处理方式,大幅降低峰值内存占用;内存泄漏优化 | funasr-runtime-sdk-en-cpu-0.1.3 | c00f9ce7a195 | @@ -40,11 +41,11 @@ docker安装失败请参考 [Docker Installation](https://alibaba-damo-academy.g 通过下述命令拉取并启动FunASR runtime-SDK的docker镜像: ```shell sudo docker pull \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-en-cpu-0.1.6 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-en-cpu-0.1.7 mkdir -p ./funasr-runtime-resources/models sudo docker run -p 10097:10095 -it --privileged=true \ -v $PWD/funasr-runtime-resources/models:/workspace/models \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-en-cpu-0.1.6 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-en-cpu-0.1.7 ``` ### 服务端启动 diff --git a/runtime/docs/SDK_advanced_guide_offline_gpu.md b/runtime/docs/SDK_advanced_guide_offline_gpu.md index 6e7d00130..bb35cd6aa 100644 --- a/runtime/docs/SDK_advanced_guide_offline_gpu.md +++ b/runtime/docs/SDK_advanced_guide_offline_gpu.md @@ -12,6 +12,7 @@ This document serves as a development guide for the FunASR offline file transcri | TIME | INFO | IMAGE VERSION | IMAGE ID | |------------|----------------------------------------------------------------------------------------------------------------------------------|------------------------------|--------------| +| 2024.09.26 | Fix GPU memory leak | funasr-runtime-sdk-gpu-0.2.0 | d280bf7e495b | | 2024.07.01 | Optimize BladeDISC model compatibility issues | funasr-runtime-sdk-gpu-0.1.1 | 8875cbf9b99e | | 2024.06.27 | Offline File Transcription Software Package(GPU) 1.0 released | funasr-runtime-sdk-gpu-0.1.0 | b86066f4d018 | @@ -28,9 +29,9 @@ If you do not have Docker installed, please refer to [Docker Installation](https ### Pulling and launching images Use the following command to pull and launch the Docker image for the FunASR runtime-SDK: ```shell -sudo docker pull registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-gpu-0.1.1 +sudo docker pull registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-gpu-0.2.0 -sudo docker run --gpus=all -p 10098:10095 -it --privileged=true -v /root:/workspace/models registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-gpu-0.1.1 +sudo docker run --gpus=all -p 10098:10095 -it --privileged=true -v /root:/workspace/models registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-gpu-0.2.0 ``` Introduction to command parameters: diff --git a/runtime/docs/SDK_advanced_guide_offline_gpu_zh.md b/runtime/docs/SDK_advanced_guide_offline_gpu_zh.md index 86c532df6..a2706066e 100644 --- a/runtime/docs/SDK_advanced_guide_offline_gpu_zh.md +++ b/runtime/docs/SDK_advanced_guide_offline_gpu_zh.md @@ -10,6 +10,7 @@ FunASR离线文件转写GPU软件包,提供了一款功能强大的语音离 | 时间 | 详情 | 镜像版本 | 镜像ID | |------------|---------------------------------------------------|------------------------------|--------------| +| 2024.09.26 | 修复显存泄漏 | funasr-runtime-sdk-gpu-0.2.0 | d280bf7e495b | | 2024.07.01 | 优化bladedisc模型兼容性问题 | funasr-runtime-sdk-gpu-0.1.1 | 8875cbf9b99e | | 2024.06.27 | 离线文件转写服务GPU版本1.0 发布 | funasr-runtime-sdk-gpu-0.1.0 | b86066f4d018 | @@ -40,11 +41,11 @@ docker安装失败请参考 [Docker Installation](https://alibaba-damo-academy.g ```shell sudo docker pull \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-gpu-0.1.1 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-gpu-0.2.0 mkdir -p ./funasr-runtime-resources/models sudo docker run --gpus=all -p 10098:10095 -it --privileged=true \ -v $PWD/funasr-runtime-resources/models:/workspace/models \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-gpu-0.1.1 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-gpu-0.2.0 ``` ### 服务端启动 diff --git a/runtime/docs/SDK_advanced_guide_offline_zh.md b/runtime/docs/SDK_advanced_guide_offline_zh.md index 3fd822d03..546943bf1 100644 --- a/runtime/docs/SDK_advanced_guide_offline_zh.md +++ b/runtime/docs/SDK_advanced_guide_offline_zh.md @@ -10,6 +10,7 @@ FunASR离线文件转写软件包,提供了一款功能强大的语音离线 | 时间 | 详情 | 镜像版本 | 镜像ID | |------------|---------------------------------------------------|------------------------------|--------------| +| 2024.09.26 | 修复内存泄漏、支持SensevoiceSmall onnx模型 | funasr-runtime-sdk-cpu-0.4.6 | 8651c6b8a1ae | | 2024.05.15 | 适配FunASR 1.0模型结构 | funasr-runtime-sdk-cpu-0.4.5 | 058b9882ae67 | | 2024.03.05 | docker镜像支持arm64平台,升级modelscope版本 | funasr-runtime-sdk-cpu-0.4.4 | 2dc87b86dc49 | | 2024.01.25 | 优化vad数据处理方式,大幅降低峰值内存占用;内存泄漏优化| funasr-runtime-sdk-cpu-0.4.2 | befdc7b179ed | @@ -49,11 +50,11 @@ docker安装失败请参考 [Docker Installation](https://alibaba-damo-academy.g ```shell sudo docker pull \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.5 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.6 mkdir -p ./funasr-runtime-resources/models sudo docker run -p 10095:10095 -it --privileged=true \ -v $PWD/funasr-runtime-resources/models:/workspace/models \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.5 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.6 ``` ### 服务端启动 @@ -71,11 +72,13 @@ nohup bash run_server.sh \ --hotword /workspace/models/hotwords.txt > log.txt 2>&1 & # 如果您想关闭ssl,增加参数:--certfile 0 -# 如果您想使用时间戳或者nn热词模型进行部署,请设置--model-dir为对应模型: +# 如果您想使用SenseVoiceSmall模型、时间戳、nn热词模型进行部署,请设置--model-dir为对应模型: +# iic/SenseVoiceSmall-onnx # damo/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-onnx(时间戳) # damo/speech_paraformer-large-contextual_asr_nat-zh-cn-16k-common-vocab8404-onnx(nn热词) # 如果您想在服务端加载热词,请在宿主机文件./funasr-runtime-resources/models/hotwords.txt配置热词(docker映射地址为/workspace/models/hotwords.txt): # 每行一个热词,格式(热词 权重):阿里巴巴 20(注:热词理论上无限制,但为了兼顾性能和效果,建议热词长度不超过10,个数不超过1k,权重1~100) +# SenseVoiceSmall-onnx识别结果中“<|zh|><|NEUTRAL|><|Speech|> ”分别为对应的语种、情感、事件信息 ``` 如果您想定制ngram,参考文档([如何训练LM](./lm_train_tutorial.md)) diff --git a/runtime/docs/SDK_advanced_guide_online.md b/runtime/docs/SDK_advanced_guide_online.md index be9e5e8e8..e9a959279 100644 --- a/runtime/docs/SDK_advanced_guide_online.md +++ b/runtime/docs/SDK_advanced_guide_online.md @@ -8,6 +8,7 @@ FunASR Real-time Speech Recognition Software Package integrates real-time versio | TIME | INFO | IMAGE VERSION | IMAGE ID | |------------|-------------------------------------------------------------------------------------|-------------------------------------|--------------| +| 2024.09.26 | Fix memory leak | funasr-runtime-sdk-online-cpu-0.1.11 | e51a36c42771 | | 2024.05.15 | Adapting to FunASR 1.0 model structure | funasr-runtime-sdk-online-cpu-0.1.10 | 1c2adfcff84d | | 2024.03.05 | docker image supports ARM64 platform, update modelscope | funasr-runtime-sdk-online-cpu-0.1.9 | 4a875e08c7a2 | | 2024.01.25 | Optimization of the client-side | funasr-runtime-sdk-online-cpu-0.1.7 | 2aa23805572e | diff --git a/runtime/docs/SDK_advanced_guide_online_zh.md b/runtime/docs/SDK_advanced_guide_online_zh.md index 26ca4bcb3..8c17caaeb 100644 --- a/runtime/docs/SDK_advanced_guide_online_zh.md +++ b/runtime/docs/SDK_advanced_guide_online_zh.md @@ -12,6 +12,7 @@ FunASR实时语音听写软件包,集成了实时版本的语音端点检测 | 时间 | 详情 | 镜像版本 | 镜像ID | |:-----------|:----------------------------------|--------------------------------------|--------------| +| 2024.09.26 | 修复内存泄漏 | funasr-runtime-sdk-online-cpu-0.1.11 | e51a36c42771 | | 2024.05.15 | 适配FunASR 1.0模型结构 | funasr-runtime-sdk-online-cpu-0.1.10 | 1c2adfcff84d | | 2024.03.05 | docker镜像支持arm64平台,升级modelscope版本 | funasr-runtime-sdk-online-cpu-0.1.9 | 4a875e08c7a2 | | 2024.01.25 | 客户端优化| funasr-runtime-sdk-online-cpu-0.1.7 | 2aa23805572e | @@ -39,11 +40,11 @@ docker安装失败请参考 [Docker Installation](https://alibaba-damo-academy.g ```shell sudo docker pull \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-online-cpu-0.1.10 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-online-cpu-0.1.11 mkdir -p ./funasr-runtime-resources/models sudo docker run -p 10096:10095 -it --privileged=true \ -v $PWD/funasr-runtime-resources/models:/workspace/models \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-online-cpu-0.1.10 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-online-cpu-0.1.11 ``` ### 服务端启动 diff --git a/runtime/quick_start.md b/runtime/quick_start.md index d0f7c0e5c..28344786f 100644 --- a/runtime/quick_start.md +++ b/runtime/quick_start.md @@ -47,11 +47,11 @@ Use the following command to pull and launch the FunASR software package Docker ```shell sudo docker pull \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-online-cpu-0.1.10 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-online-cpu-0.1.11 mkdir -p ./funasr-runtime-resources/models sudo docker run -p 10096:10095 -it --privileged=true \ -v $PWD/funasr-runtime-resources/models:/workspace/models \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-online-cpu-0.1.10 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-online-cpu-0.1.11 ``` ###### Server Start @@ -93,11 +93,11 @@ Use the following command to pull and launch the FunASR software package Docker ```shell sudo docker pull \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.5 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.6 mkdir -p ./funasr-runtime-resources/models sudo docker run -p 10095:10095 -it --privileged=true \ -v $PWD/funasr-runtime-resources/models:/workspace/models \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.5 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.6 ``` ###### Server Start diff --git a/runtime/quick_start_zh.md b/runtime/quick_start_zh.md index 556602082..7ec9c585e 100644 --- a/runtime/quick_start_zh.md +++ b/runtime/quick_start_zh.md @@ -48,11 +48,11 @@ sudo bash install_docker.sh ```shell sudo docker pull \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-online-cpu-0.1.10 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-online-cpu-0.1.11 mkdir -p ./funasr-runtime-resources/models sudo docker run -p 10096:10095 -it --privileged=true \ -v $PWD/funasr-runtime-resources/models:/workspace/models \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-online-cpu-0.1.10 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-online-cpu-0.1.11 ``` ###### 服务端启动 @@ -92,11 +92,11 @@ python3 funasr_wss_client.py --host "127.0.0.1" --port 10096 --mode 2pass ```shell sudo docker pull \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.5 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.6 mkdir -p ./funasr-runtime-resources/models sudo docker run -p 10095:10095 -it --privileged=true \ -v $PWD/funasr-runtime-resources/models:/workspace/models \ - registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.5 + registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-cpu-0.4.6 ``` ###### 服务端启动 diff --git a/runtime/readme.md b/runtime/readme.md index 5466ede42..960ebe393 100644 --- a/runtime/readme.md +++ b/runtime/readme.md @@ -17,6 +17,7 @@ Currently, the FunASR runtime-SDK supports the deployment of file transcription To meet the needs of different users, we have prepared different tutorials with text and images for both novice and advanced developers. ### Whats-new +- 2024/09/26: File Transcription Service 2.0 GPU released, Fix GPU memory leak, docker image version funasr-runtime-sdk-gpu-0.2.0 (d280bf7e495b) - 2024/07/01: File Transcription Service 1.1 GPU released, optimize BladeDISC model compatibility issues, docker image version funasr-runtime-sdk-gpu-0.1.1 (8875cbf9b99e) - 2024/06/27: File Transcription Service 1.0 GPU released, supporting dynamic batch processing and multi-threading concurrency. In the long audio test set, the single-thread RTF is 0.0076, and multi-threads' speedup is 1200+ (compared to 330+ on CPU), ref to([docs](./docs/benchmark_libtorch_cpp.md)) , docker image version funasr-runtime-sdk-gpu-0.1.0 (b86066f4d018) @@ -32,6 +33,7 @@ Currently, the FunASR runtime-SDK supports the deployment of file transcription To meet the needs of different users, we have prepared different tutorials with text and images for both novice and advanced developers. ### Whats-new +- 2024/09/26: Fix memory leak, docker image version funasr-runtime-sdk-en-cpu-0.1.7 (f6c5a7b59eb6). - 2024/05/15: Adapting to FunASR 1.0 model structure, docker image version funasr-runtime-sdk-en-cpu-0.1.6 (84d781d07997). - 2024/03/05: docker image supports ARM64 platform, update modelscope, docker image version funasr-runtime-sdk-en-cpu-0.1.5 (7cca2abc5901). - 2024/01/25: Optimized the VAD (Voice Activity Detection) data processing method,significantly reducing peak memory usage,memory leak optimization, docker image version funasr-runtime-sdk-en-cpu-0.1.3 (c00f9ce7a195). @@ -58,6 +60,7 @@ The FunASR real-time speech-to-text service software package not only performs r In order to meet the needs of different users for different scenarios, different tutorials are prepared: ### Whats-new +- 2024/09/26: Real-time Transcription Service 1.11 released,Fix memory leak, docker image version funasr-runtime-sdk-online-cpu-0.1.11 (e51a36c42771) - 2024/05/15: Real-time Transcription Service 1.10 released,adapting to FunASR 1.0 model structure, docker image version funasr-runtime-sdk-online-cpu-0.1.10 (1c2adfcff84d) - 2024/03/05: Real-time Transcription Service 1.9 released,docker image supports ARM64 platform, update modelscope, docker image version funasr-runtime-sdk-online-cpu-0.1.9 (4a875e08c7a2) - 2024/01/25: Real-time Transcription Service 1.7 released,optimization of the client-side, docker image version funasr-runtime-sdk-online-cpu-0.1.7 (2aa23805572e) @@ -89,6 +92,7 @@ Currently, the FunASR runtime-SDK supports the deployment of file transcription To meet the needs of different users, we have prepared different tutorials with text and images for both novice and advanced developers. ### Whats-new +- 2024/09/26: File Transcription Service 4.6 released, Fix memory leak & Support the SensevoiceSmall onnx model, docker image version funasr-runtime-sdk-cpu-0.4.6 (8651c6b8a1ae) - 2024/05/15: File Transcription Service 4.5 released, adapting to FunASR 1.0 model structure, docker image version funasr-runtime-sdk-cpu-0.4.5 (058b9882ae67) - 2024/03/05: File Transcription Service 4.4 released, docker image supports ARM64 platform, update modelscope, docker image version funasr-runtime-sdk-cpu-0.4.4 (2dc87b86dc49) - 2024/01/25: File Transcription Service 4.2 released, optimized the VAD (Voice Activity Detection) data processing method, significantly reducing peak memory usage, memory leak optimization, docker image version funasr-runtime-sdk-cpu-0.4.2 (befdc7b179ed) diff --git a/runtime/readme_cn.md b/runtime/readme_cn.md index 3da312045..17c9a1a99 100644 --- a/runtime/readme_cn.md +++ b/runtime/readme_cn.md @@ -19,6 +19,7 @@ FunASR是由阿里巴巴通义实验室语音团队开源的一款语音识别 为了支持不同用户的需求,针对不同场景,准备了不同的图文教程: ### 最新动态 +- 2024/09/26: 中文离线文件转写服务GPU 2.0 发布,修复显存泄漏,docker镜像版本funasr-runtime-sdk-gpu-0.2.0 (d280bf7e495b) - 2024/07/01: 中文离线文件转写服务GPU 1.1 发布,优化bladedisc模型兼容性问题,docker镜像版本funasr-runtime-sdk-gpu-0.1.1 (8875cbf9b99e) - 2024/06/27: 中文离线文件转写服务GPU 1.0 发布,支持动态batch,支持多路并发,在长音频测试集上单线RTF为0.0076,多线加速比为1200+(CPU为330+),详见([文档](./docs/benchmark_libtorch_cpp.md)),docker镜像版本funasr-runtime-sdk-gpu-0.1.0 (b86066f4d018) @@ -33,6 +34,7 @@ FunASR是由阿里巴巴通义实验室语音团队开源的一款语音识别 为了支持不同用户的需求,针对不同场景,准备了不同的图文教程: ### 最新动态 +- 2024/09/26: 英文离线文件转写服务 1.7 发布,修复内存泄漏,docker镜像版本funasr-runtime-sdk-en-cpu-0.1.7 (f6c5a7b59eb6) - 2024/05/15: 英文离线文件转写服务 1.6 发布,适配FunASR 1.0模型结构,docker镜像版本funasr-runtime-sdk-en-cpu-0.1.6 (84d781d07997) - 2024/03/05: 英文离线文件转写服务 1.5 发布,docker镜像支持arm64平台,升级modelscope版本,docker镜像版本funasr-runtime-sdk-en-cpu-0.1.5 (7cca2abc5901) - 2024/01/25: 英文离线文件转写服务 1.3 发布,优化vad数据处理方式,大幅降低峰值内存占用,内存泄漏优化,docker镜像版本funasr-runtime-sdk-en-cpu-0.1.3 (c00f9ce7a195) @@ -51,6 +53,7 @@ FunASR实时语音听写服务软件包,既可以实时地进行语音转文 为了支持不同用户的需求,针对不同场景,准备了不同的图文教程: ### 最新动态 +- 2024/09/26: 中文实时语音听写服务 1.11 发布,修复内存泄漏,docker镜像版本funasr-runtime-sdk-online-cpu-0.1.11 (e51a36c42771) - 2024/05/15: 中文实时语音听写服务 1.10 发布,适配FunASR 1.0模型结构,docker镜像版本funasr-runtime-sdk-online-cpu-0.1.10 (1c2adfcff84d) - 2024/03/05: 中文实时语音听写服务 1.9 发布,docker镜像支持arm64平台,升级modelscope版本,docker镜像版本funasr-runtime-sdk-online-cpu-0.1.9 (4a875e08c7a2) - 2024/01/25: 中文实时语音听写服务 1.7 发布,客户端优化,docker镜像版本funasr-runtime-sdk-online-cpu-0.1.7 (2aa23805572e) @@ -74,6 +77,7 @@ FunASR实时语音听写服务软件包,既可以实时地进行语音转文 为了支持不同用户的需求,针对不同场景,准备了不同的图文教程: ### 最新动态 +- 2024/09/26: 中文离线文件转写服务 4.6 发布,修复内存泄漏、支持SensevoiceSmall onnx模型,docker镜像版本funasr-runtime-sdk-cpu-0.4.6 (8651c6b8a1ae) - 2024/05/15: 中文离线文件转写服务 4.5 发布,适配FunASR 1.0模型结构,docker镜像版本funasr-runtime-sdk-cpu-0.4.5 (058b9882ae67) - 2024/03/05: 中文离线文件转写服务 4.4 发布,docker镜像支持arm64平台,升级modelscope版本,docker镜像版本funasr-runtime-sdk-cpu-0.4.4 (2dc87b86dc49) - 2024/01/25: 中文离线文件转写服务 4.2 发布,优化vad数据处理方式,大幅降低峰值内存占用,内存泄漏优化,docker镜像版本funasr-runtime-sdk-cpu-0.4.2 (befdc7b179ed)