mirror of https://github.com/modelscope/FunASR synced 2025-09-15 14:48:36 +08:00

History

lyblsgo d19c3a0399 modify include files		2023-04-23 12:51:58 +08:00
..
images	cpp onnxruntime	2023-03-03 22:07:58 +08:00
include	rename executable file;rm some unnecessary deps	2023-04-21 17:12:10 +08:00
src	modify include files	2023-04-23 12:51:58 +08:00
third_party	mv knf to thirdparty	2023-04-22 20:24:46 +08:00
wave	update files.	2023-03-07 12:13:40 +08:00
win	cpp onnxruntime	2023-03-03 22:07:58 +08:00
CMakeLists.txt	mv knf to thirdparty	2023-04-22 20:24:46 +08:00
CMakeSettings.json	update files.	2023-03-07 12:15:07 +08:00
readme.md	modify onnx readme	2023-04-21 18:51:50 +08:00

readme.md

ONNXRuntime-cpp

Export the model

Install modelscope and funasr

pip3 install torch torchaudio
pip install -U modelscope
pip install -U funasr

Export onnx model

python -m funasr.export.export_model --model-name damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch --export-dir ./export --type onnx --quantize True

Building for Linux/Unix

Download onnxruntime

# download an appropriate onnxruntime from https://github.com/microsoft/onnxruntime/releases/tag/v1.14.0
# here we get a copy of onnxruntime for linux 64
wget https://github.com/microsoft/onnxruntime/releases/download/v1.14.0/onnxruntime-linux-x64-1.14.0.tgz
tar -zxvf onnxruntime-linux-x64-1.14.0.tgz

Install openblas

sudo apt-get install libopenblas-dev #ubuntu
# sudo yum -y install openblas-devel #centos

Build runtime

git clone https://github.com/alibaba-damo-academy/FunASR.git && cd funasr/runtime/onnxruntime
mkdir build && cd build
cmake  -DCMAKE_BUILD_TYPE=release .. -DONNXRUNTIME_DIR=/path/to/onnxruntime-linux-x64-1.14.0
make

Building for Windows

Ref to win/

Run the demo

funasr-onnx-offline /path/models_dir /path/wave_file quantize(true or false) use_vad(true or false)

The structure of /path/models_dir

config.yaml, am.mvn, model.onnx(or model_quant.onnx), (vad_model.onnx, vad.mvn if you use vad)

Acknowledge

This project is maintained by FunASR community.
We acknowledge mayong for contributing the onnxruntime(cpp api).
We borrowed a lot of code from FastASR for audio frontend and text-postprocess.