mirror of
https://github.com/modelscope/FunASR
synced 2025-09-15 14:48:36 +08:00
* add cmakelist * add paraformer-torch * add debug for funasr-onnx-offline * fix redefinition of jieba StdExtension.hpp * add loading torch models * update funasr-onnx-offline * add SwitchArg for wss-server * add SwitchArg for funasr-onnx-offline * update cmakelist * update funasr-onnx-offline-rtf * add define condition * add gpu define for offlne-stream * update com define * update offline-stream * update cmakelist * update func CompileHotwordEmbedding * add timestamp for paraformer-torch * add C10_USE_GLOG for paraformer-torch * update paraformer-torch * fix func FunASRWfstDecoderInit * update model.h * fix func FunASRWfstDecoderInit * fix tpass_stream * update paraformer-torch * add bladedisc for funasr-onnx-offline * update comdefine * update funasr-wss-server * add log for torch * fix GetValue BLADEDISC * fix log * update cmakelist * update warmup to 10 * update funasrruntime * add batch_size for wss-server * add batch for bins * add batch for offline-stream * add batch for paraformer * add batch for offline-stream * fix func SetBatchSize * add SetBatchSize for model * add SetBatchSize for model * fix func Forward * fix padding * update funasrruntime * add dec reset for batch * set batch default value * add argv for CutSplit * sort frame_queue * sorted msgs * fix FunOfflineInfer * add dynamic batch for fetch * fix FetchDynamic * update run_server.sh * update run_server.sh * cpp http post server support (#1739) * add cpp http server * add some comment * remove some comments * del debug infos * restore run_server.sh * adapt to new model struct * 修复了onnxruntime在macos下编译失败的错误 (#1748) * Add files via upload 增加macos的编译支持 * Add files via upload 增加macos支持 * Add files via upload target_link_directories(funasr PUBLIC ${ONNXRUNTIME_DIR}/lib) target_link_directories(funasr PUBLIC ${FFMPEG_DIR}/lib) 添加 if(APPLE) 限制 --------- Co-authored-by: Yabin Li <wucong.lyb@alibaba-inc.com> * Delete docs/images/wechat.png * Add files via upload * fixed the issues about seaco-onnx timestamp * fix bug (#1764) 当语音识别结果包含 `http` 时,标点符号预测会把它会被当成 url * fix empty asr result (#1765) 解码结果为空的语音片段,text 用空字符串 * docs * docs * docs * docs * docs * keep empty speech result (#1772) * docs * docs * update wechat QRcode * Add python funasr api support for websocket srv (#1777) * add python funasr_api supoort * change little to README.md * add core tools stream * modified a little * fix bug for timeout * support for buffer decode * add ffmpeg decode for buffer * auto frontend * auto frontend --------- Co-authored-by: 雾聪 <wucong.lyb@alibaba-inc.com> Co-authored-by: zhaomingwork <61895407+zhaomingwork@users.noreply.github.com> Co-authored-by: szsteven008 <97944818+szsteven008@users.noreply.github.com> Co-authored-by: Ephemeroptera <605686962@qq.com> Co-authored-by: 彭震东 <zhendong.peng@qq.com> Co-authored-by: Shi Xian <40013335+R1ckShi@users.noreply.github.com> Co-authored-by: 维石 <shixian.shi@alibaba-inc.com>
2.3 KiB
2.3 KiB
python funasr_api
This is the api for python to use funasr engine, only support 2pass server.
For install
Install websocket-client and ffmpeg
pip install websocket-client
apt install ffmpeg -y
recognizer examples
support many audio type as ffmpeg support, detail see FunASR/runtime/funasr_api/example.py
# create an recognizer
rcg = FunasrApi(
uri="wss://www.funasr.com:10096/"
)
# recognizer by filepath
text=rcg.rec_file("asr_example.mp3")
print("recognizer by filepath result=",text)
# recognizer by buffer
# rec_buf(audio_buf,ffmpeg_decode=False),set ffmpeg_decode=True if audio is not PCM or WAV type
with open("asr_example.wav", "rb") as f:
audio_bytes = f.read()
text=rcg.rec_buf(audio_bytes)
print("recognizer by buffer result=",text)
streaming recognizer examples,use FunasrApi.audio2wav to covert to WAV type if need
rcg = FunasrApi(
uri="wss://www.funasr.com:10096/"
)
#define call_back function for msg
def on_msg(msg):
print("stream msg=",msg)
stream=rcg.create_stream(msg_callback=on_msg)
wav_path = "asr_example.wav"
with open(wav_path, "rb") as f:
audio_bytes = f.read()
# use FunasrApi's audio2wav to covert other audio to PCM if needed
#import os
#from funasr_tools import FunasrTools
#file_ext=os.path.splitext(wav_path)[-1].upper()
#if not file_ext =="PCM" and not file_ext =="WAV":
# audio_bytes=FunasrTools.audio2wav(audio_bytes)
stride = int(60 * 10 / 10 / 1000 * 16000 * 2)
chunk_num = (len(audio_bytes) - 1) // stride + 1
for i in range(chunk_num):
beg = i * stride
data = audio_bytes[beg : beg + stride]
stream.feed_chunk(data)
final_result=stream.wait_for_end()
print("asr_example.wav stream_result=",final_result)
Acknowledge
- This project is maintained by FunASR community.
- We acknowledge zhaoming for contributing the websocket service.
- We acknowledge cgisky1980 for contributing the websocket service of offline model.