Commit Graph

132 Commits

Author SHA1 Message Date
游雁
5a8f379084 vad + asr 2023-12-21 21:08:46 +08:00
游雁
a1b0cd33d5 rename register tables 2023-12-21 14:20:21 +08:00
游雁
00ea1186f9 funasr2 2023-12-19 22:53:18 +08:00
游雁
0e622e694e funasr2 2023-12-19 21:58:14 +08:00
游雁
298ddd13fb funasr2 2023-12-15 23:46:41 +08:00
游雁
7012ca2efc funasr2 paraformer biciparaformer contextuaparaformer 2023-12-13 20:08:55 +08:00
游雁
806a03609d funasr2 paraformer biciparaformer contextuaparaformer 2023-12-13 19:43:13 +08:00
游雁
d77910eb6d funasr2 2023-12-11 13:42:40 +08:00
zhifu gao
81acb17544
update with main (#1152)
* v0.8.7

* update cmd version

* set openfst HAVE_BIN/HAVE_SCRIPT off for win32

* fix to support the new version of hotwords (#1137)

* update CMakeLists.txt

* Revert "update CMakeLists.txt"

This reverts commit 54bcd1f674.

* rm log.h for wins-websocket

* fix bug of websocket lock blocking

* update funasr-wss-server

* update model-revision by model name

* update funasr-wss-server-2pass

* add ERes2Net model support for speaker-attributed speech recognition.

* Update README.md (#1140)

minor fix

* automatically configure parameters such as decoder-thread-num

* update docs

* update docs

* update docs

* speaker-attributed speech recognition supports more models

* update spk inference

* remove never use code (#1151)

---------

Co-authored-by: 雾聪 <wucong.lyb@alibaba-inc.com>
Co-authored-by: 夜雨飘零 <yeyupiaoling@foxmail.com>
Co-authored-by: Ikko Eltociear Ashimine <eltociear@gmail.com>
Co-authored-by: Shi Xian <40013335+R1ckShi@users.noreply.github.com>
Co-authored-by: shixian.shi <shixian.shi@alibaba-inc.com>
2023-12-06 19:54:37 +08:00
游雁
27f31cd42b funasr2 2023-12-06 17:01:12 +08:00
shixian.shi
72fecc8e03 update asr_spk inference for shot utt 2023-11-24 14:29:33 +08:00
zhifu gao
b57b98364f
funasr v2 setup (#1106)
* funasr v2 setup
2023-11-22 00:36:35 +08:00
游雁
244c033fba python cli 2023-11-17 15:19:53 +08:00
北念
b717714951 add __init__.py 2023-10-20 09:53:47 +08:00
北念
d53b970aec fix import module dependency 2023-10-19 10:43:04 +08:00
北念
7c9b310e79 add whisper model inference pipeline 2023-10-12 14:31:39 +08:00
shixian.shi
78c78c39a9 big fix for speaker pipeline 2023-10-10 17:11:15 +08:00
Lizerui9926
35caed5dbc
Merge pull request #996 from alibaba-damo-academy/dev_lzr_en
update asr postprocess_utils
2023-10-10 16:00:50 +08:00
北念
a4de8b2a0a update asr postprocess_utils 2023-10-10 15:49:04 +08:00
shixian.shi
8a0930d682 paraformer-speaker inference pipeline 2023-10-10 11:35:42 +08:00
Yabin Li
61ed60695a
coauthor:duj12, add itn;add timestamp、hotword to 2pass; (#966)
* Add ITN,include openfst/gflags in onnxruntime/third_party.

* 2pass server support Hotword and Timestamp. The start_time of each segment need to be fix.

* add global time start and end of each frame(both online and offline), support two-pass timestamp(both segment and token level).

* update websocket cmake.

* 2pass server support itn, hw and tp.

* Add local build and run. Add timestamp in 2pass server, update cmakelist.

* fix filemode bug in h5, avoid 2pass wss server close before final.

* offline server add itn.

* offline server add ITN.

* update hotword model dir.

* Add Acknowledgement to WeTextProcessing(https://github.com/wenet-e2e/WeTextProcessing)

* adapted to original FunASR.

* adapted to itn timestamp hotword

* merge from main (#949)

* fix empty timestamp list inference

* punc large

* fix decoding_ind none bug

* fix decoding_ind none bug

* docs

* setup

* change eng punc in offline model

* update contextual export

* update proc for oov in hotword onnx inference

* add python http code (#940)

* funasr-onnx 0.2.2

* funasr-onnx 0.2.3

* bug fix in timestamp inference

* fix bug in timestamp inference

* Update preprocessor.py

---------

Co-authored-by: shixian.shi <shixian.shi@alibaba-inc.com>
Co-authored-by: 游雁 <zhifu.gzf@alibaba-inc.com>
Co-authored-by: haoneng.lhn <haoneng.lhn@alibaba-inc.com>
Co-authored-by: mengzhe.cmz <mengzhe.cmz@alibaba-inc.com>
Co-authored-by: Xian Shi <40013335+R1ckShi@users.noreply.github.com>
Co-authored-by: chenmengzheAAA <123789350+chenmengzheAAA@users.noreply.github.com>
Co-authored-by: 夜雨飘零 <yeyupiaoling@foxmail.com>

* update docs

* update deploy_tools

---------

Co-authored-by: dujing <dujing@xmov.ai>
Co-authored-by: Jean Du <37294470+duj12@users.noreply.github.com>
Co-authored-by: shixian.shi <shixian.shi@alibaba-inc.com>
Co-authored-by: 游雁 <zhifu.gzf@alibaba-inc.com>
Co-authored-by: haoneng.lhn <haoneng.lhn@alibaba-inc.com>
Co-authored-by: mengzhe.cmz <mengzhe.cmz@alibaba-inc.com>
Co-authored-by: Xian Shi <40013335+R1ckShi@users.noreply.github.com>
Co-authored-by: chenmengzheAAA <123789350+chenmengzheAAA@users.noreply.github.com>
Co-authored-by: 夜雨飘零 <yeyupiaoling@foxmail.com>
2023-09-19 10:09:58 +08:00
Marlowe
c29e5071bd
fix error:reference before assignment for write_flag when using kaldi_ark (#904)
Co-authored-by: ZihanLiao <liaozihan1@xdf.cn>
2023-09-01 01:15:32 +08:00
shixian.shi
c73d1a8e81 update func cif_wo_hidden 2023-08-14 19:31:55 +08:00
shixian.shi
e740ec08b7 fix bug for timestamp inference 2023-08-10 16:08:26 +08:00
jmwang66
651737380b
Merge branch 'main' into dev_wjm_modelscope 2023-08-09 16:48:02 +08:00
jmwang66
993f226f35
Merge pull request #806 from alibaba-damo-academy/dev_wjm_sd
update eend-ola
2023-08-07 16:09:39 +08:00
志浩
4bc6db3ef8 TOLD: add TOLD/SOND recipe on callhome 2023-08-01 17:03:39 +08:00
嘉渊
2d22eaba7c update 2023-07-27 14:46:40 +08:00
shixian.shi
1054daf44a remove assert in ts_prediction_lfr6_standard 2023-07-20 18:57:16 +08:00
游雁
7a207808bc np fix bug 2023-07-06 19:13:00 +08:00
嘉渊
8b7c32b0f6 update eend_ola 2023-07-06 16:28:31 +08:00
游雁
c798a0c7bb export 2023-06-29 20:00:02 +08:00
jmwang66
98abc0e5ac
update setup (#686)
* update

* update setup

* update setup

* update setup

* update setup

* update setup

* update setup

* update

* update

* update setup
2023-06-29 16:30:39 +08:00
游雁
f4eb3174b3 sdk utils revison 2023-06-28 14:45:54 +08:00
游雁
7da5b31e25 Merge branch 'main' of github.com:alibaba-damo-academy/FunASR
add
2023-06-27 16:57:43 +08:00
游雁
c506a5f67e sdk utils 2023-06-27 16:57:13 +08:00
haoneng.lhn
8042c51745 bug fix 2023-06-27 15:48:22 +08:00
haoneng.lhn
a0defaf40e fix loading multi-channel mp3 file bug 2023-06-27 13:12:42 +08:00
haoneng.lhn
e677eb4b13 fix torchaudio load mp3 bug 2023-06-26 17:13:22 +08:00
aky15
fe63877bc8
Dev aky2 (#561)
* support resume model from pai

* add padding for streaming rnnt conv input

* fix large dataset training bug

* bug fix

* modify aishell rnnt egs to support wav input

* add libri_100 rnnt recipe

* bug fix

---------

Co-authored-by: aky15 <ankeyu.aky@11.17.44.249>
2023-05-30 17:05:34 +08:00
嘉渊
3a15e5392b update repo 2023-05-26 10:56:17 +08:00
嘉渊
f5b35ba23d update repo 2023-05-17 15:28:19 +08:00
嘉渊
d1374e9c80 update repo 2023-05-17 15:15:51 +08:00
嘉渊
8629d30d3d update repo 2023-05-16 15:06:25 +08:00
嘉渊
86768c77c7 update repo 2023-05-16 11:02:59 +08:00
嘉渊
08213a9ee6 update repo 2023-05-16 10:47:08 +08:00
嘉渊
72b95eb051 update repo 2023-05-15 14:29:49 +08:00
嘉渊
52002aaddf update repo 2023-05-15 14:26:14 +08:00
嘉渊
e23b7dc34f update repo 2023-05-15 11:05:43 +08:00
嘉渊
688fb902dd update repo 2023-05-15 10:59:59 +08:00