雾聪
9e11c526b0
runtime_sdk_download_tool
2024-01-25 11:15:47 +08:00
zhifu gao
2c3183b611
Funasr1.0 ( #1284 )
...
* funasr1.0 update
* funasr1.0 paraformer-en
* update with main (#1281 )
* Funasr1.0 (#1279 )
* funasr1.0 update
* funasr1.0 paraformer-en
* update speaker infer
* update device
* update device
* update raw_text
* update infer
* update
* update infer
* bug fix
---------
Co-authored-by: shixian.shi <shixian.shi@alibaba-inc.com>
Co-authored-by: Shi Xian <40013335+R1ckShi@users.noreply.github.com>
* punc bugfix
* reduce_channels
---------
Co-authored-by: shixian.shi <shixian.shi@alibaba-inc.com>
Co-authored-by: Shi Xian <40013335+R1ckShi@users.noreply.github.com>
2024-01-23 11:16:34 +08:00
shixian.shi
1233c0d3ff
code update
2024-01-15 20:34:47 +08:00
游雁
40d1f80030
funasr1.0 streaming demo
2024-01-12 12:05:08 +08:00
游雁
cf2f14345a
funasr1.0 fsmn-vad streaming
2024-01-12 00:01:25 +08:00
游雁
c0e72dd1ba
Merge branch 'funasr1.0' of github.com:alibaba-damo-academy/FunASR into funasr1.0
...
add
2024-01-11 17:36:59 +08:00
游雁
a75bbb028e
funasr1.0 paraformer_streaming
2024-01-11 17:36:30 +08:00
shixian.shi
7037971392
update asr with speaker
2024-01-11 17:03:00 +08:00
游雁
1028a8a036
funasr1.0 paraformer_streaming WavFrontendOnline
2024-01-10 17:42:53 +08:00
游雁
d8b586e02c
funasr1.0 modelscope
2024-01-09 20:33:12 +08:00
游雁
f14f9f8d15
funasr1.0 infer url modelscope
2024-01-09 00:13:51 +08:00
游雁
e6a7bbe1ca
load_audio_text_image_video
2024-01-05 17:00:11 +08:00
游雁
4f98546f36
load_audio_text_image_video
2024-01-05 16:55:07 +08:00
游雁
32905d8cde
funasr1.0
2024-01-05 11:52:48 +08:00
游雁
5a8f379084
vad + asr
2023-12-21 21:08:46 +08:00
游雁
a1b0cd33d5
rename register tables
2023-12-21 14:20:21 +08:00
游雁
00ea1186f9
funasr2
2023-12-19 22:53:18 +08:00
游雁
0e622e694e
funasr2
2023-12-19 21:58:14 +08:00
游雁
298ddd13fb
funasr2
2023-12-15 23:46:41 +08:00
游雁
7012ca2efc
funasr2 paraformer biciparaformer contextuaparaformer
2023-12-13 20:08:55 +08:00
游雁
806a03609d
funasr2 paraformer biciparaformer contextuaparaformer
2023-12-13 19:43:13 +08:00
游雁
d77910eb6d
funasr2
2023-12-11 13:42:40 +08:00
zhifu gao
81acb17544
update with main ( #1152 )
...
* v0.8.7
* update cmd version
* set openfst HAVE_BIN/HAVE_SCRIPT off for win32
* 修复为支持新版本的热词 (#1137 )
* update CMakeLists.txt
* Revert "update CMakeLists.txt"
This reverts commit 54bcd1f674 .
* rm log.h for wins-websocket
* fix bug of websocket lock blocking
* update funasr-wss-server
* update model-revision by model name
* update funasr-wss-server-2pass
* 增加分角色语音识别对ERes2Net模型的支持。
* Update README.md (#1140 )
minor fix
* automatically configure parameters such as decoder-thread-num
* update docs
* update docs
* update docs
* 分角色语音识别支持更多的模型
* update spk inference
* remove never use code (#1151 )
---------
Co-authored-by: 雾聪 <wucong.lyb@alibaba-inc.com>
Co-authored-by: 夜雨飘零 <yeyupiaoling@foxmail.com>
Co-authored-by: Ikko Eltociear Ashimine <eltociear@gmail.com>
Co-authored-by: Shi Xian <40013335+R1ckShi@users.noreply.github.com>
Co-authored-by: shixian.shi <shixian.shi@alibaba-inc.com>
2023-12-06 19:54:37 +08:00
游雁
27f31cd42b
funasr2
2023-12-06 17:01:12 +08:00
shixian.shi
72fecc8e03
update asr_spk inference for shot utt
2023-11-24 14:29:33 +08:00
zhifu gao
b57b98364f
funasr v2 setup ( #1106 )
...
* funasr v2 setup
2023-11-22 00:36:35 +08:00
游雁
244c033fba
python cli
2023-11-17 15:19:53 +08:00
北念
b717714951
add __init__.py
2023-10-20 09:53:47 +08:00
北念
d53b970aec
fix import module dependency
2023-10-19 10:43:04 +08:00
北念
7c9b310e79
add whisper model inference pipeline
2023-10-12 14:31:39 +08:00
shixian.shi
78c78c39a9
big fix for speaker pipeline
2023-10-10 17:11:15 +08:00
Lizerui9926
35caed5dbc
Merge pull request #996 from alibaba-damo-academy/dev_lzr_en
...
update asr postprocess_utils
2023-10-10 16:00:50 +08:00
北念
a4de8b2a0a
update asr postprocess_utils
2023-10-10 15:49:04 +08:00
shixian.shi
8a0930d682
paraformer-speaker inference pipeline
2023-10-10 11:35:42 +08:00
Yabin Li
61ed60695a
coauthor:duj12, add itn;add timestamp、hotword to 2pass; ( #966 )
...
* Add ITN,include openfst/gflags in onnxruntime/third_party.
* 2pass server support Hotword and Timestamp. The start_time of each segment need to be fix.
* add global time start and end of each frame(both online and offline), support two-pass timestamp(both segment and token level).
* update websocket cmake.
* 2pass server support itn, hw and tp.
* Add local build and run. Add timestamp in 2pass server, update cmakelist.
* fix filemode bug in h5, avoid 2pass wss server close before final.
* offline server add itn.
* offline server add ITN.
* update hotword model dir.
* Add Acknowledgement to WeTextProcessing(https://github.com/wenet-e2e/WeTextProcessing )
* adapted to original FunASR.
* adapted to itn timestamp hotword
* merge from main (#949 )
* fix empty timestamp list inference
* punc large
* fix decoding_ind none bug
* fix decoding_ind none bug
* docs
* setup
* change eng punc in offline model
* update contextual export
* update proc for oov in hotword onnx inference
* add python http code (#940 )
* funasr-onnx 0.2.2
* funasr-onnx 0.2.3
* bug fix in timestamp inference
* fix bug in timestamp inference
* Update preprocessor.py
---------
Co-authored-by: shixian.shi <shixian.shi@alibaba-inc.com>
Co-authored-by: 游雁 <zhifu.gzf@alibaba-inc.com>
Co-authored-by: haoneng.lhn <haoneng.lhn@alibaba-inc.com>
Co-authored-by: mengzhe.cmz <mengzhe.cmz@alibaba-inc.com>
Co-authored-by: Xian Shi <40013335+R1ckShi@users.noreply.github.com>
Co-authored-by: chenmengzheAAA <123789350+chenmengzheAAA@users.noreply.github.com>
Co-authored-by: 夜雨飘零 <yeyupiaoling@foxmail.com>
* update docs
* update deploy_tools
---------
Co-authored-by: dujing <dujing@xmov.ai>
Co-authored-by: Jean Du <37294470+duj12@users.noreply.github.com>
Co-authored-by: shixian.shi <shixian.shi@alibaba-inc.com>
Co-authored-by: 游雁 <zhifu.gzf@alibaba-inc.com>
Co-authored-by: haoneng.lhn <haoneng.lhn@alibaba-inc.com>
Co-authored-by: mengzhe.cmz <mengzhe.cmz@alibaba-inc.com>
Co-authored-by: Xian Shi <40013335+R1ckShi@users.noreply.github.com>
Co-authored-by: chenmengzheAAA <123789350+chenmengzheAAA@users.noreply.github.com>
Co-authored-by: 夜雨飘零 <yeyupiaoling@foxmail.com>
2023-09-19 10:09:58 +08:00
Marlowe
c29e5071bd
fix error:reference before assignment for write_flag when using kaldi_ark ( #904 )
...
Co-authored-by: ZihanLiao <liaozihan1@xdf.cn>
2023-09-01 01:15:32 +08:00
shixian.shi
c73d1a8e81
update func cif_wo_hidden
2023-08-14 19:31:55 +08:00
shixian.shi
e740ec08b7
fix bug for timestamp inference
2023-08-10 16:08:26 +08:00
jmwang66
651737380b
Merge branch 'main' into dev_wjm_modelscope
2023-08-09 16:48:02 +08:00
jmwang66
993f226f35
Merge pull request #806 from alibaba-damo-academy/dev_wjm_sd
...
update eend-ola
2023-08-07 16:09:39 +08:00
志浩
4bc6db3ef8
TOLD: add TOLD/SOND recipe on callhome
2023-08-01 17:03:39 +08:00
嘉渊
2d22eaba7c
update
2023-07-27 14:46:40 +08:00
shixian.shi
1054daf44a
remove assert in ts_prediction_lfr6_standard
2023-07-20 18:57:16 +08:00
游雁
7a207808bc
np fix bug
2023-07-06 19:13:00 +08:00
嘉渊
8b7c32b0f6
update eend_ola
2023-07-06 16:28:31 +08:00
游雁
c798a0c7bb
export
2023-06-29 20:00:02 +08:00
jmwang66
98abc0e5ac
update setup ( #686 )
...
* update
* update setup
* update setup
* update setup
* update setup
* update setup
* update setup
* update
* update
* update setup
2023-06-29 16:30:39 +08:00
游雁
f4eb3174b3
sdk utils revison
2023-06-28 14:45:54 +08:00
游雁
7da5b31e25
Merge branch 'main' of github.com:alibaba-damo-academy/FunASR
...
add
2023-06-27 16:57:43 +08:00
游雁
c506a5f67e
sdk utils
2023-06-27 16:57:13 +08:00
haoneng.lhn
8042c51745
bug fix
2023-06-27 15:48:22 +08:00
haoneng.lhn
a0defaf40e
fix loading multi-channel mp3 file bug
2023-06-27 13:12:42 +08:00
haoneng.lhn
e677eb4b13
fix torchaudio load mp3 bug
2023-06-26 17:13:22 +08:00
aky15
fe63877bc8
Dev aky2 ( #561 )
...
* support resume model from pai
* add padding for streaming rnnt conv input
* fix large dataset training bug
* bug fix
* modify aishell rnnt egs to support wav input
* add libri_100 rnnt recipe
* bug fix
---------
Co-authored-by: aky15 <ankeyu.aky@11.17.44.249>
2023-05-30 17:05:34 +08:00
嘉渊
3a15e5392b
update repo
2023-05-26 10:56:17 +08:00
嘉渊
f5b35ba23d
update repo
2023-05-17 15:28:19 +08:00
嘉渊
d1374e9c80
update repo
2023-05-17 15:15:51 +08:00
嘉渊
8629d30d3d
update repo
2023-05-16 15:06:25 +08:00
嘉渊
86768c77c7
update repo
2023-05-16 11:02:59 +08:00
嘉渊
08213a9ee6
update repo
2023-05-16 10:47:08 +08:00
嘉渊
72b95eb051
update repo
2023-05-15 14:29:49 +08:00
嘉渊
52002aaddf
update repo
2023-05-15 14:26:14 +08:00
嘉渊
e23b7dc34f
update repo
2023-05-15 11:05:43 +08:00
嘉渊
688fb902dd
update repo
2023-05-15 10:59:59 +08:00
游雁
9dad49c3a1
websocket new version for offline 2pass send bytes
2023-05-13 00:20:19 +08:00
嘉渊
c2bf708f87
update repo
2023-05-11 16:10:59 +08:00
游雁
2b458b1a71
paraformer long batch infer sort
2023-05-10 21:59:41 +08:00
游雁
a97daeb247
paraformer long batch infer
2023-05-10 19:08:54 +08:00
shixian.shi
633d68f354
update timestamp_tools
2023-05-10 11:32:36 +08:00
shixian.shi
2502160fcd
update sentence timestamp for ClipVedio
2023-05-09 21:02:22 +08:00
smohan-speech
3b7e4b0d34
add speaker-attributed ASR task for alimeeting
2023-05-06 16:38:09 +08:00
smohan-speech
a73123bcfc
add speaker-attributed ASR task for alimeeting
2023-05-06 16:17:48 +08:00
嘉渊
6de0f96851
update
2023-04-25 16:41:15 +08:00
嘉渊
ce30011976
update
2023-04-25 16:37:16 +08:00
嘉渊
6e66a74ae6
update
2023-04-25 16:33:00 +08:00
嘉渊
7436acc5dd
update
2023-04-25 16:29:39 +08:00
嘉渊
70f9a8f890
update
2023-04-25 01:29:12 +08:00
嘉渊
e86b95e747
update
2023-04-24 22:57:04 +08:00
嘉渊
f2b9780b29
update
2023-04-24 22:47:00 +08:00
haoneng.lhn
a8e92e4fb4
update data filtering recipe
2023-04-23 15:03:56 +08:00
speech_asr
993fdd8ecf
update
2023-04-20 17:01:47 +08:00
speech_asr
eac9f111b5
update
2023-04-20 16:59:26 +08:00
speech_asr
3e77fd4430
update
2023-04-20 16:41:22 +08:00
speech_asr
d6cc6896e4
update
2023-04-20 16:33:30 +08:00
speech_asr
518465d089
update
2023-04-20 16:07:01 +08:00
speech_asr
a29166b9a0
update
2023-04-20 16:03:54 +08:00
speech_asr
200d1ede05
update
2023-04-20 15:56:25 +08:00
speech_asr
c452b2a3c7
update
2023-04-20 15:43:29 +08:00
speech_asr
68852c3072
update
2023-04-20 15:35:25 +08:00
speech_asr
43c30967b0
update
2023-04-20 11:48:19 +08:00
speech_asr
02f2a3c2ec
update
2023-04-20 11:38:20 +08:00
speech_asr
7522c59e74
update
2023-04-20 11:16:49 +08:00
speech_asr
680cdb55bb
update
2023-04-19 14:49:36 +08:00
speech_asr
58fb22cb2b
update
2023-04-19 10:09:51 +08:00
speech_asr
05d4176e88
update
2023-04-18 19:28:33 +08:00
speech_asr
831d00aec2
update
2023-04-17 16:26:40 +08:00
speech_asr
d9ad40bf6f
update
2023-04-17 11:45:41 +08:00
speech_asr
6659c37d81
update
2023-04-17 11:23:37 +08:00
speech_asr
bd7455ec7d
update
2023-04-12 10:43:01 +08:00
北念
cf843d144a
fix compute cer problems
2023-04-04 14:26:22 +08:00