FunASR

mirror of https://github.com/modelscope/FunASR synced 2025-09-15 14:48:36 +08:00

Author	SHA1	Message	Date
维石	8a03879937	update sensevoice with pitch	2024-09-29 17:37:55 +08:00
游雁	7ce917b596	extract	2024-09-25 20:24:14 +08:00
游雁	20280f5db5	extract	2024-09-25 20:21:45 +08:00
游雁	925227e9f1	extract	2024-09-25 20:19:39 +08:00
游雁	6d2434f257	token extract	2024-09-25 11:21:13 +08:00
游雁	09bb6d8d03	token extract	2024-09-25 10:52:49 +08:00
志浩	49903ec044	add support mixture of kaldi_ark or sound	2024-09-24 20:17:06 +08:00
志浩	4b840fd668	add batch support for token extraction	2024-09-24 17:59:02 +08:00
志浩	c37e04ea49	add batch support for token extraction	2024-09-24 17:45:55 +08:00
志浩	4f96a06d13	add batch support for token extraction	2024-09-24 17:33:41 +08:00
志浩	bc0608d380	add extract token run_mode	2024-09-24 17:20:33 +08:00
志浩	ce5b79d234	add extract token run_mode	2024-09-24 17:15:30 +08:00
志浩	1fb762d9be	add extract token run_mode	2024-09-24 17:06:52 +08:00
志浩	0fa8e2976f	add extract token run_mode	2024-09-24 17:03:17 +08:00
志浩	6fc9354924	add text ibest writer for SenseVoiceL	2024-09-24 16:32:01 +08:00
志浩	797bd57f91	add text ibest writer for SenseVoiceL	2024-09-24 15:54:42 +08:00
志浩	e9557a0ee7	insert VQ into sensevoice encoder	2024-09-18 11:00:16 +08:00
游雁	032d429a94	wss llm	2024-08-26 17:52:30 +08:00
游雁	11586f7ebd	update	2024-08-02 11:20:07 +08:00
游雁	5dd4495406	update	2024-08-02 00:36:30 +08:00
游雁	16001677ac	update	2024-08-01 23:50:59 +08:00
游雁	d577dda0ed	update	2024-08-01 23:40:53 +08:00
游雁	5851fc53cd	sdpa bugfix	2024-07-24 15:02:50 +08:00
游雁	20d32f68e8	sdpa bugfix	2024-07-24 01:08:52 +08:00
游雁	54e630159d	sdpa bugfix	2024-07-24 00:57:10 +08:00
游雁	609c0e7e0d	sdpa bugfix	2024-07-24 00:40:52 +08:00
游雁	b9bc982e4f	sdpa bugfix	2024-07-24 00:33:04 +08:00
游雁	318d81be4a	sdpa bugfix	2024-07-24 00:28:56 +08:00
游雁	dfc52059c0	sensevoicesmall	2024-07-23 19:00:14 +08:00
游雁	90b1557996	update	2024-07-22 13:49:12 +08:00
游雁	6aacee8f9e	update	2024-07-18 16:31:49 +08:00
游雁	340b6efef2	update	2024-07-18 13:48:51 +08:00
游雁	b03c8a5c35	update	2024-07-18 13:48:27 +08:00
游雁	b84a203d16	inference	2024-06-24 17:05:43 +08:00
lzr265946	5ac34941d1	sensevoice	2024-06-21 17:18:16 +08:00
lzr265946	1dbbfe13e8	sensevoice	2024-06-21 16:57:05 +08:00
lzr265946	cc141d0fb0	sensevoice	2024-06-21 15:09:30 +08:00
lzr265946	04b1015cb4	Merge branch 'dev_gzf_deepspeed' of https://github.com/alibaba-damo-academy/FunASR into dev_gzf_deepspeed	2024-06-21 14:50:44 +08:00
北念	a6f8fe789b	sensevoice	2024-06-21 11:51:18 +08:00
lzr265946	83d24b95fe	Merge branch 'dev_gzf_deepspeed' of https://github.com/alibaba-damo-academy/FunASR into dev_gzf_deepspeed	2024-06-21 11:43:41 +08:00
游雁	a2071e3726	sensevoice	2024-06-21 10:04:27 +08:00
lzr265946	73c8868af7	Merge branch 'dev_gzf_deepspeed' of https://github.com/alibaba-damo-academy/FunASR into dev_gzf_deepspeed	2024-06-20 20:01:28 +08:00
zhifu gao	ad99b262eb	update with main (#1817 ) * add cmakelist * add paraformer-torch * add debug for funasr-onnx-offline * fix redefinition of jieba StdExtension.hpp * add loading torch models * update funasr-onnx-offline * add SwitchArg for wss-server * add SwitchArg for funasr-onnx-offline * update cmakelist * update funasr-onnx-offline-rtf * add define condition * add gpu define for offlne-stream * update com define * update offline-stream * update cmakelist * update func CompileHotwordEmbedding * add timestamp for paraformer-torch * add C10_USE_GLOG for paraformer-torch * update paraformer-torch * fix func FunASRWfstDecoderInit * update model.h * fix func FunASRWfstDecoderInit * fix tpass_stream * update paraformer-torch * add bladedisc for funasr-onnx-offline * update comdefine * update funasr-wss-server * add log for torch * fix GetValue BLADEDISC * fix log * update cmakelist * update warmup to 10 * update funasrruntime * add batch_size for wss-server * add batch for bins * add batch for offline-stream * add batch for paraformer * add batch for offline-stream * fix func SetBatchSize * add SetBatchSize for model * add SetBatchSize for model * fix func Forward * fix padding * update funasrruntime * add dec reset for batch * set batch default value * add argv for CutSplit * sort frame_queue * sorted msgs * fix FunOfflineInfer * add dynamic batch for fetch * fix FetchDynamic * update run_server.sh * update run_server.sh * cpp http post server support (#1739) * add cpp http server * add some comment * remove some comments * del debug infos * restore run_server.sh * adapt to new model struct * Fix onnxruntime build failure on macOS (#1748) * Add files via upload add macOS build support * Add files via upload add macOS support * Add files via upload target_link_directories(funasr PUBLIC ${ONNXRUNTIME_DIR}/lib) target_link_directories(funasr PUBLIC ${FFMPEG_DIR}/lib) add if(APPLE) guard --------- Co-authored-by: Yabin Li <wucong.lyb@alibaba-inc.com> * Delete docs/images/wechat.png * Add files via upload * fixed the issues about seaco-onnx timestamp * fix bug (#1764) when the speech recognition result contains `http`, punctuation prediction treats it as a URL * fix empty asr result (#1765) for speech segments with an empty decoding result, use an empty string for text * update export * update export * docs * docs * update export name * docs * update * docs * docs * keep empty speech result (#1772) * docs * docs * update wechat QRcode * Add python funasr api support for websocket srv (#1777) * add python funasr_api supoort * change little to README.md * add core tools stream * modified a little * fix bug for timeout * support for buffer decode * add ffmpeg decode for buffer * libtorch demo * update libtorch infer * update utils * update demo * update demo * update libtorch inference * update model class * update seaco paraformer * bug fix * bug fix * auto frontend * auto frontend * auto frontend * auto frontend * auto frontend * auto frontend * auto frontend * auto frontend * Dev gzf exp (#1785) * resume from step * batch * batch * batch * batch * batch * batch * batch * batch * batch * batch * batch * batch * batch * batch * batch * train_loss_avg train_acc_avg * train_loss_avg train_acc_avg * train_loss_avg train_acc_avg * log step * wav is not exist * wav is not exist * decoding * decoding * decoding * wechat * decoding key * decoding key * decoding key * decoding key * decoding key * decoding key * dynamic batch * start_data_split_i=0 * total_time/accum_grad * total_time/accum_grad * total_time/accum_grad * update avg slice * update avg slice * sensevoice sanm * sensevoice sanm * sensevoice sanm --------- Co-authored-by: 北念 <lzr265946@alibaba-inc.com> * auto frontend * update paraformer timestamp * [Optimization] support bladedisc fp16 optimization (#1790) * add cif_v1 and cif_export * Update SDK_advanced_guide_offline_zh.md * add cif_wo_hidden_v1 * [fix] fix empty asr result (#1794) * english timestamp for valilla paraformer * wechat * [fix] better solution for handling empty result (#1796) * update scripts * modify the qformer adaptor (#1804) Co-authored-by: nichongjia-2007 <nichongjia@gmail.com> * add ctc inference code (#1806) Co-authored-by: haoneng.lhn <haoneng.lhn@alibaba-inc.com> * Update auto_model.py fix the bug where the raw_text variable is reported as undefined when an empty string enters the speaker model * Update auto_model.py fix the undefined variable in spk_model after an empty string is recognized * update model name * fix paramter 'quantize' unused issue (#1813) Co-authored-by: ZihanLiao <liaozihan1@xdf.cn> * wechat * Update cif_predictor.py (#1811) * Update cif_predictor.py * modify cif_v1_export under extreme cases, max_label_len calculated by batch_len misaligns with token_num * Update cif_predictor.py torch.cumsum precision degradation, using float64 instead * update code --------- Co-authored-by: 雾聪 <wucong.lyb@alibaba-inc.com> Co-authored-by: zhaomingwork <61895407+zhaomingwork@users.noreply.github.com> Co-authored-by: szsteven008 <97944818+szsteven008@users.noreply.github.com> Co-authored-by: Ephemeroptera <605686962@qq.com> Co-authored-by: 彭震东 <zhendong.peng@qq.com> Co-authored-by: Shi Xian <40013335+R1ckShi@users.noreply.github.com> Co-authored-by: 维石 <shixian.shi@alibaba-inc.com> Co-authored-by: 北念 <lzr265946@alibaba-inc.com> Co-authored-by: xiaowan0322 <wanchen.swc@alibaba-inc.com> Co-authored-by: zhuangzhong <zhuangzhong@corp.netease.com> Co-authored-by: Xingchen Song(宋星辰) <xingchensong1996@163.com> Co-authored-by: nichongjia-2007 <nichongjia@gmail.com> Co-authored-by: haoneng.lhn <haoneng.lhn@alibaba-inc.com> Co-authored-by: liugz18 <57401541+liugz18@users.noreply.github.com> Co-authored-by: Marlowe <54339989+ZihanLiao@users.noreply.github.com> Co-authored-by: ZihanLiao <liaozihan1@xdf.cn> Co-authored-by: zhong zhuang <zhuangz@lamda.nju.edu.cn>	2024-06-19 10:27:21 +08:00
游雁	45d7aa9004	decoding	2024-06-19 10:26:40 +08:00
Shi Xian	6c467e6f0a	Merge pull request #1825 from modelscope/dev_libt Dev libt	2024-06-18 10:01:56 +08:00
维石	9377eed41e	update code	2024-06-17 20:20:00 +08:00
北念	ada76b6312	sensevoice	2024-06-17 13:36:22 +08:00
游雁	59bc02b089	decoding	2024-06-14 13:59:49 +08:00
游雁	67329a74a5	decoding	2024-06-14 11:04:49 +08:00
zhifu gao	32e7836645	update with main (#1786 ) * add cmakelist * add paraformer-torch * add debug for funasr-onnx-offline * fix redefinition of jieba StdExtension.hpp * add loading torch models * update funasr-onnx-offline * add SwitchArg for wss-server * add SwitchArg for funasr-onnx-offline * update cmakelist * update funasr-onnx-offline-rtf * add define condition * add gpu define for offlne-stream * update com define * update offline-stream * update cmakelist * update func CompileHotwordEmbedding * add timestamp for paraformer-torch * add C10_USE_GLOG for paraformer-torch * update paraformer-torch * fix func FunASRWfstDecoderInit * update model.h * fix func FunASRWfstDecoderInit * fix tpass_stream * update paraformer-torch * add bladedisc for funasr-onnx-offline * update comdefine * update funasr-wss-server * add log for torch * fix GetValue BLADEDISC * fix log * update cmakelist * update warmup to 10 * update funasrruntime * add batch_size for wss-server * add batch for bins * add batch for offline-stream * add batch for paraformer * add batch for offline-stream * fix func SetBatchSize * add SetBatchSize for model * add SetBatchSize for model * fix func Forward * fix padding * update funasrruntime * add dec reset for batch * set batch default value * add argv for CutSplit * sort frame_queue * sorted msgs * fix FunOfflineInfer * add dynamic batch for fetch * fix FetchDynamic * update run_server.sh * update run_server.sh * cpp http post server support (#1739) * add cpp http server * add some comment * remove some comments * del debug infos * restore run_server.sh * adapt to new model struct * Fix onnxruntime build failure on macOS (#1748) * Add files via upload add macOS build support * Add files via upload add macOS support * Add files via upload target_link_directories(funasr PUBLIC ${ONNXRUNTIME_DIR}/lib) target_link_directories(funasr PUBLIC ${FFMPEG_DIR}/lib) add if(APPLE) guard --------- Co-authored-by: Yabin Li <wucong.lyb@alibaba-inc.com> * Delete docs/images/wechat.png * Add files via upload * fixed the issues about seaco-onnx timestamp * fix bug (#1764) when the speech recognition result contains `http`, punctuation prediction treats it as a URL * fix empty asr result (#1765) for speech segments with an empty decoding result, use an empty string for text * docs * docs * docs * docs * docs * keep empty speech result (#1772) * docs * docs * update wechat QRcode * Add python funasr api support for websocket srv (#1777) * add python funasr_api supoort * change little to README.md * add core tools stream * modified a little * fix bug for timeout * support for buffer decode * add ffmpeg decode for buffer * auto frontend * auto frontend * auto frontend * auto frontend * auto frontend * auto frontend * auto frontend * auto frontend * Dev gzf exp (#1785) * resume from step * batch * batch * batch * batch * batch * batch * batch * batch * batch * batch * batch * batch * batch * batch * batch * train_loss_avg train_acc_avg * train_loss_avg train_acc_avg * train_loss_avg train_acc_avg * log step * wav is not exist * wav is not exist * decoding * decoding * decoding * wechat * decoding key * decoding key * decoding key * decoding key * decoding key * decoding key * dynamic batch * start_data_split_i=0 * total_time/accum_grad * total_time/accum_grad * total_time/accum_grad * update avg slice * update avg slice * sensevoice sanm * sensevoice sanm * sensevoice sanm --------- Co-authored-by: 北念 <lzr265946@alibaba-inc.com> * auto frontend --------- Co-authored-by: 雾聪 <wucong.lyb@alibaba-inc.com> Co-authored-by: zhaomingwork <61895407+zhaomingwork@users.noreply.github.com> Co-authored-by: szsteven008 <97944818+szsteven008@users.noreply.github.com> Co-authored-by: Ephemeroptera <605686962@qq.com> Co-authored-by: 彭震东 <zhendong.peng@qq.com> Co-authored-by: Shi Xian <40013335+R1ckShi@users.noreply.github.com> Co-authored-by: 维石 <shixian.shi@alibaba-inc.com> Co-authored-by: 北念 <lzr265946@alibaba-inc.com>	2024-06-06 09:54:35 +08:00

87 Commits