Commit Graph

4818 Commits

Author SHA1 Message Date
志浩
8a06c8d44e cut paragraph for streaming s2s 2024-09-13 10:36:34 +08:00
志浩
4b65ccee2a cut paragraph for streaming s2s 2024-09-12 19:51:43 +08:00
志浩
77b75ae3b2 cut paragraph for streaming s2s 2024-09-12 19:34:42 +08:00
志浩
7a9e0545a9 cut paragraph for streaming s2s 2024-09-12 19:19:38 +08:00
志浩
2617c07387 Merge branch 'dev_gzf_deepspeed' of gitlab.alibaba-inc.com:zhifu.gzf/FunASR into dev_gzf_deepspeed 2024-09-12 19:08:55 +08:00
志浩
ba736edb14 cut paragraph for streaming s2s 2024-09-12 19:08:47 +08:00
木守
1b9300a4c3 Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed 2024-09-12 19:03:07 +08:00
木守
4019430ff9 update 2024-09-12 19:03:03 +08:00
游雁
f4b5af8473 Merge branch 'dev_gzf_deepspeed' of gitlab.alibaba-inc.com:zhifu.gzf/FunASR into dev_gzf_deepspeed
merge
2024-09-12 17:50:48 +08:00
游雁
e412f73f9c speech2speech 2024-09-12 17:50:25 +08:00
木守
4ca208e061 Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed 2024-09-12 17:47:19 +08:00
木守
a521ebc97c update 2024-09-12 17:47:14 +08:00
游雁
881b3f1661 speech2speech 2024-09-12 17:16:34 +08:00
志浩
3cfffe28a7 fix volume bug for mp3 converter 2024-09-12 16:54:46 +08:00
志浩
cce50009b0 merge mp3 converter 2024-09-12 15:41:11 +08:00
志浩
a6f10797ba add mp3 converter 2024-09-12 15:39:27 +08:00
游雁
a131fc9038 speech2speech 2024-09-12 15:06:48 +08:00
游雁
e9fb52d788 Merge branch 'dev_gzf_deepspeed' of gitlab.alibaba-inc.com:zhifu.gzf/FunASR into dev_gzf_deepspeed
merge
2024-09-12 14:21:50 +08:00
游雁
a79efa3b8d wss sdk bug 2024-09-12 14:21:39 +08:00
志浩
1587194e75 fix streaming speech gen bug 2024-09-12 11:45:45 +08:00
志浩
d7bc9c54b6 Merge branch 'dev_gzf_deepspeed' of gitlab.alibaba-inc.com:zhifu.gzf/FunASR into dev_gzf_deepspeed 2024-09-12 10:19:23 +08:00
志浩
880154aa33 streaming generate v3 2024-09-12 10:18:37 +08:00
木守
bc693221f1 refactor: update file path comments 2024-09-11 21:38:49 +08:00
志浩
87fc94dc56 call streaming speech generation 2024-09-11 20:44:13 +08:00
志浩
d6e401b2da call streaming speech generation 2024-09-11 20:37:11 +08:00
志浩
397977f4ae Merge branch 'dev_gzf_deepspeed' of gitlab.alibaba-inc.com:zhifu.gzf/FunASR into dev_gzf_deepspeed 2024-09-11 20:30:23 +08:00
志浩
94a94c4247 add simulated streaming inference for gpt-4o s2s chat 2024-09-11 20:29:50 +08:00
木守
68c770f67c update 2024-09-11 17:57:48 +08:00
木守
9e69c9fa6a Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed 2024-09-11 17:34:37 +08:00
木守
8ce7dad057 update 2024-09-11 17:34:33 +08:00
游雁
eebe29719b speech2speech 2024-09-11 17:02:52 +08:00
游雁
bdd66d1865 Merge branch 'dev_gzf_deepspeed' of gitlab.alibaba-inc.com:zhifu.gzf/FunASR into dev_gzf_deepspeed
merge
2024-09-11 16:16:24 +08:00
游雁
d46c2df2e1 speech2speech 2024-09-11 16:15:35 +08:00
木守
a333073086 update 2024-09-11 16:04:19 +08:00
木守
08865c2be4 Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed 2024-09-11 15:52:48 +08:00
木守
c0782e19a8 update 2024-09-11 15:52:44 +08:00
游雁
6c51cc2e0a speech2speech 2024-09-11 12:30:03 +08:00
zhifu.gzf
11dcc53e40 Merge branch dzh_tts_4o into dev_gzf_deepspeed
Title: add time counter for FM 

本次代码评审主要涉及在多个文件中添加了时间记录(使用`time.time()`),引入了新的模型类`LLMASRXvecSlotTTS`,增强了文本到语音(TTS)模型的功能,包括处理外部提示(outside_prompt),优化了模型结构和推理流程,并在多处进行了代码结构和功能的调整,以支持更复杂的模型训练和推理过程,特别是加入了对模型性能的监控和不同模块的融合。
Link: https://code.alibaba-inc.com/zhifu.gzf/FunASR/codereview/18311489
2024-09-10 16:34:12 +08:00
neo.dzh
069665ef83 feat: Resolve conflict, auto committed by CodeFlow 2024-09-10 16:28:02 +08:00
志浩
b0f4cdc8e6 add time counter for FM 2024-09-10 15:49:30 +08:00
志浩
15c13be8f2 batchified cfg inference for FM 2024-09-10 15:42:11 +08:00
志浩
a24ef81aad add offline inference for e2e tts model in GPT-4o 2024-09-10 15:27:33 +08:00
木守
b15b4ca20f update 2024-09-10 12:12:29 +08:00
木守
03446f19ec Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed 2024-09-10 12:05:43 +08:00
木守
f0280afefc update 2024-09-10 12:03:14 +08:00
志浩
bb21822b7f add multi-lingual tiktoken for 25Hz models 2024-09-10 11:34:10 +08:00
木守
6e22735f65 chore: update model paths in websocket server 2024-09-09 20:36:39 +08:00
木守
bd4309ea3c refactor: adjust parameters for websocket server 2024-09-09 20:31:08 +08:00
zhifu.gzf
0941f8edad Merge branch dev_gzf_llm2 into dev_gzf_deepspeed
Title: wss 

本次代码评审主要加入了时间戳打印,用于追踪WebSocket服务器中语音处理的关键步骤时间点,并新增了一个多轮对话处理的WebSocket服务器实现,包括语音合成和模型推理过程,提高了代码的可追溯性和诊断能力。
Link: https://code.alibaba-inc.com/zhifu.gzf/FunASR/codereview/18228062
2024-09-04 15:25:11 +08:00
游雁
b10a1ab523 wss 2024-09-04 12:53:40 +08:00