志浩
7a9e0545a9
cut paragraph for streaming s2s
2024-09-12 19:19:38 +08:00
志浩
2617c07387
Merge branch 'dev_gzf_deepspeed' of gitlab.alibaba-inc.com:zhifu.gzf/FunASR into dev_gzf_deepspeed
2024-09-12 19:08:55 +08:00
志浩
ba736edb14
cut paragraph for streaming s2s
2024-09-12 19:08:47 +08:00
木守
1b9300a4c3
Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed
2024-09-12 19:03:07 +08:00
木守
4019430ff9
update
2024-09-12 19:03:03 +08:00
游雁
f4b5af8473
Merge branch 'dev_gzf_deepspeed' of gitlab.alibaba-inc.com:zhifu.gzf/FunASR into dev_gzf_deepspeed
...
merge
2024-09-12 17:50:48 +08:00
游雁
e412f73f9c
speech2speech
2024-09-12 17:50:25 +08:00
木守
4ca208e061
Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed
2024-09-12 17:47:19 +08:00
木守
a521ebc97c
update
2024-09-12 17:47:14 +08:00
游雁
881b3f1661
speech2speech
2024-09-12 17:16:34 +08:00
志浩
3cfffe28a7
fix volume bug for mp3 converter
2024-09-12 16:54:46 +08:00
志浩
cce50009b0
merge mp3 converter
2024-09-12 15:41:11 +08:00
志浩
a6f10797ba
add mp3 converter
2024-09-12 15:39:27 +08:00
游雁
a131fc9038
speech2speech
2024-09-12 15:06:48 +08:00
游雁
e9fb52d788
Merge branch 'dev_gzf_deepspeed' of gitlab.alibaba-inc.com:zhifu.gzf/FunASR into dev_gzf_deepspeed
...
merge
2024-09-12 14:21:50 +08:00
游雁
a79efa3b8d
wss sdk bug
2024-09-12 14:21:39 +08:00
志浩
1587194e75
fix streaming speech gen bug
2024-09-12 11:45:45 +08:00
志浩
d7bc9c54b6
Merge branch 'dev_gzf_deepspeed' of gitlab.alibaba-inc.com:zhifu.gzf/FunASR into dev_gzf_deepspeed
2024-09-12 10:19:23 +08:00
志浩
880154aa33
streaming generate v3
2024-09-12 10:18:37 +08:00
木守
bc693221f1
refactor: update file path comments
2024-09-11 21:38:49 +08:00
志浩
87fc94dc56
call streaming speech generation
2024-09-11 20:44:13 +08:00
志浩
d6e401b2da
call streaming speech generation
2024-09-11 20:37:11 +08:00
志浩
397977f4ae
Merge branch 'dev_gzf_deepspeed' of gitlab.alibaba-inc.com:zhifu.gzf/FunASR into dev_gzf_deepspeed
2024-09-11 20:30:23 +08:00
志浩
94a94c4247
add simulated streaming inference for gpt-4o s2s chat
2024-09-11 20:29:50 +08:00
木守
68c770f67c
update
2024-09-11 17:57:48 +08:00
木守
9e69c9fa6a
Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed
2024-09-11 17:34:37 +08:00
木守
8ce7dad057
update
2024-09-11 17:34:33 +08:00
游雁
eebe29719b
speech2speech
2024-09-11 17:02:52 +08:00
游雁
bdd66d1865
Merge branch 'dev_gzf_deepspeed' of gitlab.alibaba-inc.com:zhifu.gzf/FunASR into dev_gzf_deepspeed
...
merge
2024-09-11 16:16:24 +08:00
游雁
d46c2df2e1
speech2speech
2024-09-11 16:15:35 +08:00
木守
a333073086
update
2024-09-11 16:04:19 +08:00
木守
08865c2be4
Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed
2024-09-11 15:52:48 +08:00
木守
c0782e19a8
update
2024-09-11 15:52:44 +08:00
游雁
6c51cc2e0a
speech2speech
2024-09-11 12:30:03 +08:00
zhifu.gzf
11dcc53e40
Merge branch dzh_tts_4o into dev_gzf_deepspeed
...
Title: add time counter for FM
本次代码评审主要涉及在多个文件中添加了时间记录(使用`time.time()`),引入了新的模型类`LLMASRXvecSlotTTS`,增强了文本到语音(TTS)模型的功能,包括处理外部提示(outside_prompt),优化了模型结构和推理流程,并在多处进行了代码结构和功能的调整,以支持更复杂的模型训练和推理过程,特别是加入了对模型性能的监控和不同模块的融合。
Link: https://code.alibaba-inc.com/zhifu.gzf/FunASR/codereview/18311489
2024-09-10 16:34:12 +08:00
neo.dzh
069665ef83
feat: Resolve conflict, auto committed by CodeFlow
2024-09-10 16:28:02 +08:00
志浩
b0f4cdc8e6
add time counter for FM
2024-09-10 15:49:30 +08:00
志浩
15c13be8f2
batchified cfg inference for FM
2024-09-10 15:42:11 +08:00
志浩
a24ef81aad
add offline inference for e2e tts model in GPT-4o
2024-09-10 15:27:33 +08:00
木守
b15b4ca20f
update
2024-09-10 12:12:29 +08:00
木守
03446f19ec
Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed
2024-09-10 12:05:43 +08:00
木守
f0280afefc
update
2024-09-10 12:03:14 +08:00
志浩
bb21822b7f
add multi-lingual tiktoken for 25Hz models
2024-09-10 11:34:10 +08:00
木守
6e22735f65
chore: update model paths in websocket server
2024-09-09 20:36:39 +08:00
木守
bd4309ea3c
refactor: adjust parameters for websocket server
2024-09-09 20:31:08 +08:00
zhifu.gzf
0941f8edad
Merge branch dev_gzf_llm2 into dev_gzf_deepspeed
...
Title: wss
本次代码评审主要加入了时间戳打印,用于追踪WebSocket服务器中语音处理的关键步骤时间点,并新增了一个多轮对话处理的WebSocket服务器实现,包括语音合成和模型推理过程,提高了代码的可追溯性和诊断能力。
Link: https://code.alibaba-inc.com/zhifu.gzf/FunASR/codereview/18228062
2024-09-04 15:25:11 +08:00
游雁
b10a1ab523
wss
2024-09-04 12:53:40 +08:00
木守
245ba8fd45
Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed
2024-09-03 19:45:35 +08:00
木守
281ebc3202
streaming
2024-09-03 19:45:29 +08:00
游雁
ef02bf322e
wss
2024-09-03 14:42:27 +08:00