志浩
|
15c13be8f2
|
batchified cfg inference for FM
|
2024-09-10 15:42:11 +08:00 |
|
志浩
|
a24ef81aad
|
add offline inference for e2e tts model in GPT-4o
|
2024-09-10 15:27:33 +08:00 |
|
木守
|
b15b4ca20f
|
update
|
2024-09-10 12:12:29 +08:00 |
|
木守
|
03446f19ec
|
Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed
|
2024-09-10 12:05:43 +08:00 |
|
木守
|
f0280afefc
|
update
|
2024-09-10 12:03:14 +08:00 |
|
志浩
|
bb21822b7f
|
add multi-lingual tiktoken for 25Hz models
|
2024-09-10 11:34:10 +08:00 |
|
木守
|
6e22735f65
|
chore: update model paths in websocket server
|
2024-09-09 20:36:39 +08:00 |
|
木守
|
bd4309ea3c
|
refactor: adjust parameters for websocket server
|
2024-09-09 20:31:08 +08:00 |
|
zhifu.gzf
|
0941f8edad
|
Merge branch dev_gzf_llm2 into dev_gzf_deepspeed
Title: wss
本次代码评审主要加入了时间戳打印,用于追踪WebSocket服务器中语音处理的关键步骤时间点,并新增了一个多轮对话处理的WebSocket服务器实现,包括语音合成和模型推理过程,提高了代码的可追溯性和诊断能力。
Link: https://code.alibaba-inc.com/zhifu.gzf/FunASR/codereview/18228062
|
2024-09-04 15:25:11 +08:00 |
|
游雁
|
b10a1ab523
|
wss
|
2024-09-04 12:53:40 +08:00 |
|
木守
|
245ba8fd45
|
Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed
|
2024-09-03 19:45:35 +08:00 |
|
木守
|
281ebc3202
|
streaming
|
2024-09-03 19:45:29 +08:00 |
|
游雁
|
ef02bf322e
|
wss
|
2024-09-03 14:42:27 +08:00 |
|
木守
|
467056fc2f
|
streaming
|
2024-09-03 14:08:05 +08:00 |
|
yangyexin.yyx
|
6aa4f30555
|
Merge branch llm_dev_gzf into dev_gzf_deepspeed
Title: lora
本次代码评审主要涉及对一个基于WebSocket的语音识别与处理系统的更新,包括添加日志打印、修改默认参数、增加对LoRA模型的支持、调整错误处理逻辑、优化音频处理和文本显示逻辑,以及代码结构和注释的若干改进,旨在提升系统稳定性和用户体验。
Link: https://code.alibaba-inc.com/zhifu.gzf/FunASR/codereview/18202582
|
2024-09-03 13:55:59 +08:00 |
|
zhifu.gzf
|
963472437c
|
feat: Resolve conflict, auto committed by CodeFlow
|
2024-09-03 13:54:42 +08:00 |
|
游雁
|
a9668ad075
|
lora
|
2024-09-03 12:51:16 +08:00 |
|
游雁
|
9f8107ff9e
|
lora
|
2024-09-03 11:48:49 +08:00 |
|
游雁
|
9de6a04db9
|
lora
|
2024-09-03 11:34:07 +08:00 |
|
游雁
|
5a2d8a38ac
|
lora
|
2024-09-03 11:26:25 +08:00 |
|
游雁
|
29717f4361
|
lora
|
2024-09-03 11:23:34 +08:00 |
|
游雁
|
8fb3ce8796
|
ws
|
2024-09-02 19:15:59 +08:00 |
|
木守
|
57729f40a6
|
streaming
|
2024-09-02 19:03:48 +08:00 |
|
木守
|
013f02e3a3
|
streaming
|
2024-09-02 15:49:30 +08:00 |
|
木守
|
561de8db92
|
streaming
|
2024-09-02 15:02:31 +08:00 |
|
木守
|
b60b9b6454
|
streaming
|
2024-09-02 14:27:09 +08:00 |
|
木守
|
b7f0e6894f
|
streaming
|
2024-09-02 14:23:45 +08:00 |
|
zhifu.gzf
|
623fd16f34
|
Merge branch dev_lr_deepspeed into dev_gzf_deepspeed
Title: Add llm tts to client process.
本次代码评审主要增强了 WebSocket 服务端和客户端的语音合成(TTS)功能,添加了语音数据发送、接收处理和计数逻辑,优化了模块结构,引入了`NlsTtsSynthesizer`类来管理语音合成流程,并调整了错误处理和连接管理,使得语音传输更稳定且可追踪。
Link: https://code.alibaba-inc.com/zhifu.gzf/FunASR/codereview/18178626
|
2024-09-02 10:22:55 +08:00 |
|
木守
|
32657bf651
|
streaming asr/s2tt
|
2024-08-29 18:53:24 +08:00 |
|
木守
|
47492abac5
|
streaming asr/s2tt
|
2024-08-29 16:05:08 +08:00 |
|
jichi.lr
|
e99201c9b0
|
Add llm tts to client process.
|
2024-08-29 11:42:20 +08:00 |
|
yangyexin.yyx
|
038f752e58
|
streaming
|
2024-08-28 14:29:04 +08:00 |
|
yangyexin.yyx
|
71b6ecbb39
|
streaming
|
2024-08-28 14:20:14 +08:00 |
|
yangyexin.yyx
|
1eb7507c24
|
Merge branch 'dev_gzf_deepspeed' of http://gitlab.alibaba-inc.com/zhifu.gzf/FunASR into dev_gzf_deepspeed
|
2024-08-28 09:48:41 +08:00 |
|
yangyexin.yyx
|
366603d4ed
|
streaming asr
|
2024-08-28 09:48:06 +08:00 |
|
游雁
|
7674885f5e
|
age gender
|
2024-08-27 16:35:28 +08:00 |
|
游雁
|
032d429a94
|
wss llm
|
2024-08-26 17:52:30 +08:00 |
|
游雁
|
260d037d55
|
wss llm
|
2024-08-26 15:59:35 +08:00 |
|
游雁
|
f2af56b678
|
wss llm
|
2024-08-26 15:00:41 +08:00 |
|
游雁
|
e97ae7f3f8
|
add
|
2024-08-23 15:45:57 +08:00 |
|
游雁
|
f80d5c6398
|
add
|
2024-08-23 15:32:05 +08:00 |
|
游雁
|
70bdbabcb2
|
docs
|
2024-08-22 11:32:22 +08:00 |
|
dcaaaa
|
2d29a079ee
|
deal conflict with datasets
|
2024-08-21 18:31:47 +08:00 |
|
dcaaaa
|
5e6fd09a49
|
add llm semantic vad model code
|
2024-08-21 18:00:08 +08:00 |
|
游雁
|
ed9fd49d46
|
kv
|
2024-08-20 13:56:52 +08:00 |
|
游雁
|
c75040f1be
|
Merge branch 'dev_gzf_deepspeed' of github.com:alibaba-damo-academy/FunASR into dev_gzf_deepspeed
merge
|
2024-08-19 17:28:46 +08:00 |
|
zhifu gao
|
ce0767020c
|
Llm (#2025)
* text2text speech2text
* text2text speech2text
|
2024-08-19 17:24:57 +08:00 |
|
游雁
|
598e22a0a3
|
kv cache
|
2024-08-19 16:01:24 +08:00 |
|
游雁
|
f33138ab2e
|
kv cache
|
2024-08-19 14:57:25 +08:00 |
|
游雁
|
3e0bd69f83
|
kv cache
|
2024-08-19 11:54:49 +08:00 |
|