维石
|
8a03879937
|
update sensevoice with pitch
|
2024-09-29 17:37:55 +08:00 |
|
游雁
|
4e57ba7b92
|
v3
|
2024-09-26 13:47:00 +08:00 |
|
游雁
|
0076212ac5
|
v3
|
2024-09-26 13:42:03 +08:00 |
|
游雁
|
55f06fb4c9
|
v3
|
2024-09-26 11:57:32 +08:00 |
|
游雁
|
aaa0325322
|
v3
|
2024-09-26 11:52:05 +08:00 |
|
游雁
|
7ce917b596
|
extract
|
2024-09-25 20:24:14 +08:00 |
|
游雁
|
20280f5db5
|
extract
|
2024-09-25 20:21:45 +08:00 |
|
游雁
|
925227e9f1
|
extract
|
2024-09-25 20:19:39 +08:00 |
|
游雁
|
610a28085a
|
token extract
|
2024-09-25 11:48:28 +08:00 |
|
游雁
|
05eb900af2
|
token extract
|
2024-09-25 11:46:12 +08:00 |
|
游雁
|
6d2434f257
|
token extract
|
2024-09-25 11:21:13 +08:00 |
|
游雁
|
09bb6d8d03
|
token extract
|
2024-09-25 10:52:49 +08:00 |
|
志浩
|
851474632d
|
add extract_token binary
|
2024-09-24 23:26:34 +08:00 |
|
志浩
|
2892a70cd8
|
add extract_token binary
|
2024-09-24 23:10:49 +08:00 |
|
志浩
|
342d781f0a
|
add extract_token binary
|
2024-09-24 22:56:50 +08:00 |
|
志浩
|
7c3f3e4931
|
add extract_token binary
|
2024-09-24 22:50:06 +08:00 |
|
志浩
|
45d82ba6fd
|
add extract_token binary
|
2024-09-24 22:44:16 +08:00 |
|
志浩
|
b372ab6d74
|
add extract_token binary
|
2024-09-24 22:27:25 +08:00 |
|
志浩
|
49903ec044
|
add support mixture of kaldi_ark or sound
|
2024-09-24 20:17:06 +08:00 |
|
志浩
|
43af70b129
|
add support mixture of kaldi_ark or sound
|
2024-09-24 20:09:30 +08:00 |
|
志浩
|
ef817d0a7d
|
add support mixture of kaldi_ark or sound
|
2024-09-24 19:59:07 +08:00 |
|
志浩
|
0a65aaf266
|
add support mixture of kaldi_ark or sound
|
2024-09-24 19:36:18 +08:00 |
|
志浩
|
d104000f82
|
add support mixture of kaldi_ark or sound
|
2024-09-24 18:59:27 +08:00 |
|
志浩
|
4b840fd668
|
add batch support for token extraction
|
2024-09-24 17:59:02 +08:00 |
|
志浩
|
752abbb3ca
|
add batch support for token extraction
|
2024-09-24 17:52:30 +08:00 |
|
志浩
|
c37e04ea49
|
add batch support for token extraction
|
2024-09-24 17:45:55 +08:00 |
|
志浩
|
4f96a06d13
|
add batch support for token extraction
|
2024-09-24 17:33:41 +08:00 |
|
志浩
|
bc0608d380
|
add extract token run_mode
|
2024-09-24 17:20:33 +08:00 |
|
志浩
|
ce5b79d234
|
add extract token run_mode
|
2024-09-24 17:15:30 +08:00 |
|
志浩
|
1fb762d9be
|
add extract token run_mode
|
2024-09-24 17:06:52 +08:00 |
|
志浩
|
0fa8e2976f
|
add extract token run_mode
|
2024-09-24 17:03:17 +08:00 |
|
志浩
|
6fc9354924
|
add text ibest writer for SenseVoiceL
|
2024-09-24 16:32:01 +08:00 |
|
志浩
|
797bd57f91
|
add text ibest writer for SenseVoiceL
|
2024-09-24 15:54:42 +08:00 |
|
游雁
|
d45230c6ba
|
batch
|
2024-09-21 11:46:52 +08:00 |
|
游雁
|
204fcd7900
|
batch
|
2024-09-21 11:29:53 +08:00 |
|
志浩
|
829b1f8c72
|
redo:fix mp3 bug
|
2024-09-18 23:04:25 +08:00 |
|
shixian.shi
|
9edbcd5420
|
Merge branch dzh_straming_test into dev_gzf_deepspeed
Title: align others with dev_gzf_deepspeed
本次代码评审主要涉及添加CTC模块、优化MP3编码处理、简化音素令牌处理和引入新库,旨在增强模型的语音识别与文本转语音功能,同时改进音频处理逻辑。
Link: https://code.alibaba-inc.com/zhifu.gzf/FunASR/codereview/18421381
|
2024-09-18 22:26:21 +08:00 |
|
zhifu.gzf
|
60bc4176c2
|
feat: Resolve conflict, auto committed by CodeFlow
|
2024-09-18 22:23:45 +08:00 |
|
志浩
|
79c188be33
|
align others with dev_gzf_deepspeed
|
2024-09-18 18:15:32 +08:00 |
|
志浩
|
7edf6a30d9
|
align others with dev_gzf_deepspeed
|
2024-09-18 18:10:07 +08:00 |
|
志浩
|
669fedef82
|
align others with dev_gzf_deepspeed
|
2024-09-18 18:04:35 +08:00 |
|
志浩
|
f0da9da31d
|
add lameenc mp3 encoder for 4o
|
2024-09-18 17:42:07 +08:00 |
|
志浩
|
f32b1c1bdc
|
add cross fade for 4o
|
2024-09-18 16:51:55 +08:00 |
|
志浩
|
e8ad966fa7
|
add cross fade for 4o
|
2024-09-18 16:47:12 +08:00 |
|
志浩
|
c1a4abe273
|
add cross fade for 4o
|
2024-09-18 16:44:24 +08:00 |
|
志浩
|
8df772de10
|
add cross fade for 4o
|
2024-09-18 16:40:35 +08:00 |
|
志浩
|
f7fa394bb1
|
add cross fade for 4o
|
2024-09-18 16:15:34 +08:00 |
|
志浩
|
cf6d7249d7
|
add cross fade for 4o
|
2024-09-18 16:09:43 +08:00 |
|
志浩
|
6e4362ac1f
|
add cross fade for 4o
|
2024-09-18 15:47:50 +08:00 |
|
志浩
|
f3d43be322
|
add cross fade for 4o
|
2024-09-18 15:29:51 +08:00 |
|