Commit Graph

4895 Commits

Author SHA1 Message Date
维石
8a03879937 update sensevoice with pitch 2024-09-29 17:37:55 +08:00
游雁
4e57ba7b92 v3 2024-09-26 13:47:00 +08:00
游雁
0076212ac5 v3 2024-09-26 13:42:03 +08:00
游雁
55f06fb4c9 v3 2024-09-26 11:57:32 +08:00
游雁
aaa0325322 v3 2024-09-26 11:52:05 +08:00
游雁
7ce917b596 extract 2024-09-25 20:24:14 +08:00
游雁
20280f5db5 extract 2024-09-25 20:21:45 +08:00
游雁
925227e9f1 extract 2024-09-25 20:19:39 +08:00
游雁
610a28085a token extract 2024-09-25 11:48:28 +08:00
游雁
05eb900af2 token extract 2024-09-25 11:46:12 +08:00
游雁
6d2434f257 token extract 2024-09-25 11:21:13 +08:00
游雁
09bb6d8d03 token extract 2024-09-25 10:52:49 +08:00
志浩
851474632d add extract_token binary 2024-09-24 23:26:34 +08:00
志浩
2892a70cd8 add extract_token binary 2024-09-24 23:10:49 +08:00
志浩
342d781f0a add extract_token binary 2024-09-24 22:56:50 +08:00
志浩
7c3f3e4931 add extract_token binary 2024-09-24 22:50:06 +08:00
志浩
45d82ba6fd add extract_token binary 2024-09-24 22:44:16 +08:00
志浩
b372ab6d74 add extract_token binary 2024-09-24 22:27:25 +08:00
志浩
49903ec044 add support mixture of kaldi_ark or sound 2024-09-24 20:17:06 +08:00
志浩
43af70b129 add support mixture of kaldi_ark or sound 2024-09-24 20:09:30 +08:00
志浩
ef817d0a7d add support mixture of kaldi_ark or sound 2024-09-24 19:59:07 +08:00
志浩
0a65aaf266 add support mixture of kaldi_ark or sound 2024-09-24 19:36:18 +08:00
志浩
d104000f82 add support mixture of kaldi_ark or sound 2024-09-24 18:59:27 +08:00
志浩
4b840fd668 add batch support for token extraction 2024-09-24 17:59:02 +08:00
志浩
752abbb3ca add batch support for token extraction 2024-09-24 17:52:30 +08:00
志浩
c37e04ea49 add batch support for token extraction 2024-09-24 17:45:55 +08:00
志浩
4f96a06d13 add batch support for token extraction 2024-09-24 17:33:41 +08:00
志浩
bc0608d380 add extract token run_mode 2024-09-24 17:20:33 +08:00
志浩
ce5b79d234 add extract token run_mode 2024-09-24 17:15:30 +08:00
志浩
1fb762d9be add extract token run_mode 2024-09-24 17:06:52 +08:00
志浩
0fa8e2976f add extract token run_mode 2024-09-24 17:03:17 +08:00
志浩
6fc9354924 add text ibest writer for SenseVoiceL 2024-09-24 16:32:01 +08:00
志浩
797bd57f91 add text ibest writer for SenseVoiceL 2024-09-24 15:54:42 +08:00
游雁
d45230c6ba batch 2024-09-21 11:46:52 +08:00
游雁
204fcd7900 batch 2024-09-21 11:29:53 +08:00
志浩
829b1f8c72 redo:fix mp3 bug 2024-09-18 23:04:25 +08:00
shixian.shi
9edbcd5420 Merge branch dzh_straming_test into dev_gzf_deepspeed
Title: align others with dev_gzf_deepspeed 

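This code review mainly covers adding a CTC module, optimizing MP3 encoding, simplifying phoneme token handling, and introducing new libraries. The goal is to strengthen the model's speech recognition and text-to-speech capabilities while improving the audio processing logic.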
Link: https://code.alibaba-inc.com/zhifu.gzf/FunASR/codereview/18421381
2024-09-18 22:26:21 +08:00
zhifu.gzf
60bc4176c2 feat: Resolve conflict, auto committed by CodeFlow 2024-09-18 22:23:45 +08:00
志浩
79c188be33 align others with dev_gzf_deepspeed 2024-09-18 18:15:32 +08:00
志浩
7edf6a30d9 align others with dev_gzf_deepspeed 2024-09-18 18:10:07 +08:00
志浩
669fedef82 align others with dev_gzf_deepspeed 2024-09-18 18:04:35 +08:00
志浩
f0da9da31d add lameenc mp3 encoder for 4o 2024-09-18 17:42:07 +08:00
志浩
f32b1c1bdc add cross fade for 4o 2024-09-18 16:51:55 +08:00
志浩
e8ad966fa7 add cross fade for 4o 2024-09-18 16:47:12 +08:00
志浩
c1a4abe273 add cross fade for 4o 2024-09-18 16:44:24 +08:00
志浩
8df772de10 add cross fade for 4o 2024-09-18 16:40:35 +08:00
志浩
f7fa394bb1 add cross fade for 4o 2024-09-18 16:15:34 +08:00
志浩
cf6d7249d7 add cross fade for 4o 2024-09-18 16:09:43 +08:00
志浩
6e4362ac1f add cross fade for 4o 2024-09-18 15:47:50 +08:00
志浩
f3d43be322 add cross fade for 4o 2024-09-18 15:29:51 +08:00