Commit Graph

95 Commits

Author SHA1 Message Date
维石
e7351db81b update export 2024-05-28 19:07:22 +08:00
zhifu gao
4adb76a6ed
Dev gzf exp (#1707)
* resume from step

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* train_loss_avg train_acc_avg

* train_loss_avg train_acc_avg

* train_loss_avg train_acc_avg

* log step

* wav is not exist

* wav is not exist

* decoding

* decoding

* decoding

* wechat

* decoding key

* decoding key

* decoding key

* decoding key

* decoding key

* decoding key

* dynamic batch
2024-05-08 19:21:58 +08:00
zhifu gao
b1c186fd00
Dev gzf exp (#1700)
* resume from step

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* batch

* train_loss_avg train_acc_avg

* train_loss_avg train_acc_avg

* train_loss_avg train_acc_avg

* log step

* wav is not exist

* wav is not exist

* decoding

* decoding

* decoding

* wechat

* decoding key

* decoding key

* decoding key

* decoding key

* decoding key
2024-05-08 00:31:29 +08:00
jianganghan
f8b4924060
fix bug for blank audio (#1656) 2024-04-25 10:42:19 +08:00
zhifu gao
861147c730
Dev gzf exp (#1654)
* sensevoice finetune

* sensevoice finetune

* sensevoice finetune

* sensevoice finetune

* sensevoice finetune

* sensevoice finetune

* sensevoice finetune

* sensevoice finetune

* sensevoice finetune

* sensevoice finetune

* bugfix

* update with main (#1631)

* update seaco finetune

* v1.0.24

---------

Co-authored-by: 维石 <shixian.shi@alibaba-inc.com>

* sensevoice

* sensevoice

* sensevoice

* update with main (#1638)

* update seaco finetune

* v1.0.24

* update rwkv template

---------

Co-authored-by: 维石 <shixian.shi@alibaba-inc.com>

* sensevoice

* sensevoice

* sensevoice

* sensevoice

* sensevoice

* sensevoice

* sensevoice

* sensevoice

* sensevoice

* sensevoice

* sensevoice

* sensevoice

* sensevoice

* sensevoice

* sensevoice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* sense voice

* whisper

* whisper

* update style

* update style

---------

Co-authored-by: 维石 <shixian.shi@alibaba-inc.com>
2024-04-24 16:03:38 +08:00
维石
dee1354d0d empty result bug fix 2024-04-19 14:57:31 +08:00
zhifu gao
eaf9dda9e4
Dev gzf exp (#1624)
* sensevoice finetune

* sensevoice finetune

* sensevoice finetune

* sensevoice finetune

* sensevoice finetune
2024-04-17 15:05:37 +08:00
游雁
da340e6a6c add 2024-04-12 15:01:54 +08:00
gaochangfeng
3260fb879b
Dev gcf (#1611)
* Add default output format constraints for Speech and BGM

* Allow merging of VAD segments during inference

* fix

---------

Co-authored-by: 常材 <gaochangfeng.gcf@alibaba-inc.com>
2024-04-12 11:37:22 +08:00
zhifu gao
6fa8ee48e1
Dev gzf new (#1567)
* train

* train

* train

* train

* train

* train

* train

* train

* train

* train

* train

* train

* train

* train

* train

* train

* train

* train

* train

* train

* whisper_lib for sense voice

* aishell recipe
2024-03-30 10:13:42 +08:00
游雁
e84f17adca update 2024-03-26 12:34:26 +08:00
游雁
447222c00e install requirements automatically 2024-03-25 12:37:35 +08:00
游雁
8c1016ca77 install requirements automatically 2024-03-25 11:48:17 +08:00
游雁
d29f201e32 vad conf 2024-03-19 12:04:50 +08:00
游雁
60a7d39f39 vad conf 2024-03-19 11:56:57 +08:00
游雁
5a637c6995 vad conf 2024-03-19 11:45:09 +08:00
游雁
fae73ee414 vad conf 2024-03-19 11:14:59 +08:00
游雁
bab0675c36 ffmpeg 2024-03-18 15:22:14 +08:00
zhifu gao
5023dd0422
Dev gzf llm (#1503)
* update

* update

* update

* update onnx

* update with main (#1492)

* contextual&seaco ONNX export (#1481)

* contextual&seaco ONNX export

* update ContextualEmbedderExport2

* update ContextualEmbedderExport2

* update code

* onnx (#1482)

* qwenaudio qwenaudiochat

* qwenaudio qwenaudiochat

* whisper

* whisper

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* export onnx

* export onnx

* export onnx

* dingding

* dingding

* llm

* doc

* onnx

* onnx

* onnx

* onnx

* onnx

* onnx

* v1.0.15

* qwenaudio

* qwenaudio

* issue doc

* update

* update

* bugfix

* onnx

* update export calling

* update codes

* remove useless code

* update code

---------

Co-authored-by: zhifu gao <zhifu.gzf@alibaba-inc.com>

* acknowledge

---------

Co-authored-by: Shi Xian <40013335+R1ckShi@users.noreply.github.com>

* update onnx

* update onnx

* train update

* train update

* train update

* train update

---------

Co-authored-by: Shi Xian <40013335+R1ckShi@users.noreply.github.com>
2024-03-15 16:24:29 +08:00
zhifu gao
35b1c051f6
Dev gzf llm (#1493)
* update

* update

* update

* update onnx

* update with main (#1492)

* contextual&seaco ONNX export (#1481)

* contextual&seaco ONNX export

* update ContextualEmbedderExport2

* update ContextualEmbedderExport2

* update code

* onnx (#1482)

* qwenaudio qwenaudiochat

* qwenaudio qwenaudiochat

* whisper

* whisper

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* export onnx

* export onnx

* export onnx

* dingding

* dingding

* llm

* doc

* onnx

* onnx

* onnx

* onnx

* onnx

* onnx

* v1.0.15

* qwenaudio

* qwenaudio

* issue doc

* update

* update

* bugfix

* onnx

* update export calling

* update codes

* remove useless code

* update code

---------

Co-authored-by: zhifu gao <zhifu.gzf@alibaba-inc.com>

* acknowledge

---------

Co-authored-by: Shi Xian <40013335+R1ckShi@users.noreply.github.com>

* update onnx

* update onnx

---------

Co-authored-by: Shi Xian <40013335+R1ckShi@users.noreply.github.com>
2024-03-14 09:33:30 +08:00
zhifu gao
c3192dffdd
Dev gzf (#1480)
* qwenaudio qwenaudiochat

* qwenaudio qwenaudiochat

* whisper

* whisper

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* export onnx

* export onnx

* export onnx

* dingding

* dingding

* llm

* doc

* onnx

* onnx

* onnx

* onnx

* onnx

* onnx

* v1.0.15

* qwenaudio

* qwenaudio

* issue doc

* update

* update

* bugfix
2024-03-12 17:27:02 +08:00
zhifu gao
15c4709beb
onnx (#1473)
* qwenaudio qwenaudiochat

* qwenaudio qwenaudiochat

* whisper

* whisper

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* export onnx

* export onnx

* export onnx

* dingding

* dingding

* llm

* doc

* onnx

* onnx

* onnx

* onnx
2024-03-11 22:04:03 +08:00
zhifu gao
cc59310dbf
Dev gzf (#1469)
* qwenaudio qwenaudiochat

* qwenaudio qwenaudiochat

* whisper

* whisper

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* export onnx

* export onnx

* export onnx

* dingding

* dingding

* llm

* doc

* onnx

* onnx

* onnx
2024-03-11 19:37:50 +08:00
zhifu gao
e847f85a14
Dev gzf (#1468)
* qwenaudio qwenaudiochat

* qwenaudio qwenaudiochat

* whisper

* whisper

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* export onnx

* export onnx

* export onnx

* dingding

* dingding

* llm

* doc

* onnx

* onnx
2024-03-11 19:32:07 +08:00
zhifu gao
a7d7a0f3a2
Dev gzf (#1467)
* qwenaudio qwenaudiochat

* qwenaudio qwenaudiochat

* whisper

* whisper

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* export onnx

* export onnx

* export onnx

* dingding

* dingding

* llm

* doc

* onnx
2024-03-11 19:24:44 +08:00
zhifu gao
9d48230c4f
export onnx (#1457)
* qwenaudio qwenaudiochat

* qwenaudio qwenaudiochat

* whisper

* whisper

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* export onnx

* export onnx
2024-03-11 10:48:50 +08:00
zhifu gao
f2d8ded57f
export onnx (#1455)
* qwenaudio qwenaudiochat

* qwenaudio qwenaudiochat

* whisper

* whisper

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* llm

* export onnx
2024-03-11 01:24:43 +08:00
zhifu gao
753d579531
Dev gzf (#1428)
* bugfix v1.0.13

* qwenaudio qwenaudiochat

* v1.0.14
2024-03-05 17:58:35 +08:00
zhifu gao
790bf54944
Dev gzf (#1422)
* fixbug

* qwenaudio

* qwenaudio whisper-openai v1.0.12
2024-03-04 20:35:06 +08:00
zhifu gao
cae52c52f3
Revert "Revert "Dev yf" (#1418)" (#1420)
This reverts commit d2c1204d91.
2024-03-04 18:43:26 +08:00
zhifu gao
d2c1204d91
Revert "Dev yf" (#1418) 2024-03-04 17:50:29 +08:00
语帆
920331972a commit 2024-03-04 17:47:25 +08:00
游雁
cc41a9ee88 whisper 2024-03-01 14:58:36 +08:00
语帆
6d7b945710 atsr 2024-03-01 11:10:44 +08:00
语帆
21b49bd56f atsr 2024-03-01 10:36:27 +08:00
语帆
ec98a8e138 atsr 2024-03-01 10:26:23 +08:00
语帆
8f63be3af7 atsr 2024-03-01 10:16:03 +08:00
jianganghan
f272eb4ef7
Fix two bugs for blank voice (empty speech): (#1403)
First, raw text was always returned regardless of the value of the return_raw_text parameter.
Second, empty text from the punc model caused an error during sentence assembly when sentence_timestamp was set to true.
2024-02-29 09:35:34 +08:00
zhifu gao
b9cfd9953a
Dev gzf (#1402)
* init param
2024-02-28 20:44:21 +08:00
语帆
e59ec16e6a test 2024-02-28 16:56:58 +08:00
语帆
a88b51c544 test 2024-02-28 16:04:35 +08:00
语帆
ecd9e74b6e test 2024-02-28 16:00:44 +08:00
语帆
52fee96d71 test 2024-02-28 15:31:14 +08:00
语帆
debafeac37 test 2024-02-28 15:23:07 +08:00
Shi Xian
b52fc0ec28
Merge pull request #1401 from alibaba-damo-academy/dev_gzf
init param
2024-02-28 14:39:54 +08:00
游雁
7a4816651f init param 2024-02-28 14:38:05 +08:00
Shi Xian
ba589e05c1
Merge pull request #1393 from alibaba-damo-academy/dev_gzf
Dev gzf
2024-02-27 10:43:27 +08:00
shixian.shi
9844be44e9 bug fix for empty text 2024-02-27 10:36:40 +08:00
Yuming Zhang
7759ab5feb
fix bug: disable_pbar=True can be passed at model initialization (#1387)
Co-authored-by: 张玉明 <zhangyuming@wepie.com>
2024-02-23 18:28:26 +08:00
语帆
d60306e7a4 test 2024-02-23 16:47:15 +08:00