Update Dataset.md

This commit is contained in:
yhliang 2023-09-27 10:17:26 +08:00 committed by GitHub
parent 9366bd9bcf
commit 3b8cff497f
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

View File

@ -21,4 +21,5 @@ All transcriptions of the speech data are prepared in TextGrid format for each s
The three dataset for training mentioned above can be downloaded at [OpenSLR](https://openslr.org/resources.php). The participants can download via the following links. Particularly, in the baseline we provide convenient data preparation scripts for AliMeeting corpus. The three dataset for training mentioned above can be downloaded at [OpenSLR](https://openslr.org/resources.php). The participants can download via the following links. Particularly, in the baseline we provide convenient data preparation scripts for AliMeeting corpus.
- [AliMeeting](https://openslr.org/119/) - [AliMeeting](https://openslr.org/119/)
- [AISHELL-4](https://openslr.org/111/) - [AISHELL-4](https://openslr.org/111/)
- [CN-Celeb](https://openslr.org/82/) - [CN-Celeb](https://openslr.org/82/)
Now, the new test set is available [here](https://speech-lab-share-data.oss-cn-shanghai.aliyuncs.com/AliMeeting/openlr/Test_2023_Ali.tar.gz)