update m2met docs

This commit is contained in:
yhliang 2023-05-10 16:54:45 +08:00
parent f77c5803f4
commit 1a6334f9dc
47 changed files with 255 additions and 135 deletions

View File

@ -22,7 +22,7 @@
## What's new:
### Multi-Channel Multi-Party Meeting Transcription 2.0 (M2MET2.0) Challenge
We are pleased to announce that the M2MeT2.0 challenge will be held in the near future. The baseline system is conducted on FunASR and is provided as a receipe of AliMeeting corpus. For more details you can see the guidence of M2MET2.0 ([CN](https://alibaba-damo-academy.github.io/FunASR/m2met2_cn/index.html)/[EN](https://alibaba-damo-academy.github.io/FunASR/m2met2/index.html)).
We are pleased to announce that the M2MeT2.0 challenge has been accepted by the ASRU2023 challenge special session. The registration is now open. The baseline system is conducted on FunASR and is provided as a receipe of AliMeeting corpus. For more details you can see the guidence of M2MET2.0 ([CN](https://alibaba-damo-academy.github.io/FunASR/m2met2_cn/index.html)/[EN](https://alibaba-damo-academy.github.io/FunASR/m2met2/index.html)).
### Release notes
For the release notes, please ref to [news](https://github.com/alibaba-damo-academy/FunASR/releases)

View File

@ -1,11 +1,31 @@
# Baseline
## Overview
We will release an E2E SA-ASR~\cite{kanda21b_interspeech} baseline conducted on [FunASR](https://github.com/alibaba-damo-academy/FunASR) at the time according to the timeline. The model architecture is shown in Figure 3. The SpeakerEncoder is initialized with a pre-trained speaker verification model from ModelScope. This speaker verification model is also be used to extract the speaker embedding in the speaker profile.
We will release an E2E SA-ASR baseline conducted on [FunASR](https://github.com/alibaba-damo-academy/FunASR) at the time according to the timeline. The model architecture is shown in Figure 3. The SpeakerEncoder is initialized with a pre-trained speaker verification model from ModelScope. This speaker verification model is also be used to extract the speaker embedding in the speaker profile.
![model archietecture](images/sa_asr_arch.png)
## Quick start
#TODO: fill with the README.md of the baseline
To run the baseline, first you need to install FunASR and ModelScope. ([installation](https://alibaba-damo-academy.github.io/FunASR/en/installation.html))
There are two startup scripts, `run.sh` for training and evaluating on the old eval and test sets, and `run_m2met_2023_infer.sh` for inference on the new test set of the Multi-Channel Multi-Party Meeting Transcription 2.0 ([M2MET2.0](https://alibaba-damo-academy.github.io/FunASR/m2met2/index.html)) Challenge.
Before running `run.sh`, you must manually download and unpack the [AliMeeting](http://www.openslr.org/119/) corpus and place it in the `./dataset` directory:
```shell
dataset
|—— Eval_Ali_far
|—— Eval_Ali_near
|—— Test_Ali_far
|—— Test_Ali_near
|—— Train_Ali_far
|—— Train_Ali_near
Before running `run_m2met_2023_infer.sh`, you need to place the new test set `Test_2023_Ali_far` (to be released after the challenge starts) in the `./dataset` directory, which contains only raw audios. Then put the given `wav.scp`, `wav_raw.scp`, `segments`, `utt2spk` and `spk2utt` in the `./data/Test_2023_Ali_far` directory.
```shell
data/Test_2023_Ali_far
|—— wav.scp
|—— wav_raw.scp
|—— segments
|—— utt2spk
|—— spk2utt
```
For more details you can see [here](https://github.com/alibaba-damo-academy/FunASR/blob/main/egs/alimeeting/sa-asr/README.md)
## Baseline results
The results of the baseline system are shown in Table 3. The speaker profile adopts the oracle speaker embedding during training. However, due to the lack of oracle speaker label during evaluation, the speaker profile provided by an additional spectral clustering is used. Meanwhile, the results of using the oracle speaker profile on Eval and Test Set are also provided to show the impact of speaker profile accuracy.

View File

@ -13,15 +13,15 @@ Building on the success of the previous M2MeT challenge, we are excited to propo
- $ May~8, 2023: $ Baseline release.
- $ May~15, 2023: $ Registration deadline, the due date for participants to join the Challenge.
- $ June~9, 2023: $ Test data release and leaderboard open.
- $ June~13, 2023: $ Final submission deadline.
- $ June~13, 2023: $ Final submission deadline and leaderboar close.
- $ June~19, 2023: $ Evaluation result and ranking release.
- $ July~3, 2023: $ Deadline for paper submission.
- $ July~10, 2023: $ Deadline for final paper submission.
- $ December~12\ to\ 16, 2023: $ ASRU Workshop and challenge session
- $ December~12\ to\ 16, 2023: $ ASRU Workshop and challenge Session
## Guidelines
Interested participants, whether from academia or industry, must register for the challenge by completing the Google form below. The deadline for registration is May 15, 2023.
Interested participants, whether from academia or industry, must register for the challenge by completing the Google form below. The deadline for registration is May 22, 2023. Participants are also welcome to join the [wechat group](https://alibaba-damo-academy.github.io/FunASR/m2met2/Contact.html) of M2MET2.0 and keep up to date with the latest updates about the challenge.
[M2MET2.0 Registration](https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link)

View File

@ -1,4 +1,4 @@
# Sphinx build info version 1
# This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done.
config: 9907eab6bf227ca0fc6db297f26919da
config: a62852d90c3e533904d811bbf85f977d
tags: 645f666f9bcd5a90fca523b33c5a78b7

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Baseline &#8212; m2met2 documentation</title>
<title>Baseline &#8212; MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -44,7 +44,7 @@
<li class="right" >
<a href="Track_setting_and_evaluation.html" title="Track &amp; Evaluation"
accesskey="P">previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Baseline</a></li>
</ul>
</div>
@ -55,7 +55,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 documentation</a>
index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -126,12 +126,32 @@
<h1>Baseline<a class="headerlink" href="#baseline" title="Permalink to this heading"></a></h1>
<section id="overview">
<h2>Overview<a class="headerlink" href="#overview" title="Permalink to this heading"></a></h2>
<p>We will release an E2E SA-ASR~\cite{kanda21b_interspeech} baseline conducted on <a class="reference external" href="https://github.com/alibaba-damo-academy/FunASR">FunASR</a> at the time according to the timeline. The model architecture is shown in Figure 3. The SpeakerEncoder is initialized with a pre-trained speaker verification model from ModelScope. This speaker verification model is also be used to extract the speaker embedding in the speaker profile.</p>
<p>We will release an E2E SA-ASR baseline conducted on <a class="reference external" href="https://github.com/alibaba-damo-academy/FunASR">FunASR</a> at the time according to the timeline. The model architecture is shown in Figure 3. The SpeakerEncoder is initialized with a pre-trained speaker verification model from ModelScope. This speaker verification model is also be used to extract the speaker embedding in the speaker profile.</p>
<p><img alt="model archietecture" src="_images/sa_asr_arch.png" /></p>
</section>
<section id="quick-start">
<h2>Quick start<a class="headerlink" href="#quick-start" title="Permalink to this heading"></a></h2>
<p>#TODO: fill with the README.md of the baseline</p>
<p>To run the baseline, first you need to install FunASR and ModelScope. (<a class="reference external" href="https://alibaba-damo-academy.github.io/FunASR/en/installation.html">installation</a>)<br />
There are two startup scripts, <code class="docutils literal notranslate"><span class="pre">run.sh</span></code> for training and evaluating on the old eval and test sets, and <code class="docutils literal notranslate"><span class="pre">run_m2met_2023_infer.sh</span></code> for inference on the new test set of the Multi-Channel Multi-Party Meeting Transcription 2.0 (<a class="reference external" href="https://alibaba-damo-academy.github.io/FunASR/m2met2/index.html">M2MET2.0</a>) Challenge.<br />
Before running <code class="docutils literal notranslate"><span class="pre">run.sh</span></code>, you must manually download and unpack the <a class="reference external" href="http://www.openslr.org/119/">AliMeeting</a> corpus and place it in the <code class="docutils literal notranslate"><span class="pre">./dataset</span></code> directory:</p>
<div class="highlight-shell notranslate"><div class="highlight"><pre><span></span>dataset
<span class="p">|</span>——<span class="w"> </span>Eval_Ali_far
<span class="p">|</span>——<span class="w"> </span>Eval_Ali_near
<span class="p">|</span>——<span class="w"> </span>Test_Ali_far
<span class="p">|</span>——<span class="w"> </span>Test_Ali_near
<span class="p">|</span>——<span class="w"> </span>Train_Ali_far
<span class="p">|</span>——<span class="w"> </span>Train_Ali_near
Before<span class="w"> </span>running<span class="w"> </span><span class="sb">`</span>run_m2met_2023_infer.sh<span class="sb">`</span>,<span class="w"> </span>you<span class="w"> </span>need<span class="w"> </span>to<span class="w"> </span>place<span class="w"> </span>the<span class="w"> </span>new<span class="w"> </span><span class="nb">test</span><span class="w"> </span><span class="nb">set</span><span class="w"> </span><span class="sb">`</span>Test_2023_Ali_far<span class="sb">`</span><span class="w"> </span><span class="o">(</span>to<span class="w"> </span>be<span class="w"> </span>released<span class="w"> </span>after<span class="w"> </span>the<span class="w"> </span>challenge<span class="w"> </span>starts<span class="o">)</span><span class="w"> </span><span class="k">in</span><span class="w"> </span>the<span class="w"> </span><span class="sb">`</span>./dataset<span class="sb">`</span><span class="w"> </span>directory,<span class="w"> </span>which<span class="w"> </span>contains<span class="w"> </span>only<span class="w"> </span>raw<span class="w"> </span>audios.<span class="w"> </span>Then<span class="w"> </span>put<span class="w"> </span>the<span class="w"> </span>given<span class="w"> </span><span class="sb">`</span>wav.scp<span class="sb">`</span>,<span class="w"> </span><span class="sb">`</span>wav_raw.scp<span class="sb">`</span>,<span class="w"> </span><span class="sb">`</span>segments<span class="sb">`</span>,<span class="w"> </span><span class="sb">`</span>utt2spk<span class="sb">`</span><span class="w"> </span>and<span class="w"> </span><span class="sb">`</span>spk2utt<span class="sb">`</span><span class="w"> </span><span class="k">in</span><span class="w"> </span>the<span class="w"> </span><span class="sb">`</span>./data/Test_2023_Ali_far<span class="sb">`</span><span class="w"> </span>directory.<span class="w"> </span>
<span class="sb">```</span>shell
data/Test_2023_Ali_far
<span class="p">|</span>——<span class="w"> </span>wav.scp
<span class="p">|</span>——<span class="w"> </span>wav_raw.scp
<span class="p">|</span>——<span class="w"> </span>segments
<span class="p">|</span>——<span class="w"> </span>utt2spk
<span class="p">|</span>——<span class="w"> </span>spk2utt
</pre></div>
</div>
<p>For more details you can see <a class="reference external" href="https://github.com/alibaba-damo-academy/FunASR/blob/main/egs/alimeeting/sa-asr/README.md">here</a></p>
</section>
<section id="baseline-results">
<h2>Baseline results<a class="headerlink" href="#baseline-results" title="Permalink to this heading"></a></h2>
@ -170,7 +190,7 @@
<li class="right" >
<a href="Track_setting_and_evaluation.html" title="Track &amp; Evaluation"
>previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Baseline</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Contact &#8212; m2met2 documentation</title>
<title>Contact &#8212; MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -40,7 +40,7 @@
<li class="right" >
<a href="Organizers.html" title="Organizers"
accesskey="P">previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Contact</a></li>
</ul>
</div>
@ -51,7 +51,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 documentation</a>
index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -160,7 +160,7 @@
<li class="right" >
<a href="Organizers.html" title="Organizers"
>previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Contact</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Datasets &#8212; m2met2 documentation</title>
<title>Datasets &#8212; MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -45,7 +45,7 @@
<li class="right" >
<a href="Introduction.html" title="Introduction"
accesskey="P">previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Datasets</a></li>
</ul>
</div>
@ -56,7 +56,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 documentation</a>
index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -181,7 +181,7 @@
<li class="right" >
<a href="Introduction.html" title="Introduction"
>previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Datasets</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Introduction &#8212; m2met2 documentation</title>
<title>Introduction &#8212; MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -45,7 +45,7 @@
<li class="right" >
<a href="index.html" title="ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0)"
accesskey="P">previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Introduction</a></li>
</ul>
</div>
@ -56,7 +56,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 documentation</a>
index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -139,16 +139,16 @@
<li><p><span class="math notranslate nohighlight">\( May~8, 2023: \)</span> Baseline release.</p></li>
<li><p><span class="math notranslate nohighlight">\( May~15, 2023: \)</span> Registration deadline, the due date for participants to join the Challenge.</p></li>
<li><p><span class="math notranslate nohighlight">\( June~9, 2023: \)</span> Test data release and leaderboard open.</p></li>
<li><p><span class="math notranslate nohighlight">\( June~13, 2023: \)</span> Final submission deadline.</p></li>
<li><p><span class="math notranslate nohighlight">\( June~13, 2023: \)</span> Final submission deadline and leaderboar close.</p></li>
<li><p><span class="math notranslate nohighlight">\( June~19, 2023: \)</span> Evaluation result and ranking release.</p></li>
<li><p><span class="math notranslate nohighlight">\( July~3, 2023: \)</span> Deadline for paper submission.</p></li>
<li><p><span class="math notranslate nohighlight">\( July~10, 2023: \)</span> Deadline for final paper submission.</p></li>
<li><p><span class="math notranslate nohighlight">\( December~12\ to\ 16, 2023: \)</span> ASRU Workshop and challenge session</p></li>
<li><p><span class="math notranslate nohighlight">\( December~12\ to\ 16, 2023: \)</span> ASRU Workshop and challenge Session</p></li>
</ul>
</section>
<section id="guidelines">
<h2>Guidelines<a class="headerlink" href="#guidelines" title="Permalink to this heading"></a></h2>
<p>Interested participants, whether from academia or industry, must register for the challenge by completing the Google form below. The deadline for registration is May 15, 2023.</p>
<p>Interested participants, whether from academia or industry, must register for the challenge by completing the Google form below. The deadline for registration is May 22, 2023. Participants are also welcome to join the <a class="reference external" href="https://alibaba-damo-academy.github.io/FunASR/m2met2/Contact.html">wechat group</a> of M2MET2.0 and keep up to date with the latest updates about the challenge.</p>
<p><a class="reference external" href="https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link">M2MET2.0 Registration</a></p>
<p>Within three working days, the challenge organizer will send email invitations to eligible teams to participate in the challenge. All qualified teams are required to adhere to the challenge rules, which will be published on the challenge page. Prior to the ranking release time, each participant must submit a system description document detailing their approach and methods. The organizer will select the top ranking submissions to be included in the ASRU2023 Proceedings.</p>
</section>
@ -184,7 +184,7 @@
<li class="right" >
<a href="index.html" title="ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0)"
>previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Introduction</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Organizers &#8212; m2met2 documentation</title>
<title>Organizers &#8212; MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -44,7 +44,7 @@
<li class="right" >
<a href="Rules.html" title="Rules"
accesskey="P">previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Organizers</a></li>
</ul>
</div>
@ -55,7 +55,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 documentation</a>
index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -180,7 +180,7 @@ Email: <a class="reference external" href="mailto:sly&#46;zsl&#37;&#52;&#48;alib
<li class="right" >
<a href="Rules.html" title="Rules"
>previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Organizers</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Rules &#8212; m2met2 documentation</title>
<title>Rules &#8212; MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -44,7 +44,7 @@
<li class="right" >
<a href="Baseline.html" title="Baseline"
accesskey="P">previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Rules</a></li>
</ul>
</div>
@ -55,7 +55,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 documentation</a>
index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -165,7 +165,7 @@
<li class="right" >
<a href="Baseline.html" title="Baseline"
>previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Rules</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Track &amp; Evaluation &#8212; m2met2 documentation</title>
<title>Track &amp; Evaluation &#8212; MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -45,7 +45,7 @@
<li class="right" >
<a href="Dataset.html" title="Datasets"
accesskey="P">previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Track &amp; Evaluation</a></li>
</ul>
</div>
@ -56,7 +56,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 documentation</a>
index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -180,7 +180,7 @@
<li class="right" >
<a href="Dataset.html" title="Datasets"
>previous</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Track &amp; Evaluation</a></li>
</ul>
</div>

Binary file not shown.

Before

Width:  |  Height:  |  Size: 141 KiB

After

Width:  |  Height:  |  Size: 183 KiB

View File

@ -1,11 +1,31 @@
# Baseline
## Overview
We will release an E2E SA-ASR~\cite{kanda21b_interspeech} baseline conducted on [FunASR](https://github.com/alibaba-damo-academy/FunASR) at the time according to the timeline. The model architecture is shown in Figure 3. The SpeakerEncoder is initialized with a pre-trained speaker verification model from ModelScope. This speaker verification model is also be used to extract the speaker embedding in the speaker profile.
We will release an E2E SA-ASR baseline conducted on [FunASR](https://github.com/alibaba-damo-academy/FunASR) at the time according to the timeline. The model architecture is shown in Figure 3. The SpeakerEncoder is initialized with a pre-trained speaker verification model from ModelScope. This speaker verification model is also be used to extract the speaker embedding in the speaker profile.
![model archietecture](images/sa_asr_arch.png)
## Quick start
#TODO: fill with the README.md of the baseline
To run the baseline, first you need to install FunASR and ModelScope. ([installation](https://alibaba-damo-academy.github.io/FunASR/en/installation.html))
There are two startup scripts, `run.sh` for training and evaluating on the old eval and test sets, and `run_m2met_2023_infer.sh` for inference on the new test set of the Multi-Channel Multi-Party Meeting Transcription 2.0 ([M2MET2.0](https://alibaba-damo-academy.github.io/FunASR/m2met2/index.html)) Challenge.
Before running `run.sh`, you must manually download and unpack the [AliMeeting](http://www.openslr.org/119/) corpus and place it in the `./dataset` directory:
```shell
dataset
|—— Eval_Ali_far
|—— Eval_Ali_near
|—— Test_Ali_far
|—— Test_Ali_near
|—— Train_Ali_far
|—— Train_Ali_near
Before running `run_m2met_2023_infer.sh`, you need to place the new test set `Test_2023_Ali_far` (to be released after the challenge starts) in the `./dataset` directory, which contains only raw audios. Then put the given `wav.scp`, `wav_raw.scp`, `segments`, `utt2spk` and `spk2utt` in the `./data/Test_2023_Ali_far` directory.
```shell
data/Test_2023_Ali_far
|—— wav.scp
|—— wav_raw.scp
|—— segments
|—— utt2spk
|—— spk2utt
```
For more details you can see [here](https://github.com/alibaba-damo-academy/FunASR/blob/main/egs/alimeeting/sa-asr/README.md)
## Baseline results
The results of the baseline system are shown in Table 3. The speaker profile adopts the oracle speaker embedding during training. However, due to the lack of oracle speaker label during evaluation, the speaker profile provided by an additional spectral clustering is used. Meanwhile, the results of using the oracle speaker profile on Eval and Test Set are also provided to show the impact of speaker profile accuracy.

View File

@ -13,15 +13,15 @@ Building on the success of the previous M2MeT challenge, we are excited to propo
- $ May~8, 2023: $ Baseline release.
- $ May~15, 2023: $ Registration deadline, the due date for participants to join the Challenge.
- $ June~9, 2023: $ Test data release and leaderboard open.
- $ June~13, 2023: $ Final submission deadline.
- $ June~13, 2023: $ Final submission deadline and leaderboar close.
- $ June~19, 2023: $ Evaluation result and ranking release.
- $ July~3, 2023: $ Deadline for paper submission.
- $ July~10, 2023: $ Deadline for final paper submission.
- $ December~12\ to\ 16, 2023: $ ASRU Workshop and challenge session
- $ December~12\ to\ 16, 2023: $ ASRU Workshop and challenge Session
## Guidelines
Interested participants, whether from academia or industry, must register for the challenge by completing the Google form below. The deadline for registration is May 15, 2023.
Interested participants, whether from academia or industry, must register for the challenge by completing the Google form below. The deadline for registration is May 22, 2023. Participants are also welcome to join the [wechat group](https://alibaba-damo-academy.github.io/FunASR/m2met2/Contact.html) of M2MET2.0 and keep up to date with the latest updates about the challenge.
[M2MET2.0 Registration](https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link)

View File

@ -14,7 +14,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Index &#8212; m2met2 documentation</title>
<title>Index &#8212; MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -35,7 +35,7 @@
<li class="right" style="margin-right: 10px">
<a href="#" title="General Index"
accesskey="I">index</a></li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Index</a></li>
</ul>
</div>
@ -46,7 +46,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 documentation</a>
index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -132,7 +132,7 @@
<li class="right" style="margin-right: 10px">
<a href="#" title="General Index"
>index</a></li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Index</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0) &#8212; m2met2 documentation</title>
<title>ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0) &#8212; MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -40,7 +40,7 @@
<li class="right" >
<a href="Introduction.html" title="Introduction"
accesskey="N">next</a> |</li>
<li class="nav-item nav-item-0"><a href="#">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="#">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0)</a></li>
</ul>
</div>
@ -51,7 +51,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
#" class="text-logo">m2met2 documentation</a>
#" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -160,7 +160,7 @@ To facilitate reproducible research, we provide a comprehensive overview of the
<li class="right" >
<a href="Introduction.html" title="Introduction"
>next</a> |</li>
<li class="nav-item nav-item-0"><a href="#">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="#">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0)</a></li>
</ul>
</div>

View File

@ -1,5 +1,5 @@
# Sphinx inventory version 2
# Project: m2met2
# Project: MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0
# Version:
# The remainder of this file is compressed using zlib.
xÚ…<EFBFBD>AOƒ0ÇïýïdôÀ2ñæ<C3B1>!™M <-•6@ìŠ<C3AC>bœŸÞ޶£·ö÷~ÿ÷^Þ† B¶JÀ ù#ï*îaãØªÑG Š;¥Y¥çŠCÞxbÚd†CÖpŸa¥ûŽ<C3BB>•n;5·çÜ7Ýõ5Sí—臹y¡Þ+F)Ê|•ö¬z;˜áºUõ<55>)~LŽìzƒ¿<Ûk2àŽj¡ZÅÅçÔH²W!Ï­<µ1|~¢‹ææÍŒ¨,ö®ÃÈö)ÅAü¤î—G}<7D>,I(&[ EDʸÀ9Å;ÆLÓ„lWk¸ÍÂLPóºCÇŽÿ²Øû)ð;937—­8«þËOº1×ÁúªY-³™ÒÖsV ô é,Ö

View File

@ -14,7 +14,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Search &#8212; m2met2 documentation</title>
<title>Search &#8212; MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
@ -41,7 +41,7 @@
<li class="right" style="margin-right: 10px">
<a href="genindex.html" title="General Index"
accesskey="I">index</a></li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Search</a></li>
</ul>
</div>
@ -52,7 +52,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 documentation</a>
index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a>
<div class="sidebar-block">
<div class="sidebar-toc">
@ -149,7 +149,7 @@
<li class="right" style="margin-right: 10px">
<a href="genindex.html" title="General Index"
>index</a></li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">Search</a></li>
</ul>
</div>

File diff suppressed because one or more lines are too long

View File

@ -7,7 +7,7 @@ import guzzle_sphinx_theme
# -- Project information -----------------------------------------------------
# https://www.sphinx-doc.org/en/master/usage/configuration.html#project-information
project = 'm2met2'
project = 'MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0'
copyright = '2023, Speech Lab, Alibaba Group; ASLP Group, Northwestern Polytechnical University'
author = 'Speech Lab, Alibaba Group; Audio, Speech and Language Processing Group, Northwestern Polytechnical University'

Binary file not shown.

Before

Width:  |  Height:  |  Size: 141 KiB

After

Width:  |  Height:  |  Size: 183 KiB

View File

@ -1,4 +1,4 @@
# Sphinx build info version 1
# This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done.
config: 5462207d1656a9ae4ca43c2890d094be
config: 06d9c1d4093817b45b9d4df7ab350eaf
tags: 645f666f9bcd5a90fca523b33c5a78b7

Binary file not shown.

Before

Width:  |  Height:  |  Size: 141 KiB

After

Width:  |  Height:  |  Size: 183 KiB

View File

@ -5,8 +5,27 @@
![model archietecture](images/sa_asr_arch.png)
## 快速开始
#TODO: fill with the README.md of the baseline
首先需要安装FunASR和ModelScope. ([installation](https://alibaba-damo-academy.github.io/FunASR/en/installation.html))
基线系统有训练和测试两个脚本,`run.sh` 是用于训练基线系统并在M2MET的验证与测试集上评估的而 `run_m2met_2023_infer.sh` 用于此次竞赛预备开放的全新测试集上测试同时生成符合竞赛最终提交格式的文件。
在运行 `run.sh`前,需要自行下载并解压[AliMeeting](http://www.openslr.org/119/)数据集并放置于`./dataset`目录下:
```shell
dataset
|—— Eval_Ali_far
|—— Eval_Ali_near
|—— Test_Ali_far
|—— Test_Ali_near
|—— Train_Ali_far
|—— Train_Ali_near
在运行 `run_m2met_2023_infer.sh`前, 需要将测试集`Test_2023_Ali_far`仅包含音频将于6.16发布)放置于`./dataset`目录下。然后将主办方提供的`wav.scp``wav_raw.scp``segments``utt2spk`和`spk2utt`放置于`./data/Test_2023_Ali_far`目录下。
```shell
data/Test_2023_Ali_far
|—— wav.scp
|—— wav_raw.scp
|—— segments
|—— utt2spk
|—— spk2utt
```
更多基线系统详情见[此处](https://github.com/alibaba-damo-academy/FunASR/blob/main/egs/alimeeting/sa-asr/README.md)
## 基线结果
基线系统的结果如表3所示。在训练期间说话人档案采用了真实说话人嵌入。然而由于在评估过程中缺乏真实说话人标签因此使用了由额外的谱聚类提供的说话人特征。同时我们还提供了在评估和测试集上使用真实说话人档案的结果以显示说话人档案准确性的影响。
![baseline result](images/baseline_result.png)

View File

@ -1,5 +1,6 @@
# 简介
## 竞赛介绍
语音识别Automatic Speech Recognition、说话人日志Speaker Diarization等语音处理技术的最新发展激发了众多智能语音的广泛应用。然而会议场景由于其复杂的声学条件和不同的讲话风格包括重叠的讲话、不同数量的发言者、大会议室的远场信号以及环境噪声和混响仍然属于一项极具挑战性的任务。
为了推动会议场景语音识别的发展,已经有很多相关的挑战赛,如 Rich Transcription evaluation 和 CHIMEComputational Hearing in Multisource Environments 挑战赛。最新的CHIME挑战赛关注于远距离自动语音识别和开发能在各种不同拓扑结构的阵列和应用场景中通用的系统。然而不同语言之间的差异限制了非英语会议转录的进展。MISPMultimodal Information Based Speech Processing和M2MeTMulti-Channel Multi-Party Meeting Transcription挑战赛为推动普通话会议场景语音识别做出了贡献。MISP挑战赛侧重于用视听多模态的方法解决日常家庭环境中的远距离多麦克风信号处理问题而M2MeT挑战则侧重于解决离线会议室中会议转录的语音重叠问题。
@ -14,18 +15,18 @@ IASSP2022 M2MeT挑战的侧重点是会议场景它包括两个赛道
## 时间安排(AOE时间)
- $ 2023.4.29: $ 开放注册
- $ 2023.5.8: $ 基线发布
- $ 2023.5.15: $ 注册截止
- $ 2023.6.9: $ 测试集数据发布
- $ 2023.6.13: $ 最终结果提交截止
- $ 2023.6.19: $ 评估结果和排名发布
- $ 2023.7.3: $ 论文提交截止
- $ 2023.7.10: $ 最终版论文提交截止
- $ 2023.12.12: $ ASRU Workshop & challenge session
- $ 2023.5.11: $ 基线发布
- $ 2023.5.22: $ 注册截止
- $ 2023.6.16: $ 测试集数据发布,排行榜开放
- $ 2023.6.20: $ 最终结果提交截止,排行榜关闭
- $ 2023.6.26: $ 评估结果和排名发布
- $ 2023.7.3: $ 论文提交截止通过ASRU2023官方投稿选择竞赛Session
- $ 2023.7.10: $ 最终版论文提交截止通过ASRU2023官方投稿选择竞赛Session
- $ 2023.12.12: $ ASRU Workshop & Challenge Session
## 竞赛报名
来自学术界和工业界的有意向参赛者均应在2023年5月15日及之前填写下方的谷歌表单
来自学术界和工业界的有意向参赛者均应在2023年5月22日及之前填写下方的谷歌表单。同时欢迎广大参赛者加入[官方交流微信群](https://alibaba-damo-academy.github.io/FunASR/m2met2_cn/%E8%81%94%E7%B3%BB%E6%96%B9%E5%BC%8F.html)交流并及时获取竞赛最新消息
[M2MET2.0报名](https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link)

View File

@ -14,7 +14,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>索引 &#8212; m2met2 文档</title>
<title>索引 &#8212; 多通道多方会议转录挑战2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -36,7 +36,7 @@
<li class="right" style="margin-right: 10px">
<a href="#" title="总索引"
accesskey="I">索引</a></li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">索引</a></li>
</ul>
</div>
@ -47,7 +47,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 文档</a>
index.html" class="text-logo">多通道多方会议转录挑战2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -133,7 +133,7 @@
<li class="right" style="margin-right: 10px">
<a href="#" title="总索引"
>索引</a></li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">索引</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>ASRU 2023 多通道多方会议转录挑战 2.0 &#8212; m2met2 文档</title>
<title>ASRU 2023 多通道多方会议转录挑战 2.0 &#8212; 多通道多方会议转录挑战2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -41,7 +41,7 @@
<li class="right" >
<a href="%E7%AE%80%E4%BB%8B.html" title="简介"
accesskey="N">下一页</a> |</li>
<li class="nav-item nav-item-0"><a href="#">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="#">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">ASRU 2023 多通道多方会议转录挑战 2.0</a></li>
</ul>
</div>
@ -52,7 +52,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
#" class="text-logo">m2met2 文档</a>
#" class="text-logo">多通道多方会议转录挑战2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -161,7 +161,7 @@
<li class="right" >
<a href="%E7%AE%80%E4%BB%8B.html" title="简介"
>下一页</a> |</li>
<li class="nav-item nav-item-0"><a href="#">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="#">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">ASRU 2023 多通道多方会议转录挑战 2.0</a></li>
</ul>
</div>

View File

@ -14,7 +14,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>搜索 &#8212; m2met2 文档</title>
<title>搜索 &#8212; 多通道多方会议转录挑战2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
@ -42,7 +42,7 @@
<li class="right" style="margin-right: 10px">
<a href="genindex.html" title="总索引"
accesskey="I">索引</a></li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">搜索</a></li>
</ul>
</div>
@ -53,7 +53,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 文档</a>
index.html" class="text-logo">多通道多方会议转录挑战2.0</a>
<div class="sidebar-block">
<div class="sidebar-toc">
@ -149,7 +149,7 @@
<li class="right" style="margin-right: 10px">
<a href="genindex.html" title="总索引"
>索引</a></li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">搜索</a></li>
</ul>
</div>

View File

@ -1 +1 @@
Search.setIndex({"docnames": ["index", "\u57fa\u7ebf", "\u6570\u636e\u96c6", "\u7b80\u4ecb", "\u7ec4\u59d4\u4f1a", "\u8054\u7cfb\u65b9\u5f0f", "\u89c4\u5219", "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30"], "filenames": ["index.rst", "\u57fa\u7ebf.md", "\u6570\u636e\u96c6.md", "\u7b80\u4ecb.md", "\u7ec4\u59d4\u4f1a.md", "\u8054\u7cfb\u65b9\u5f0f.md", "\u89c4\u5219.md", "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30.md"], "titles": ["ASRU 2023 \u591a\u901a\u9053\u591a\u65b9\u4f1a\u8bae\u8f6c\u5f55\u6311\u6218 2.0", "\u57fa\u7ebf", "\u6570\u636e\u96c6", "\u7b80\u4ecb", "\u7ec4\u59d4\u4f1a", "\u8054\u7cfb\u65b9\u5f0f", "\u7ade\u8d5b\u89c4\u5219", "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30"], "terms": {"m2met": [0, 3, 5, 7], "asru2023": [0, 3], "m2met2": [0, 3, 5, 7], "funasr": 1, "sa": 1, "asr": [1, 3, 7], "speakerencod": 1, "modelscop": [1, 7], "todo": 1, "fill": 1, "with": 1, "the": 1, "readm": 1, "md": 1, "of": 1, "baselin": [1, 2], "aishel": [2, 7], "cn": [2, 4, 7], "celeb": [2, 7], "test": [2, 6, 7], "2023": [2, 3, 6, 7], "118": 2, "75": 2, "104": 2, "train": 2, "eval": [2, 6], "10": [2, 3, 7], "212": 2, "15": [2, 3], "30": 2, "456": 2, "25": 2, "13": [2, 3], "55": 2, "42": 2, "27": 2, "34": 2, "76": 2, "20": 2, "textgrid": 2, "id": 2, "openslr": 2, "automat": 3, "speech": 3, "recognit": 3, "speaker": 3, "diariz": 3, "rich": 3, "transcript": 3, "evalu": 3, "chime": 3, "comput": 3, "hear": 3, "in": 3, "multisourc": 3, "environ": 3, "misp": 3, "multimod": 3, "inform": 3, "base": 3, "process": 3, "multi": 3, "channel": 3, "parti": 3, "meet": 3, "iassp2022": 3, "29": 3, "19": 3, "12": 3, "asru": 3, "workshop": 3, "challeng": 3, "session": 3, "lxie": 4, "nwpu": 4, "edu": 4, "kong": 4, "aik": 4, "lee": 4, "star": 4, "kongaik": 4, "ieee": 4, "org": 4, "zhiji": 4, "yzj": 4, "alibaba": 4, "inc": 4, "com": [4, 5], "sli": 4, "zsl": 4, "yanminqian": 4, "sjtu": 4, "zhuc": 4, "microsoft": 4, "wujian": 4, "ceo": 4, "buhui": 4, "aishelldata": 4, "alimeet": [5, 7], "gmail": 5, "cpcer": [6, 7], "las": 6, "rnnt": 6, "transform": 6, "aishell4": 7, "vad": 7, "cer": 7, "ins": 7, "sub": 7, "del": 7, "text": 7, "frac": 7, "mathcal": 7, "n_": 7, "total": 7, "time": 7, "100": 7, "hug": 7, "face": 7}, "objects": {}, "objtypes": {}, "objnames": {}, "titleterms": {"asru": 0, "2023": 0, "alimeet": 2, "aoe": 3}, "envversion": {"sphinx.domains.c": 2, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 8, "sphinx.domains.index": 1, "sphinx.domains.javascript": 2, "sphinx.domains.math": 2, "sphinx.domains.python": 3, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx": 57}, "alltitles": {"ASRU 2023 \u591a\u901a\u9053\u591a\u65b9\u4f1a\u8bae\u8f6c\u5f55\u6311\u6218 2.0": [[0, "asru-2023-2-0"]], "\u76ee\u5f55:": [[0, null]], "\u57fa\u7ebf": [[1, "id1"]], "\u57fa\u7ebf\u6982\u8ff0": [[1, "id2"]], "\u5feb\u901f\u5f00\u59cb": [[1, "id3"]], "\u57fa\u7ebf\u7ed3\u679c": [[1, "id4"]], "\u6570\u636e\u96c6": [[2, "id1"]], "\u6570\u636e\u96c6\u6982\u8ff0": [[2, "id2"]], "Alimeeting\u6570\u636e\u96c6\u4ecb\u7ecd": [[2, "alimeeting"]], "\u83b7\u53d6\u6570\u636e": [[2, "id3"]], "\u7b80\u4ecb": [[3, "id1"]], "\u7ade\u8d5b\u4ecb\u7ecd": [[3, "id2"]], "\u65f6\u95f4\u5b89\u6392(AOE\u65f6\u95f4)": [[3, "aoe"]], "\u7ade\u8d5b\u62a5\u540d": [[3, "id3"]], "\u7ec4\u59d4\u4f1a": [[4, "id1"]], "\u8054\u7cfb\u65b9\u5f0f": [[5, "id1"]], "\u7ade\u8d5b\u89c4\u5219": [[6, "id1"]], "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30": [[7, "id1"]], "\u8bf4\u8bdd\u4eba\u76f8\u5173\u7684\u8bed\u97f3\u8bc6\u522b": [[7, "id2"]], "\u8bc4\u4f30\u65b9\u6cd5": [[7, "id3"]], "\u5b50\u8d5b\u9053\u8bbe\u7f6e": [[7, "id4"]], "\u5b50\u8d5b\u9053\u4e00 (\u9650\u5b9a\u8bad\u7ec3\u6570\u636e):": [[7, "id5"]], "\u5b50\u8d5b\u9053\u4e8c (\u5f00\u653e\u8bad\u7ec3\u6570\u636e):": [[7, "id6"]]}, "indexentries": {}})
Search.setIndex({"docnames": ["index", "\u57fa\u7ebf", "\u6570\u636e\u96c6", "\u7b80\u4ecb", "\u7ec4\u59d4\u4f1a", "\u8054\u7cfb\u65b9\u5f0f", "\u89c4\u5219", "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30"], "filenames": ["index.rst", "\u57fa\u7ebf.md", "\u6570\u636e\u96c6.md", "\u7b80\u4ecb.md", "\u7ec4\u59d4\u4f1a.md", "\u8054\u7cfb\u65b9\u5f0f.md", "\u89c4\u5219.md", "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30.md"], "titles": ["ASRU 2023 \u591a\u901a\u9053\u591a\u65b9\u4f1a\u8bae\u8f6c\u5f55\u6311\u6218 2.0", "\u57fa\u7ebf", "\u6570\u636e\u96c6", "\u7b80\u4ecb", "\u7ec4\u59d4\u4f1a", "\u8054\u7cfb\u65b9\u5f0f", "\u7ade\u8d5b\u89c4\u5219", "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30"], "terms": {"m2met": [0, 1, 3, 5, 7], "asru2023": [0, 3], "m2met2": [0, 3, 5, 7], "funasr": 1, "sa": 1, "asr": [1, 3, 7], "speakerencod": 1, "modelscop": [1, 7], "instal": 1, "run": 1, "sh": 1, "run_m2met_2023_inf": 1, "alimeet": [1, 5, 7], "dataset": 1, "eval_ali_far": 1, "eval_ali_near": 1, "test_ali_far": 1, "test_ali_near": 1, "train_ali_far": 1, "train_ali_near": 1, "test_2023_ali_far": 1, "16": [1, 3], "wav": 1, "scp": 1, "wav_raw": 1, "segment": 1, "utt2spk": 1, "spk2utt": 1, "data": 1, "shell": 1, "aishel": [2, 7], "cn": [2, 4, 7], "celeb": [2, 7], "test": [2, 6, 7], "2023": [2, 3, 6, 7], "118": 2, "75": 2, "104": 2, "train": 2, "eval": [2, 6], "10": [2, 3, 7], "212": 2, "15": 2, "30": 2, "456": 2, "25": 2, "13": 2, "55": 2, "42": 2, "27": 2, "34": 2, "76": 2, "20": [2, 3], "textgrid": 2, "id": 2, "openslr": 2, "baselin": 2, "automat": 3, "speech": 3, "recognit": 3, "speaker": 3, "diariz": 3, "rich": 3, "transcript": 3, "evalu": 3, "chime": 3, "comput": 3, "hear": 3, "in": 3, "multisourc": 3, "environ": 3, "misp": 3, "multimod": 3, "inform": 3, "base": 3, "process": 3, "multi": 3, "channel": 3, "parti": 3, "meet": 3, "iassp2022": 3, "29": 3, "11": 3, "22": 3, "26": 3, "session": 3, "12": 3, "asru": 3, "workshop": 3, "challeng": 3, "lxie": 4, "nwpu": 4, "edu": 4, "kong": 4, "aik": 4, "lee": 4, "star": 4, "kongaik": 4, "ieee": 4, "org": 4, "zhiji": 4, "yzj": 4, "alibaba": 4, "inc": 4, "com": [4, 5], "sli": 4, "zsl": 4, "yanminqian": 4, "sjtu": 4, "zhuc": 4, "microsoft": 4, "wujian": 4, "ceo": 4, "buhui": 4, "aishelldata": 4, "gmail": 5, "cpcer": [6, 7], "las": 6, "rnnt": 6, "transform": 6, "aishell4": 7, "vad": 7, "cer": 7, "ins": 7, "sub": 7, "del": 7, "text": 7, "frac": 7, "mathcal": 7, "n_": 7, "total": 7, "time": 7, "100": 7, "hug": 7, "face": 7}, "objects": {}, "objtypes": {}, "objnames": {}, "titleterms": {"asru": 0, "2023": 0, "alimeet": 2, "aoe": 3}, "envversion": {"sphinx.domains.c": 2, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 8, "sphinx.domains.index": 1, "sphinx.domains.javascript": 2, "sphinx.domains.math": 2, "sphinx.domains.python": 3, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx": 57}, "alltitles": {"ASRU 2023 \u591a\u901a\u9053\u591a\u65b9\u4f1a\u8bae\u8f6c\u5f55\u6311\u6218 2.0": [[0, "asru-2023-2-0"]], "\u76ee\u5f55:": [[0, null]], "\u57fa\u7ebf": [[1, "id1"]], "\u57fa\u7ebf\u6982\u8ff0": [[1, "id2"]], "\u5feb\u901f\u5f00\u59cb": [[1, "id3"]], "\u57fa\u7ebf\u7ed3\u679c": [[1, "id4"]], "\u6570\u636e\u96c6": [[2, "id1"]], "\u6570\u636e\u96c6\u6982\u8ff0": [[2, "id2"]], "Alimeeting\u6570\u636e\u96c6\u4ecb\u7ecd": [[2, "alimeeting"]], "\u83b7\u53d6\u6570\u636e": [[2, "id3"]], "\u7b80\u4ecb": [[3, "id1"]], "\u7ade\u8d5b\u4ecb\u7ecd": [[3, "id2"]], "\u65f6\u95f4\u5b89\u6392(AOE\u65f6\u95f4)": [[3, "aoe"]], "\u7ade\u8d5b\u62a5\u540d": [[3, "id3"]], "\u7ec4\u59d4\u4f1a": [[4, "id1"]], "\u8054\u7cfb\u65b9\u5f0f": [[5, "id1"]], "\u7ade\u8d5b\u89c4\u5219": [[6, "id1"]], "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30": [[7, "id1"]], "\u8bf4\u8bdd\u4eba\u76f8\u5173\u7684\u8bed\u97f3\u8bc6\u522b": [[7, "id2"]], "\u8bc4\u4f30\u65b9\u6cd5": [[7, "id3"]], "\u5b50\u8d5b\u9053\u8bbe\u7f6e": [[7, "id4"]], "\u5b50\u8d5b\u9053\u4e00 (\u9650\u5b9a\u8bad\u7ec3\u6570\u636e):": [[7, "id5"]], "\u5b50\u8d5b\u9053\u4e8c (\u5f00\u653e\u8bad\u7ec3\u6570\u636e):": [[7, "id6"]]}, "indexentries": {}})

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>基线 &#8212; m2met2 文档</title>
<title>基线 &#8212; 多通道多方会议转录挑战2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -45,7 +45,7 @@
<li class="right" >
<a href="%E8%B5%9B%E9%81%93%E8%AE%BE%E7%BD%AE%E4%B8%8E%E8%AF%84%E4%BC%B0.html" title="赛道设置与评估"
accesskey="P">上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">基线</a></li>
</ul>
</div>
@ -56,7 +56,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 文档</a>
index.html" class="text-logo">多通道多方会议转录挑战2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -132,7 +132,27 @@
</section>
<section id="id3">
<h2>快速开始<a class="headerlink" href="#id3" title="此标题的永久链接"></a></h2>
<p>#TODO: fill with the README.md of the baseline</p>
<p>首先需要安装FunASR和ModelScope. (<a class="reference external" href="https://alibaba-damo-academy.github.io/FunASR/en/installation.html">installation</a>)<br />
基线系统有训练和测试两个脚本,<code class="docutils literal notranslate"><span class="pre">run.sh</span></code> 是用于训练基线系统并在M2MET的验证与测试集上评估的<code class="docutils literal notranslate"><span class="pre">run_m2met_2023_infer.sh</span></code> 用于此次竞赛预备开放的全新测试集上测试同时生成符合竞赛最终提交格式的文件。
在运行 <code class="docutils literal notranslate"><span class="pre">run.sh</span></code>前,需要自行下载并解压<a class="reference external" href="http://www.openslr.org/119/">AliMeeting</a>数据集并放置于<code class="docutils literal notranslate"><span class="pre">./dataset</span></code>目录下:</p>
<div class="highlight-shell notranslate"><div class="highlight"><pre><span></span>dataset
<span class="p">|</span>——<span class="w"> </span>Eval_Ali_far
<span class="p">|</span>——<span class="w"> </span>Eval_Ali_near
<span class="p">|</span>——<span class="w"> </span>Test_Ali_far
<span class="p">|</span>——<span class="w"> </span>Test_Ali_near
<span class="p">|</span>——<span class="w"> </span>Train_Ali_far
<span class="p">|</span>——<span class="w"> </span>Train_Ali_near
在运行<span class="w"> </span><span class="sb">`</span>run_m2met_2023_infer.sh<span class="sb">`</span>前,<span class="w"> </span>需要将测试集<span class="sb">`</span>Test_2023_Ali_far<span class="sb">`</span>仅包含音频将于6.16发布)放置于<span class="sb">`</span>./dataset<span class="sb">`</span>目录下。然后将主办方提供的<span class="sb">`</span>wav.scp<span class="sb">`</span><span class="sb">`</span>wav_raw.scp<span class="sb">`</span><span class="sb">`</span>segments<span class="sb">`</span><span class="sb">`</span>utt2spk<span class="sb">`</span><span class="sb">`</span>spk2utt<span class="sb">`</span>放置于<span class="sb">`</span>./data/Test_2023_Ali_far<span class="sb">`</span>目录下。
<span class="sb">```</span>shell
data/Test_2023_Ali_far
<span class="p">|</span>——<span class="w"> </span>wav.scp
<span class="p">|</span>——<span class="w"> </span>wav_raw.scp
<span class="p">|</span>——<span class="w"> </span>segments
<span class="p">|</span>——<span class="w"> </span>utt2spk
<span class="p">|</span>——<span class="w"> </span>spk2utt
</pre></div>
</div>
<p>更多基线系统详情见<a class="reference external" href="https://github.com/alibaba-damo-academy/FunASR/blob/main/egs/alimeeting/sa-asr/README.md">此处</a></p>
</section>
<section id="id4">
<h2>基线结果<a class="headerlink" href="#id4" title="此标题的永久链接"></a></h2>
@ -171,7 +191,7 @@
<li class="right" >
<a href="%E8%B5%9B%E9%81%93%E8%AE%BE%E7%BD%AE%E4%B8%8E%E8%AF%84%E4%BC%B0.html" title="赛道设置与评估"
>上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">基线</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>数据集 &#8212; m2met2 文档</title>
<title>数据集 &#8212; 多通道多方会议转录挑战2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -45,7 +45,7 @@
<li class="right" >
<a href="%E7%AE%80%E4%BB%8B.html" title="简介"
accesskey="P">上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">数据集</a></li>
</ul>
</div>
@ -56,7 +56,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 文档</a>
index.html" class="text-logo">多通道多方会议转录挑战2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -180,7 +180,7 @@ Test-2023测试集由20场会议组成这些会议是在与AliMeeting数据
<li class="right" >
<a href="%E7%AE%80%E4%BB%8B.html" title="简介"
>上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">数据集</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>简介 &#8212; m2met2 文档</title>
<title>简介 &#8212; 多通道多方会议转录挑战2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -46,7 +46,7 @@
<li class="right" >
<a href="index.html" title="ASRU 2023 多通道多方会议转录挑战 2.0"
accesskey="P">上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">简介</a></li>
</ul>
</div>
@ -57,7 +57,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 文档</a>
index.html" class="text-logo">多通道多方会议转录挑战2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -139,19 +139,19 @@
<h2>时间安排(AOE时间)<a class="headerlink" href="#aoe" title="此标题的永久链接"></a></h2>
<ul class="simple">
<li><p><span class="math notranslate nohighlight">\( 2023.4.29: \)</span> 开放注册</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.5.8: \)</span> 基线发布</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.5.15: \)</span> 注册截止</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.6.9: \)</span> 测试集数据发布</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.6.13: \)</span> 最终结果提交截止</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.6.19: \)</span> 评估结果和排名发布</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.7.3: \)</span> 论文提交截止</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.7.10: \)</span> 最终版论文提交截止</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.12.12: \)</span> ASRU Workshop &amp; challenge session</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.5.11: \)</span> 基线发布</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.5.22: \)</span> 注册截止</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.6.16: \)</span> 测试集数据发布,排行榜开放</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.6.20: \)</span> 最终结果提交截止,排行榜关闭</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.6.26: \)</span> 评估结果和排名发布</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.7.3: \)</span> 论文提交截止通过ASRU2023官方投稿选择竞赛Session</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.7.10: \)</span> 最终版论文提交截止通过ASRU2023官方投稿选择竞赛Session</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.12.12: \)</span> ASRU Workshop &amp; Challenge Session</p></li>
</ul>
</section>
<section id="id3">
<h2>竞赛报名<a class="headerlink" href="#id3" title="此标题的永久链接"></a></h2>
<p>来自学术界和工业界的有意向参赛者均应在2023年5月15日及之前填写下方的谷歌表单</p>
<p>来自学术界和工业界的有意向参赛者均应在2023年5月22日及之前填写下方的谷歌表单。同时欢迎广大参赛者加入<a class="reference external" href="https://alibaba-damo-academy.github.io/FunASR/m2met2_cn/%E8%81%94%E7%B3%BB%E6%96%B9%E5%BC%8F.html">官方交流微信群</a>交流并及时获取竞赛最新消息</p>
<p><a class="reference external" href="https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link">M2MET2.0报名</a></p>
<p>主办方将在3个工作日内通过电子邮件通知符合条件的参赛团队团队必须遵守将在挑战网站上发布的挑战规则。在排名发布之前每个参赛者必须提交一份系统描述文件详细说明使用的方法和模型。主办方将排名前列的队伍纳入ASRU2023论文集。</p>
</section>
@ -187,7 +187,7 @@
<li class="right" >
<a href="index.html" title="ASRU 2023 多通道多方会议转录挑战 2.0"
>上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">简介</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>组委会 &#8212; m2met2 文档</title>
<title>组委会 &#8212; 多通道多方会议转录挑战2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -45,7 +45,7 @@
<li class="right" >
<a href="%E8%A7%84%E5%88%99.html" title="竞赛规则"
accesskey="P">上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">组委会</a></li>
</ul>
</div>
@ -56,7 +56,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 文档</a>
index.html" class="text-logo">多通道多方会议转录挑战2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -188,7 +188,7 @@
<li class="right" >
<a href="%E8%A7%84%E5%88%99.html" title="竞赛规则"
>上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">组委会</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>联系方式 &#8212; m2met2 文档</title>
<title>联系方式 &#8212; 多通道多方会议转录挑战2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -41,7 +41,7 @@
<li class="right" >
<a href="%E7%BB%84%E5%A7%94%E4%BC%9A.html" title="组委会"
accesskey="P">上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">联系方式</a></li>
</ul>
</div>
@ -52,7 +52,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 文档</a>
index.html" class="text-logo">多通道多方会议转录挑战2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -161,7 +161,7 @@
<li class="right" >
<a href="%E7%BB%84%E5%A7%94%E4%BC%9A.html" title="组委会"
>上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">联系方式</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>竞赛规则 &#8212; m2met2 文档</title>
<title>竞赛规则 &#8212; 多通道多方会议转录挑战2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -45,7 +45,7 @@
<li class="right" >
<a href="%E5%9F%BA%E7%BA%BF.html" title="基线"
accesskey="P">上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">竞赛规则</a></li>
</ul>
</div>
@ -56,7 +56,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 文档</a>
index.html" class="text-logo">多通道多方会议转录挑战2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -166,7 +166,7 @@
<li class="right" >
<a href="%E5%9F%BA%E7%BA%BF.html" title="基线"
>上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">竞赛规则</a></li>
</ul>
</div>

View File

@ -15,7 +15,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>赛道设置与评估 &#8212; m2met2 文档</title>
<title>赛道设置与评估 &#8212; 多通道多方会议转录挑战2.0</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
@ -46,7 +46,7 @@
<li class="right" >
<a href="%E6%95%B0%E6%8D%AE%E9%9B%86.html" title="数据集"
accesskey="P">上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">赛道设置与评估</a></li>
</ul>
</div>
@ -57,7 +57,7 @@
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 文档</a>
index.html" class="text-logo">多通道多方会议转录挑战2.0</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
@ -181,7 +181,7 @@
<li class="right" >
<a href="%E6%95%B0%E6%8D%AE%E9%9B%86.html" title="数据集"
>上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-0"><a href="index.html">多通道多方会议转录挑战2.0</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">赛道设置与评估</a></li>
</ul>
</div>

View File

@ -7,7 +7,7 @@ import guzzle_sphinx_theme
# -- Project information -----------------------------------------------------
# https://www.sphinx-doc.org/en/master/usage/configuration.html#project-information
project = 'm2met2'
project = '多通道多方会议转录挑战2.0'
copyright = '2023, Speech Lab, Alibaba Group; ASLP Group, Northwestern Polytechnical University'
author = 'Speech Lab, Alibaba Group; Audio, Speech and Language Processing Group, Northwestern Polytechnical University'

Binary file not shown.

Before

Width:  |  Height:  |  Size: 141 KiB

After

Width:  |  Height:  |  Size: 183 KiB

View File

@ -5,8 +5,27 @@
![model archietecture](images/sa_asr_arch.png)
## 快速开始
#TODO: fill with the README.md of the baseline
首先需要安装FunASR和ModelScope. ([installation](https://alibaba-damo-academy.github.io/FunASR/en/installation.html))
基线系统有训练和测试两个脚本,`run.sh` 是用于训练基线系统并在M2MET的验证与测试集上评估的`run_m2met_2023_infer.sh` 用于此次竞赛预备开放的全新测试集上测试同时生成符合竞赛最终提交格式的文件。
在运行 `run.sh`前,需要自行下载并解压[AliMeeting](http://www.openslr.org/119/)数据集并放置于`./dataset`目录下:
```shell
dataset
|—— Eval_Ali_far
|—— Eval_Ali_near
|—— Test_Ali_far
|—— Test_Ali_near
|—— Train_Ali_far
|—— Train_Ali_near
在运行 `run_m2met_2023_infer.sh`前, 需要将测试集`Test_2023_Ali_far`仅包含音频将于6.16发布)放置于`./dataset`目录下。然后将主办方提供的`wav.scp``wav_raw.scp``segments``utt2spk`和`spk2utt`放置于`./data/Test_2023_Ali_far`目录下。
```shell
data/Test_2023_Ali_far
|—— wav.scp
|—— wav_raw.scp
|—— segments
|—— utt2spk
|—— spk2utt
```
更多基线系统详情见[此处](https://github.com/alibaba-damo-academy/FunASR/blob/main/egs/alimeeting/sa-asr/README.md)
## 基线结果
基线系统的结果如表3所示。在训练期间说话人档案采用了真实说话人嵌入。然而由于在评估过程中缺乏真实说话人标签因此使用了由额外的谱聚类提供的说话人特征。同时我们还提供了在评估和测试集上使用真实说话人档案的结果以显示说话人档案准确性的影响。
![baseline result](images/baseline_result.png)

View File

@ -1,5 +1,6 @@
# 简介
## 竞赛介绍
语音识别Automatic Speech Recognition、说话人日志Speaker Diarization等语音处理技术的最新发展激发了众多智能语音的广泛应用。然而会议场景由于其复杂的声学条件和不同的讲话风格包括重叠的讲话、不同数量的发言者、大会议室的远场信号以及环境噪声和混响仍然属于一项极具挑战性的任务。
为了推动会议场景语音识别的发展,已经有很多相关的挑战赛,如 Rich Transcription evaluation 和 CHIMEComputational Hearing in Multisource Environments 挑战赛。最新的CHIME挑战赛关注于远距离自动语音识别和开发能在各种不同拓扑结构的阵列和应用场景中通用的系统。然而不同语言之间的差异限制了非英语会议转录的进展。MISPMultimodal Information Based Speech Processing和M2MeTMulti-Channel Multi-Party Meeting Transcription挑战赛为推动普通话会议场景语音识别做出了贡献。MISP挑战赛侧重于用视听多模态的方法解决日常家庭环境中的远距离多麦克风信号处理问题而M2MeT挑战则侧重于解决离线会议室中会议转录的语音重叠问题。
@ -14,18 +15,18 @@ IASSP2022 M2MeT挑战的侧重点是会议场景它包括两个赛道
## 时间安排(AOE时间)
- $ 2023.4.29: $ 开放注册
- $ 2023.5.8: $ 基线发布
- $ 2023.5.15: $ 注册截止
- $ 2023.6.9: $ 测试集数据发布
- $ 2023.6.13: $ 最终结果提交截止
- $ 2023.6.19: $ 评估结果和排名发布
- $ 2023.7.3: $ 论文提交截止
- $ 2023.7.10: $ 最终版论文提交截止
- $ 2023.12.12: $ ASRU Workshop & challenge session
- $ 2023.5.11: $ 基线发布
- $ 2023.5.22: $ 注册截止
- $ 2023.6.16: $ 测试集数据发布,排行榜开放
- $ 2023.6.20: $ 最终结果提交截止,排行榜关闭
- $ 2023.6.26: $ 评估结果和排名发布
- $ 2023.7.3: $ 论文提交截止通过ASRU2023官方投稿选择竞赛Session
- $ 2023.7.10: $ 最终版论文提交截止通过ASRU2023官方投稿选择竞赛Session
- $ 2023.12.12: $ ASRU Workshop & Challenge Session
## 竞赛报名
来自学术界和工业界的有意向参赛者均应在2023年5月15日及之前填写下方的谷歌表单
来自学术界和工业界的有意向参赛者均应在2023年5月22日及之前填写下方的谷歌表单。同时欢迎广大参赛者加入[官方交流微信群](https://alibaba-damo-academy.github.io/FunASR/m2met2_cn/%E8%81%94%E7%B3%BB%E6%96%B9%E5%BC%8F.html)交流并及时获取竞赛最新消息
[M2MET2.0报名](https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link)