FunASR/docs/m2met2_cn/_build/html/简介.html
2023-04-26 16:39:22 +08:00

204 lines
12 KiB
HTML
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

<!DOCTYPE html>
<html lang="zh-CN">
<head>
<meta charset="utf-8" />
<meta name="viewport" content="width=device-width, initial-scale=1.0" /><meta name="generator" content="Docutils 0.18.1: http://docutils.sourceforge.net/" />
<!-- Licensed under the Apache 2.0 License -->
<link rel="stylesheet" type="text/css" href="_static/fonts/open-sans/stylesheet.css" />
<!-- Licensed under the SIL Open Font License -->
<link rel="stylesheet" type="text/css" href="_static/fonts/source-serif-pro/source-serif-pro.css" />
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap.min.css" />
<link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>简介 &#8212; m2met2 文档</title>
<link rel="stylesheet" type="text/css" href="_static/pygments.css" />
<link rel="stylesheet" type="text/css" href="_static/guzzle.css" />
<script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script>
<script src="_static/jquery.js"></script>
<script src="_static/underscore.js"></script>
<script src="_static/_sphinx_javascript_frameworks_compat.js"></script>
<script src="_static/doctools.js"></script>
<script src="_static/sphinx_highlight.js"></script>
<script src="_static/translations.js"></script>
<script async="async" src="https://cdn.jsdelivr.net/npm/mathjax@3/es5/tex-mml-chtml.js"></script>
<link rel="index" title="索引" href="genindex.html" />
<link rel="search" title="搜索" href="search.html" />
<link rel="next" title="数据集" href="%E6%95%B0%E6%8D%AE%E9%9B%86.html" />
<link rel="prev" title="ASRU 2023 多通道多方会议转录挑战 2.0" href="index.html" />
</head><body>
<div class="related" role="navigation" aria-label="related navigation">
<h3>导航</h3>
<ul>
<li class="right" style="margin-right: 10px">
<a href="genindex.html" title="总索引"
accesskey="I">索引</a></li>
<li class="right" >
<a href="%E6%95%B0%E6%8D%AE%E9%9B%86.html" title="数据集"
accesskey="N">下一页</a> |</li>
<li class="right" >
<a href="index.html" title="ASRU 2023 多通道多方会议转录挑战 2.0"
accesskey="P">上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">简介</a></li>
</ul>
</div>
<div class="container-wrapper">
<div id="mobile-toggle">
<a href="#"><span class="glyphicon glyphicon-align-justify" aria-hidden="true"></span></a>
</div>
<div id="left-column">
<div class="sphinxsidebar"><a href="
index.html" class="text-logo">m2met2 文档</a>
<div class="sidebar-block">
<div class="sidebar-wrapper">
<div id="main-search">
<form class="form-inline" action="search.html" method="GET" role="form">
<div class="input-group">
<input name="q" type="text" class="form-control" placeholder="Search...">
</div>
<input type="hidden" name="check_keywords" value="yes" />
<input type="hidden" name="area" value="default" />
</form>
</div>
</div>
</div>
<div class="sidebar-block">
<div class="sidebar-toc">
<p class="caption" role="heading"><span class="caption-text">目录:</span></p>
<ul class="current">
<li class="toctree-l1 current"><a class="current reference internal" href="#">简介</a><ul>
<li class="toctree-l2"><a class="reference internal" href="#id2">竞赛介绍</a></li>
<li class="toctree-l2"><a class="reference internal" href="#aoe">时间安排(AOE时间)</a></li>
<li class="toctree-l2"><a class="reference internal" href="#id3">竞赛报名</a></li>
</ul>
</li>
<li class="toctree-l1"><a class="reference internal" href="%E6%95%B0%E6%8D%AE%E9%9B%86.html">数据集</a><ul>
<li class="toctree-l2"><a class="reference internal" href="%E6%95%B0%E6%8D%AE%E9%9B%86.html#id2">数据集概述</a></li>
<li class="toctree-l2"><a class="reference internal" href="%E6%95%B0%E6%8D%AE%E9%9B%86.html#alimeeting">Alimeeting数据集介绍</a></li>
<li class="toctree-l2"><a class="reference internal" href="%E6%95%B0%E6%8D%AE%E9%9B%86.html#id3">获取数据</a></li>
</ul>
</li>
<li class="toctree-l1"><a class="reference internal" href="%E8%B5%9B%E9%81%93%E8%AE%BE%E7%BD%AE%E4%B8%8E%E8%AF%84%E4%BC%B0.html">赛道设置与评估</a><ul>
<li class="toctree-l2"><a class="reference internal" href="%E8%B5%9B%E9%81%93%E8%AE%BE%E7%BD%AE%E4%B8%8E%E8%AF%84%E4%BC%B0.html#id2">说话人相关的语音识别</a></li>
<li class="toctree-l2"><a class="reference internal" href="%E8%B5%9B%E9%81%93%E8%AE%BE%E7%BD%AE%E4%B8%8E%E8%AF%84%E4%BC%B0.html#id3">评估方法</a></li>
<li class="toctree-l2"><a class="reference internal" href="%E8%B5%9B%E9%81%93%E8%AE%BE%E7%BD%AE%E4%B8%8E%E8%AF%84%E4%BC%B0.html#id4">子赛道设置</a></li>
</ul>
</li>
<li class="toctree-l1"><a class="reference internal" href="%E5%9F%BA%E7%BA%BF.html">基线</a><ul>
<li class="toctree-l2"><a class="reference internal" href="%E5%9F%BA%E7%BA%BF.html#id2">基线概述</a></li>
<li class="toctree-l2"><a class="reference internal" href="%E5%9F%BA%E7%BA%BF.html#id3">快速开始</a></li>
<li class="toctree-l2"><a class="reference internal" href="%E5%9F%BA%E7%BA%BF.html#id4">基线结果</a></li>
</ul>
</li>
<li class="toctree-l1"><a class="reference internal" href="%E8%A7%84%E5%88%99.html">竞赛规则</a></li>
<li class="toctree-l1"><a class="reference internal" href="%E7%BB%84%E5%A7%94%E4%BC%9A.html">组委会</a></li>
<li class="toctree-l1"><a class="reference internal" href="%E8%81%94%E7%B3%BB%E6%96%B9%E5%BC%8F.html">联系方式</a></li>
</ul>
</div>
</div>
</div>
</div>
<div id="right-column">
<div role="navigation" aria-label="breadcrumbs navigation">
<ol class="breadcrumb">
<li><a href="index.html">Docs</a></li>
<li>简介</li>
</ol>
</div>
<div class="document clearer body">
<section id="id1">
<h1>简介<a class="headerlink" href="#id1" title="此标题的永久链接"></a></h1>
<section id="id2">
<h2>竞赛介绍<a class="headerlink" href="#id2" title="此标题的永久链接"></a></h2>
<p>语音识别Automatic Speech Recognition、说话人日志Speaker Diarization等语音处理技术的最新发展激发了众多智能语音的广泛应用。然而会议场景由于其复杂的声学条件和不同的讲话风格包括重叠的讲话、不同数量的发言者、大会议室的远场信号以及环境噪声和混响仍然属于一项极具挑战性的任务。</p>
<p>为了推动会议场景语音识别的发展,已经有很多相关的挑战赛,如 Rich Transcription evaluation 和 CHIMEComputational Hearing in Multisource Environments 挑战赛。最新的CHIME挑战赛关注于远距离自动语音识别和开发能在各种不同拓扑结构的阵列和应用场景中通用的系统。然而不同语言之间的差异限制了非英语会议转录的进展。MISPMultimodal Information Based Speech Processing和M2MeTMulti-Channel Multi-Party Meeting Transcription挑战赛为推动普通话会议场景语音识别做出了贡献。MISP挑战赛侧重于用视听多模态的方法解决日常家庭环境中的远距离多麦克风信号处理问题而M2MeT挑战则侧重于解决离线会议室中会议转录的语音重叠问题。</p>
<p>IASSP2022 M2MeT挑战的侧重点是会议场景它包括两个赛道说话人日记和多说话人自动语音识别。前者涉及识别“谁在什么时候说了话”而后者旨在同时识别来自多个说话人的语音语音重叠和各种噪声带来了巨大的技术困难。</p>
<p>在上一届M2MET成功举办的基础上我们将在ASRU2023上继续举办M2MET2.0挑战赛。在上一届M2MET挑战赛中评估指标是说话人无关的我们只能得到识别文本而不能确定相应的说话人。
为了解决这一局限性并将现在的多说话人语音识别系统推向实用化M2MET2.0挑战赛将在说话人相关的人物上评估并且同时设立限定数据与不限定数据两个子赛道。通过将语音归属于特定的说话人这项任务旨在提高多说话人ASR系统在真实世界环境中的准确性和适用性。
我们对数据集、规则、基线系统和评估方法进行了详细介绍以进一步促进多说话人语音识别领域研究的发展。此外我们将根据时间表发布一个全新的测试集包括大约10小时的音频。</p>
</section>
<section id="aoe">
<h2>时间安排(AOE时间)<a class="headerlink" href="#aoe" title="此标题的永久链接"></a></h2>
<ul class="simple">
<li><p><span class="math notranslate nohighlight">\( 2023.4.29: \)</span> 开放注册</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.5.8: \)</span> 基线发布</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.5.15: \)</span> 注册截止</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.6.9: \)</span> 测试集数据发布</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.6.13: \)</span> 最终结果提交截止</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.6.19: \)</span> 评估结果和排名发布</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.7.3: \)</span> 论文提交截止</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.7.10: \)</span> 最终版论文提交截止</p></li>
<li><p><span class="math notranslate nohighlight">\( 2023.12.12: \)</span> ASRU Workshop &amp; challenge session</p></li>
</ul>
</section>
<section id="id3">
<h2>竞赛报名<a class="headerlink" href="#id3" title="此标题的永久链接"></a></h2>
<p>来自学术界和工业界的有意向参赛者均应在2023年5月15日及之前填写下方的谷歌表单</p>
<p><a class="reference external" href="https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link">M2MET2.0报名</a></p>
<p>主办方将在3个工作日内通过电子邮件通知符合条件的参赛团队团队必须遵守将在挑战网站上发布的挑战规则。在排名发布之前每个参赛者必须提交一份系统描述文件详细说明使用的方法和模型。主办方将选择前三名纳入ASRU2023论文集。</p>
</section>
</section>
</div>
<div class="footer-relations">
<div class="pull-left">
<a class="btn btn-default" href="index.html" title="上一章 (use the left arrow)">ASRU 2023 多通道多方会议转录挑战 2.0</a>
</div>
<div class="pull-right">
<a class="btn btn-default" href="%E6%95%B0%E6%8D%AE%E9%9B%86.html" title="下一章 (use the right arrow)">数据集</a>
</div>
</div>
<div class="clearer"></div>
</div>
<div class="clearfix"></div>
</div>
<div class="related" role="navigation" aria-label="related navigation">
<h3>导航</h3>
<ul>
<li class="right" style="margin-right: 10px">
<a href="genindex.html" title="总索引"
>索引</a></li>
<li class="right" >
<a href="%E6%95%B0%E6%8D%AE%E9%9B%86.html" title="数据集"
>下一页</a> |</li>
<li class="right" >
<a href="index.html" title="ASRU 2023 多通道多方会议转录挑战 2.0"
>上一页</a> |</li>
<li class="nav-item nav-item-0"><a href="index.html">m2met2 文档</a> &#187;</li>
<li class="nav-item nav-item-this"><a href="">简介</a></li>
</ul>
</div>
<script type="text/javascript">
$("#mobile-toggle a").click(function () {
$("#left-column").toggle();
});
</script>
<script type="text/javascript" src="_static/js/bootstrap.js"></script>
<div class="footer">
&copy; Copyright 2023, Speech Lab, Alibaba Group; ASLP Group, Northwestern Polytechnical University. Created using <a href="http://sphinx.pocoo.org/">Sphinx</a>.
</div>
</body>
</html>