update wakenet README

This commit is contained in:
Sun Xiang Yu 2019-09-24 19:11:26 +08:00
parent 99bc33bd30
commit b5827516e8
3 changed files with 20 additions and 9 deletions

View File

@ -1,5 +1,9 @@
# Change log for esp-sr
## 0.3.0(dev)
add wakenet6
support cmake
## 0.2.0
add acoustic algorithm, include AEC, AGC, VAD ,NS
add wakenet5X2 and wakenet5X3

View File

@ -29,11 +29,11 @@ Please see the flow diagram of WakeNet below:
- How to select the WakeNet model
Go to `make menuconfig`, navigate to `Component config` >> `ESP Speech Recognition` >> `Wake word engine`. See below:
<center>
<img src="../img/model_sel.png" width = "500" />
</center>
1. Go to `make menuconfig`, navigate to `Component config` >> `ESP Speech Recognition` >> `Wake word engine`. See below:
<center> <img src="../img/model_sel.png" width = "500" /> </center>
2. WakeNet6 is divided into two tasks task1 is used to calculate speech recognition the task2 is used to calculate neural network model. The ESP32 core used to calculate task2 can be selected by `Component config` >> `ESP Speech Recognition` >> `ESP32 core to run WakeNet6`
- How to select the wake words
@ -74,6 +74,9 @@ Please see the flow diagram of WakeNet below:
|Quantised WakeNet5|41 K|15 KB|5.5 ms|30 ms|
|Quantised WakeNet5X2|165 K|20 KB|10.5 ms|30 ms|
|Quantised WakeNet5X3|371 K|24 KB|18 ms|30 ms|
|Quantised WakeNet6|378 K|45 KB|4ms(task1) + 25 ms(task2)|30 ms|
**Note**: Quantised WakeNet6 is split into two tasks, task1 is used to calculate speech features and task2 is used to calculate neural network model.
### 2. Performance

View File

@ -19,7 +19,7 @@ WakeNet的流程图如下
- wakeNet3和wakeNet4基于[CRNN](https://arxiv.org/abs/1703.05390)结构。
- WakeNet5(WakeNet5X2,WakeNetX3) 和 WakeNet6 基于 the [Dilated Convolution](https://arxiv.org/pdf/1609.03499.pdf) 结构。
注意WakeNet5,WakeNet5X2 和 WakeNet5X3 的网络结构一致,但是 WakeNetX2 和 WakeNetX3 的参数比 WakeNet5 要多。请参考 [性能测试](#性能测试) 来获取更多细节。
注意WakeNet5,WakeNet5X2 和 WakeNet5X3 的网络结构一致,但是 WakeNet5X2 和 WakeNet5X3 的参数比 WakeNet5 要多。请参考 [性能测试](#性能测试) 来获取更多细节。
- keyword trigger method
对连续的音频流为准确判断关键词的触发我们通过计算若干帧内识别结果的平均值M来判断触发。当M大于大于指定阈值发出触发的命令。
@ -28,14 +28,16 @@ WakeNet的流程图如下
## API introduction
- WakeNet模型选择
使用make menuconfig选择Component config >> ESP Speech commands >> Keyword spotting model,如下图
1. 使用make menuconfig选择Component config >> ESP Speech Recognition >> Wake Word Engine,如下图
<center>
<img src="../img/model_sel.png" width = "500" />
</center>
</center>
2. 不同与WakeNet5WakeNet6被拆分成两个tasktask1计算speech featurestask2计算neural network model。task2使用的ESP32核心可以通过Component config >> ESP Speech Recognition >> ESP32 core to run WakeNet6选择默认使用core1。
- 唤醒词选择
使用make menuconfig选择Component config >> ESP Speech commands >> Wake word list进行选择如下图
使用make menuconfig选择Component config >> ESP Speech Recognition >> Wake word list进行选择如下图
<center>
<img src="../img/word_sel.png" width = "500" />
</center>
@ -70,7 +72,9 @@ WakeNet的流程图如下
|Quantised WakeNet5|41 K|15 KB|5.5 ms|30 ms|
|Quantised WakeNet5X2|165 K|20 KB|10.5 ms|30 ms|
|Quantised WakeNet5X3|371 K|24 KB|18 ms|30 ms|
|Quantised WakeNet6|378 K|45 KB|4ms(task1)+25ms(task2)|30 ms|
**注**Quantised WakeNet6被拆分成两个task其中task1用于计算speech features另一个task2用于计算神经网络。
### 2.识别性能
|距离|安静环境|平稳噪声(SNR=0~10dB)|语音噪声(SNR=0~10dB)|AEC打断唤醒(-5~-15dB)|