mirror of
https://github.com/espressif/esp-sr.git
synced 2025-09-15 15:28:44 +08:00
update wakenet README
This commit is contained in:
parent
99bc33bd30
commit
b5827516e8
@ -1,5 +1,9 @@
|
||||
# Change log for esp-sr
|
||||
|
||||
## 0.3.0(dev)
|
||||
add wakenet6
|
||||
support cmake
|
||||
|
||||
## 0.2.0
|
||||
add acoustic algorithm, include AEC, AGC, VAD ,NS
|
||||
add wakenet5X2 and wakenet5X3
|
||||
|
||||
@ -29,11 +29,11 @@ Please see the flow diagram of WakeNet below:
|
||||
|
||||
- How to select the WakeNet model
|
||||
|
||||
Go to `make menuconfig`, navigate to `Component config` >> `ESP Speech Recognition` >> `Wake word engine`. See below:
|
||||
|
||||
<center>
|
||||
<img src="../img/model_sel.png" width = "500" />
|
||||
</center>
|
||||
1. Go to `make menuconfig`, navigate to `Component config` >> `ESP Speech Recognition` >> `Wake word engine`. See below:
|
||||
<center> <img src="../img/model_sel.png" width = "500" /> </center>
|
||||
2. WakeNet6 is divided into two tasks, task1 is used to calculate speech recognition, the task2 is used to calculate neural network model. The ESP32 core used to calculate task2 can be selected by `Component config` >> `ESP Speech Recognition` >> `ESP32 core to run WakeNet6`
|
||||
|
||||
|
||||
|
||||
|
||||
- How to select the wake words
|
||||
@ -74,6 +74,9 @@ Please see the flow diagram of WakeNet below:
|
||||
|Quantised WakeNet5|41 K|15 KB|5.5 ms|30 ms|
|
||||
|Quantised WakeNet5X2|165 K|20 KB|10.5 ms|30 ms|
|
||||
|Quantised WakeNet5X3|371 K|24 KB|18 ms|30 ms|
|
||||
|Quantised WakeNet6|378 K|45 KB|4ms(task1) + 25 ms(task2)|30 ms|
|
||||
|
||||
**Note**: Quantised WakeNet6 is split into two tasks, task1 is used to calculate speech features and task2 is used to calculate neural network model.
|
||||
|
||||
### 2. Performance
|
||||
|
||||
|
||||
@ -19,7 +19,7 @@ WakeNet的流程图如下:
|
||||
- wakeNet3和wakeNet4基于[CRNN](https://arxiv.org/abs/1703.05390)结构。
|
||||
- WakeNet5(WakeNet5X2,WakeNetX3) 和 WakeNet6 基于 the [Dilated Convolution](https://arxiv.org/pdf/1609.03499.pdf) 结构。
|
||||
|
||||
注意,WakeNet5,WakeNet5X2 和 WakeNet5X3 的网络结构一致,但是 WakeNetX2 和 WakeNetX3 的参数比 WakeNet5 要多。请参考 [性能测试](#性能测试) 来获取更多细节。
|
||||
注意,WakeNet5,WakeNet5X2 和 WakeNet5X3 的网络结构一致,但是 WakeNet5X2 和 WakeNet5X3 的参数比 WakeNet5 要多。请参考 [性能测试](#性能测试) 来获取更多细节。
|
||||
|
||||
- keyword trigger method:
|
||||
对连续的音频流,为准确判断关键词的触发,我们通过计算若干帧内识别结果的平均值M,来判断触发。当M大于大于指定阈值,发出触发的命令。
|
||||
@ -28,14 +28,16 @@ WakeNet的流程图如下:
|
||||
## API introduction
|
||||
|
||||
- WakeNet模型选择
|
||||
使用make menuconfig,选择Component config >> ESP Speech commands >> Keyword spotting model,如下图
|
||||
1. 使用make menuconfig,选择Component config >> ESP Speech Recognition >> Wake Word Engine,如下图
|
||||
|
||||
<center>
|
||||
<img src="../img/model_sel.png" width = "500" />
|
||||
</center>
|
||||
</center>
|
||||
|
||||
2. 不同与WakeNet5,WakeNet6被拆分成两个task,task1计算speech features,task2计算neural network model。task2使用的ESP32核心,可以通过Component config >> ESP Speech Recognition >> ESP32 core to run WakeNet6选择,默认使用core1。
|
||||
|
||||
- 唤醒词选择
|
||||
使用make menuconfig,选择Component config >> ESP Speech commands >> Wake word list进行选择,如下图
|
||||
使用make menuconfig,选择Component config >> ESP Speech Recognition >> Wake word list进行选择,如下图
|
||||
<center>
|
||||
<img src="../img/word_sel.png" width = "500" />
|
||||
</center>
|
||||
@ -70,7 +72,9 @@ WakeNet的流程图如下:
|
||||
|Quantised WakeNet5|41 K|15 KB|5.5 ms|30 ms|
|
||||
|Quantised WakeNet5X2|165 K|20 KB|10.5 ms|30 ms|
|
||||
|Quantised WakeNet5X3|371 K|24 KB|18 ms|30 ms|
|
||||
|Quantised WakeNet6|378 K|45 KB|4ms(task1)+25ms(task2)|30 ms|
|
||||
|
||||
**注**:Quantised WakeNet6被拆分成两个task,其中task1用于计算speech features,另一个task2用于计算神经网络。
|
||||
|
||||
### 2.识别性能
|
||||
|距离|安静环境|平稳噪声(SNR=0~10dB)|语音噪声(SNR=0~10dB)|AEC打断唤醒(-5~-15dB)|
|
||||
|
||||
Loading…
Reference in New Issue
Block a user