[KWS]Add kws example on HeySnips dataset. #1558

KPatr1ck · 2022-03-11T09:43:38Z

PR types

New features

PR changes

Models

Describe

Add mdtc model for kws.

zh794390558

Which paper does this correspond to?

zh794390558 · 2022-04-19T11:46:53Z

examples/hey_snips/kws0/conf/mdtc.yaml

@@ -0,0 +1,39 @@
+data:


Please follow the other egs for the config file and flatten it. Nested (hierarchical) configs are no longer recommended.

zh794390558 · 2022-04-19T11:48:09Z

examples/hey_snips/kws0/local/score.sh

@@ -0,0 +1,5 @@
+#!/bin/bash
+


Add a usage message and check the inputs.

This script is not meant to be run on its own; it is normally called from run.sh, so I will add the checks in run.sh.

zh794390558 · 2022-04-19T11:50:02Z

paddleaudio/paddleaudio/datasets/dataset.py

+        if self.feat_type in ['kaldi_fbank', 'kaldi_mfcc']:
+            waveform = paddle.to_tensor(waveform).unsqueeze(0)  # (C, T)
+            record['feat'] = feat_func(
+                waveform=waveform, sr=self.sample_rate, **self.feat_config)


The required parameters should be spelled out explicitly; using ** is not recommended.

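The feature API takes too many parameters, and most of them are left at their defaults. I will add a note telling users to configure them according to the kwargs of the API.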
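One possible middle ground between the two positions above is to keep the `**` unpacking but validate the configured keys against the feature function's signature first. The sketch below is only an illustration: `fake_fbank` is a hypothetical stand-in, not the paddleaudio API, and `check_feat_config` is a helper name invented here.

```python
import inspect


def fake_fbank(waveform, sr=16000, n_mels=80, frame_length=25, frame_shift=10):
    """Hypothetical stand-in for a feature function (NOT the paddleaudio API)."""
    return {"sr": sr, "n_mels": n_mels, "num_samples": len(waveform)}


def check_feat_config(feat_func, feat_config, reserved=("waveform", "sr")):
    """Reject config keys the feature function does not accept.

    `reserved` lists arguments the caller passes itself, so they must not
    also appear in the config. This assumes `feat_func` has no **kwargs of
    its own; if it did, unknown keys could not be detected this way.
    """
    params = inspect.signature(feat_func).parameters
    unknown = [k for k in feat_config if k not in params]
    if unknown:
        raise ValueError(f"unknown feature options: {unknown}")
    clash = [k for k in feat_config if k in reserved]
    if clash:
        raise ValueError(f"options set by the dataset itself: {clash}")
    return dict(feat_config)


# Usage: validate once, then unpack as before.
config = check_feat_config(fake_fbank, {"n_mels": 40})
feat = fake_fbank([0.0] * 160, sr=16000, **config)
```

A misspelled key such as `n_mel` then fails at dataset construction with a readable message instead of surfacing later as a `TypeError` inside the feature call.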

zh794390558 · 2022-04-19T11:55:05Z

paddlespeech/kws/exps/mdtc/compute_det.py

+
+# yapf: disable
+parser = argparse.ArgumentParser(__doc__)
+parser.add_argument("--cfg_path", type=str, required=True)


Passing parameters in through a config file is not recommended; they should be specified individually.

paddlespeech/kws/exps/mdtc/compute_det.py

paddlespeech/kws/exps/mdtc/train.py

paddlespeech/kws/models/loss.py

zh794390558 · 2022-04-19T12:13:28Z

paddlespeech/kws/models/loss.py

+
+    mask = padding_mask(lengths)
+    num_utts = logits.shape[0]
+    num_keywords = logits.shape[2]


num_keywords looks odd here.
B, T, D

zh794390558 · 2022-04-19T12:15:47Z

paddlespeech/kws/models/loss.py

+    num_keywords = logits.shape[2]
+
+    loss = 0.0
+    for i in range(num_utts):


Why not call CE directly here?

CrossEntropy cannot be used directly: the max over the valid audio length has to be taken as the score first, and CE is computed on that.
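The reply above can be illustrated with a small NumPy sketch. It is not the code from this PR: it assumes the model already outputs per-frame sigmoid probabilities of shape (B, T, K), that `lengths` holds the number of valid frames per utterance (at least 1), and that a target of -1 marks an utterance with no keyword. The function name is invented for the example.

```python
import numpy as np


def max_pooling_loss_sketch(probs, target, lengths, eps=1e-8):
    """Binary CE on the per-utterance max score, ignoring padded frames.

    probs:   (B, T, K) per-frame keyword probabilities (assumed post-sigmoid)
    target:  (B,) keyword index per utterance, -1 for "no keyword" (assumption)
    lengths: (B,) number of valid frames per utterance, each >= 1
    """
    probs = np.asarray(probs, dtype=np.float64)
    num_utts, _, num_keywords = probs.shape
    loss = 0.0
    for i in range(num_utts):
        valid = probs[i, : lengths[i], :]  # drop padded frames
        for j in range(num_keywords):
            score = valid[:, j].max()  # max pooling over time
            if target[i] == j:
                # Positive: the best frame should fire, CE = -log(p)
                loss += -np.log(np.clip(score, eps, 1.0))
            else:
                # Negative: even the best frame should stay low, CE = -log(1 - p)
                loss += -np.log(np.clip(1.0 - score, eps, 1.0))
    return loss / num_utts


# One utterance, 3 frames (the last one is padding), one keyword.
probs = [[[0.1], [0.9], [0.99]]]
positive = max_pooling_loss_sketch(probs, target=[0], lengths=[2])
negative = max_pooling_loss_sketch(probs, target=[-1], lengths=[2])
```

A frame-level cross entropy would need a label for every frame; pooling first means only the utterance-level label is required, which is why a plain CE call on the (B, T, K) output does not fit.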

mergify · 2022-04-22T12:21:46Z

This pull request is now in conflict :(

KPatr1ck · 2022-04-24T14:19:12Z

Which paper does this correspond to?

It is given in the model section of the README: https://arxiv.org/pdf/2102.13552.pdf

zh794390558 · 2022-04-25T03:22:17Z

paddlespeech/kws/exps/mdtc/compute_det.py

+                while i < len(score_list):
+                    if score_list[i] >= threshold:
+                        num_false_alarm += 1
+                        i += args.window_shift


50*10=500ms
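The arithmetic in this comment reads as 50 frames times a 10 ms frame shift, so each detection suppresses further counting for 500 ms. A minimal sketch of that counting loop follows. The `else` branch that advances by one frame is an assumption, since the quoted hunk only shows the detection branch, and the 10 ms frame shift is taken from the comment rather than from the code.

```python
def count_false_alarms(score_list, threshold, window_shift=50):
    """Count detections on a negative utterance, skipping ahead after each hit.

    With a 10 ms frame shift (assumed), window_shift=50 means that a run of
    high scores is counted at most once every 500 ms.
    """
    num_false_alarm = 0
    i = 0
    while i < len(score_list):
        if score_list[i] >= threshold:
            num_false_alarm += 1
            i += window_shift  # jump over the rest of this window
        else:
            i += 1  # assumed: otherwise move to the next frame
    return num_false_alarm


# 120 consecutive frames above threshold are hit at frames 0, 50 and 100.
alarms = count_false_alarms([0.9] * 120, threshold=0.5, window_shift=50)
suppressed_ms = 50 * 10  # frames * ms per frame
```

Without the skip, every frame above the threshold would count as its own false alarm, inflating the false alarm rate for a single sustained trigger.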

zh794390558 · 2022-04-25T03:26:27Z

paddlespeech/kws/models/loss.py

+        for j in range(num_keywords):
+            # Add entropy loss CE = -(t * log(p) + (1 - t) * log(1 - p))
+            if target[i] == j:
+                # For the keyword, do max-polling


This looks like max pooling over T followed by cross entropy.

zh794390558 · 2022-04-25T03:27:37Z

paddlespeech/kws/models/mdtc.py

+        super(KWSModel, self).__init__()
+        self.backbone = backbone
+        self.linear = nn.Linear(self.backbone.hidden_dim, num_keywords)
+        self.activation = nn.Sigmoid()


zh794390558 · 2022-04-25T03:32:41Z

paddlespeech/kws/models/mdtc.py

+        self.kernel_size = kernel_size
+        self.dilation = dilation
+        self.causal = causal
+        self.receptive_fields = dilation * (kernel_size - 1)


kernel_size - 1 if no_dilation else dilation * (kernel_size -1)
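The two forms in this comment give the same number when an undilated convolution is represented as `dilation == 1`, which the sketch below checks. It also shows how these per-layer values add up for a stack of stride-1 convolutions. The function names are invented for the example, and reading `receptive_fields` as "extra frames of context per layer" is an interpretation of the quoted line, not something the PR states.

```python
def extra_context(kernel_size, dilation=1):
    """Frames of context one stride-1 conv layer adds beyond the current frame."""
    return dilation * (kernel_size - 1)


def stack_receptive_field(kernel_size, dilations):
    """Total receptive field of stacked stride-1 convs: 1 + sum of extra context."""
    return 1 + sum(extra_context(kernel_size, d) for d in dilations)


# dilation == 1 reduces to the undilated case, kernel_size - 1.
plain = extra_context(kernel_size=5, dilation=1)    # 4
dilated = extra_context(kernel_size=5, dilation=4)  # 16

# Hypothetical stack with dilations 1, 2, 4, 8 and kernel size 5.
total = stack_receptive_field(5, [1, 2, 4, 8])      # 1 + 4 * (1 + 2 + 4 + 8) = 61
```

For a causal layer this extra context is typically the amount of left padding needed to keep the output length equal to the input length.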

zh794390558

LGTM

KPatr1ck added this to the r0.2.0 milestone Mar 11, 2022

zh794390558 reviewed Mar 11, 2022

zh794390558 marked this pull request as draft March 16, 2022 07:21

zh794390558 modified the milestones: r0.2.0, r1.0.0 Apr 1, 2022

KPatr1ck force-pushed the kws branch from 2889ca4 to 8625f12 Compare April 8, 2022 13:38

mergify bot added Audio Example README labels Apr 8, 2022

KPatr1ck force-pushed the kws branch 2 times, most recently from a35e99a to fe7c4e5 Compare April 19, 2022 09:46

KPatr1ck marked this pull request as ready for review April 19, 2022 09:46

zh794390558 reviewed Apr 19, 2022

KPatr1ck changed the title ~~[KWS]Add mdtc model.~~ [KWS]Add kws example on HeySnips dataset. Apr 19, 2022

PaddlePaddle deleted a comment from KPatr1ck Apr 20, 2022

mergify bot added the conflicts label Apr 22, 2022

KPatr1ck added 6 commits April 24, 2022 23:52

Add mdtc model.

521e222

Add KWS example.

e01abc5

Add KWS example.

b60b1da

Add KWS example.

f9761d5

Add KWS example.

43659b9

Add KWS example.

caa8eb4

KPatr1ck force-pushed the kws branch from f7a9d70 to caa8eb4 Compare April 24, 2022 15:53

mergify bot removed the conflicts label Apr 24, 2022

zh794390558 reviewed Apr 25, 2022

zh794390558 approved these changes Apr 25, 2022

zh794390558 merged commit 962a278 into PaddlePaddle:develop Apr 25, 2022
