Add Speech Recognition Task (Wav2Vec) #586

SeanNaren · 2021-07-14T11:32:53Z

What does this PR do?

Adds a speech recognition task based on the HF port of Wav2Vec.

Before submitting

Was this discussed/approved via a Github issue? (no need for typos and docs improvements)
Did you read the contributor guideline, Pull Request section?
Did you make sure your PR does only one thing, instead of bundling different changes together?
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests? [not needed for typos/docs]
Did you verify new and existing tests pass locally with your changes?
If you made a notable change (that affects users), did you update the CHANGELOG?

PR review

Is this pull request ready for review? (if not, please submit in draft mode)

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

pep8speaks · 2021-07-14T11:32:57Z

Hello @SeanNaren! Thanks for updating this PR.

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2021-07-19 17:52:10 UTC

for more information, see https://pre-commit.ci

codecov · 2021-07-14T11:36:33Z

Codecov Report

Merging #586 (bfe8ea6) into master (ea4604f) will decrease coverage by 0.06%.
The diff coverage is 85.64%.

@@            Coverage Diff             @@
##           master     #586      +/-   ##
==========================================
- Coverage   90.04%   89.98%   -0.07%     
==========================================
  Files         144      149       +5     
  Lines        8251     8457     +206     
==========================================
+ Hits         7430     7610     +180     
- Misses        821      847      +26

Flag	Coverage Δ
gpu	`?`
pytest	`?`
unittests	`89.98% <85.64%> (-0.07%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
flash/audio/speech_recognition/data.py	`74.13% <74.13%> (ø)`
flash/audio/speech_recognition/model.py	`97.14% <97.14%> (ø)`
flash/audio/__init__.py	`100.00% <100.00%> (ø)`
flash/audio/speech_recognition/__init__.py	`100.00% <100.00%> (ø)`
flash/audio/speech_recognition/backbone.py	`100.00% <100.00%> (ø)`
flash/audio/speech_recognition/collate.py	`100.00% <100.00%> (ø)`
flash/core/data/batch.py	`96.51% <100.00%> (+0.06%)`	⬆️
flash/core/data/process.py	`86.86% <100.00%> (+0.28%)`	⬆️
flash/core/utilities/imports.py	`91.50% <100.00%> (+0.33%)`	⬆️
flash/core/serve/flash_components.py	`90.74% <0.00%> (ø)`
... and 8 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ea4604f...bfe8ea6. Read the comment docs.

for more information, see https://pre-commit.ci

tchaton

Good progress !

flash/audio/speech_recognition/data.py

flash/audio/speech_recognition/model.py

flash/audio/speech_recognition/data.py

ethanwharris

Awesome, LGTM 😃

tchaton

LGTM !

tchaton · 2021-07-19T18:42:01Z

flash/audio/speech_recognition/model.py

+        # set os environ variable for multiprocesses
+        os.environ["PYTHONWARNINGS"] = "ignore"
+
+        model = self.backbones.get(backbone


Weird formatting.

Base files for wav2vec integration

dd92d79

SeanNaren added enhancement New feature or request task flash Task Task-a-thon labels Jul 14, 2021

SeanNaren self-assigned this Jul 14, 2021

Format code with autopep8

2a43fe7

SeanNaren changed the title ~~New Speech Recognition Task (Wav2Vec)~~ Add Speech Recognition Task (Wav2Vec) Jul 14, 2021

[pre-commit.ci] auto fixes from pre-commit.com hooks

6a39b34

for more information, see https://pre-commit.ci

SeanNaren and others added 3 commits July 14, 2021 17:39

Closer to working

1b48bc1

Format code with autopep8

c87dcc2

[pre-commit.ci] auto fixes from pre-commit.com hooks

091da56

for more information, see https://pre-commit.ci

tchaton reviewed Jul 15, 2021

View reviewed changes

SeanNaren added 10 commits July 15, 2021 11:40

Refactors

2690e9c

Refactors

1531560

Cleanups

e8664d6

Refactor to allow files

6d0f1c3

Get predictions working

a9735b2

Add licence

0901d12

Merge branch 'master' into feat/speech_recognition

bce0e10

Fix loads

1f18f05

Add check

71cb06d

Fix imports

50642f5

SeanNaren commented Jul 15, 2021

View reviewed changes

flash/audio/speech_recognition/data.py Outdated Show resolved Hide resolved

SeanNaren added 5 commits July 16, 2021 10:34

Cleanups

d271951

Add backbone API

956ac8e

Cleanups

6b132f2

Fix

3db4dad

Add tests

c54acf1

SeanNaren requested review from edenlightning, ethanwharris, justusschock and kaushikb11 as code owners July 16, 2021 21:09

SeanNaren and others added 11 commits July 18, 2021 09:02

Fix path

14795f3

Swap to audio available

1b8eb08

Small fix

ab3a437

Some fixes

13eb84f

Small fix

af9e0c1

Small fix

4bbc31c

Fix

4336f61

Updates

51c640a

Fix docs

801b752

Remove duplicate

683f671

Add check for audio

8590052

SeanNaren enabled auto-merge (squash) July 19, 2021 11:17

Updates

1c98625

ethanwharris approved these changes Jul 19, 2021

View reviewed changes

ethanwharris added 9 commits July 19, 2021 17:59

Update CHANGELOG.md

a208e17

Updates

d9d1a0a

Update docs

9259f44

Update docs

70607a2

Update docs

4e6bce7

Add example to CI

2d08f21

Fix some tests

0052f1f

Fix some broken tests

0c87f04

Fixes

bfe8ea6

tchaton approved these changes Jul 19, 2021

View reviewed changes

SeanNaren merged commit b8b4ebc into master Jul 19, 2021

SeanNaren deleted the feat/speech_recognition branch July 19, 2021 18:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Speech Recognition Task (Wav2Vec) #586

Add Speech Recognition Task (Wav2Vec) #586

SeanNaren commented Jul 14, 2021 •

edited

Loading

pep8speaks commented Jul 14, 2021 •

edited

Loading

codecov bot commented Jul 14, 2021 •

edited

Loading

tchaton left a comment

ethanwharris left a comment

tchaton left a comment

tchaton Jul 19, 2021

Add Speech Recognition Task (Wav2Vec) #586

Add Speech Recognition Task (Wav2Vec) #586

Conversation

SeanNaren commented Jul 14, 2021 • edited Loading

What does this PR do?

Before submitting

PR review

Did you have fun?

pep8speaks commented Jul 14, 2021 • edited Loading

Comment last updated at 2021-07-19 17:52:10 UTC

codecov bot commented Jul 14, 2021 • edited Loading

Codecov Report

tchaton left a comment

Choose a reason for hiding this comment

ethanwharris left a comment

Choose a reason for hiding this comment

tchaton left a comment

Choose a reason for hiding this comment

tchaton Jul 19, 2021

Choose a reason for hiding this comment

SeanNaren commented Jul 14, 2021 •

edited

Loading

pep8speaks commented Jul 14, 2021 •

edited

Loading

codecov bot commented Jul 14, 2021 •

edited

Loading