[ASR] Added a script for evaluating metrics for audio-to-audio #5971

anteju · 2023-02-08T23:42:50Z

Added a script for evaluating audio-to-audio metrics for a manifest file (audio_to_audio_eval.py)

Signed-off-by: Ante Jukić [email protected]

What does this PR do ?

This PR adds a script for evaluation of audio-to-audio metrics.
It is similar to speech_to_text_eval.py

This scripts depends on the process_audio.py script and inherits all its arguments.

Collection: ASR

Changelog

Added audio_to_audio_eval.py
Minor change in proces_audio
Added an option to get_full_path to specify data_dir directly instead of using dirname(manifest_file)

Usage

You can potentially add a usage example below

To score a dataset with a manifest file that contains the input audio which needs to be processed and target audio

python audio_to_audio_eval.py \
    model_path=null \
    pretrained_model=null \
    dataset_manifest=<Mandatory: path to a dataset manifest file> \
    output_dir=<Optional: Directory where processed audio will be saved> \
    processed_channel_selector=<Optional: list of channels to select from the processed audio file> \
    target_key=<Optional: key for the target audio in the dataset manifest. Default: target_audio_filepath> \
    target_channel_selector=<Optional: list of channels to select from the target audio file> \
    metrics=<Optional: list of metrics to evaluate. Defaults to [sdr,estoi]>
    batch_size=32 \
    amp=True

To score a manifest file which has been previously processed and contains both processed audio and target audio

python audio_to_audio_eval.py \
    dataset_manifest=<Mandatory: path to a dataset manifest file> \
    processed_key=<Optional: key for the target audio in the dataset manifest. Default: processed_audio_filepath>
    processed_channel_selector=<Optional: list of channels to select from the processed audio file> \
    target_key=<Optional: key for the target audio in the dataset manifest. Default: target_audio_filepath> \
    target_channel_selector=<Optional: list of channels to select from the target audio file> \
    metrics=<Optional: list of metrics to evaluate. Defaults to [sdr,estoi]>
    batch_size=32 \
    amp=True

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

New Feature
Bugfix
Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

Related to # (issue)

examples/asr/audio_to_audio/audio_to_audio_eval.py

…fest file (audio_to_audio_eval.py) Signed-off-by: Ante Jukić <[email protected]>

jbalam-nv

LGTM

…fest file (audio_to_audio_eval.py) (NVIDIA#5971) Signed-off-by: Ante Jukić <[email protected]>

…fest file (audio_to_audio_eval.py) (NVIDIA#5971) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: hsiehjackson <[email protected]>

github-actions bot added ASR common labels Feb 8, 2023

github-advanced-security bot found potential problems Feb 9, 2023

View reviewed changes

examples/asr/audio_to_audio/audio_to_audio_eval.py Fixed Show fixed Hide fixed

examples/asr/audio_to_audio/audio_to_audio_eval.py Fixed Show fixed Hide fixed

anteju force-pushed the dev/audio-to-audio-eval-script branch from d1e324b to 96b2a89 Compare February 9, 2023 01:12

anteju requested a review from jbalam-nv February 9, 2023 18:20

anteju force-pushed the dev/audio-to-audio-eval-script branch 7 times, most recently from f973be3 to f34f1b4 Compare February 13, 2023 21:50

anteju force-pushed the dev/audio-to-audio-eval-script branch 3 times, most recently from 38bda00 to 68c4f26 Compare February 21, 2023 19:05

anteju requested a review from titu1994 February 21, 2023 19:18

[ASR] Added a script for evaluating audio-to-audio metrics for a mani…

d8cf738

…fest file (audio_to_audio_eval.py) Signed-off-by: Ante Jukić <[email protected]>

anteju force-pushed the dev/audio-to-audio-eval-script branch from 68c4f26 to d8cf738 Compare February 24, 2023 18:23

jbalam-nv approved these changes Feb 24, 2023

View reviewed changes

jbalam-nv merged commit 2f11f05 into NVIDIA:main Feb 24, 2023

titu1994 pushed a commit to titu1994/NeMo that referenced this pull request Mar 24, 2023

[ASR] Added a script for evaluating audio-to-audio metrics for a mani…

ccfabb8

…fest file (audio_to_audio_eval.py) (NVIDIA#5971) Signed-off-by: Ante Jukić <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ASR] Added a script for evaluating metrics for audio-to-audio #5971

[ASR] Added a script for evaluating metrics for audio-to-audio #5971

anteju commented Feb 8, 2023 •

edited

Loading

jbalam-nv left a comment

[ASR] Added a script for evaluating metrics for audio-to-audio #5971

[ASR] Added a script for evaluating metrics for audio-to-audio #5971

Conversation

anteju commented Feb 8, 2023 • edited Loading

What does this PR do ?

Changelog

Usage

To score a dataset with a manifest file that contains the input audio which needs to be processed and target audio

To score a manifest file which has been previously processed and contains both processed audio and target audio

Before your PR is "Ready for review"

Who can review?

Additional Information

jbalam-nv left a comment

Choose a reason for hiding this comment

anteju commented Feb 8, 2023 •

edited

Loading