Skip to content

Commit d908a76

Browse files
github-actions[bot]krishnacpuvvadatitu1994
authored andcommitted
bug fix in fast-conformer-aed.yaml and adding jenkins test for speech_to_text_aed model (NVIDIA#8368) (NVIDIA#8383)
Signed-off-by: Krishna Puvvada <[email protected]> Co-authored-by: Krishna Puvvada <[email protected]> Co-authored-by: Krishna Puvvada <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Sasha Meister <[email protected]>
1 parent 4d6e4e7 commit d908a76

File tree

2 files changed

+43
-1
lines changed

2 files changed

+43
-1
lines changed

Jenkinsfile

+42
Original file line numberDiff line numberDiff line change
@@ -605,6 +605,48 @@ pipeline {
605605

606606
}
607607

608+
stage('L2: Speech to Text AED') {
609+
when {
610+
anyOf {
611+
branch 'r1.23.0'
612+
changeRequest target: 'r1.23.0'
613+
}
614+
}
615+
steps {
616+
sh 'python examples/asr/speech_multitask/speech_to_text_aed.py \
617+
model.prompt_format=canary \
618+
model.model_defaults.asr_enc_hidden=256 \
619+
model.model_defaults.lm_dec_hidden=256 \
620+
model.encoder.n_layers=12 \
621+
model.transf_encoder.num_layers=0 \
622+
model.transf_decoder.config_dict.num_layers=12 \
623+
model.train_ds.manifest_filepath=/home/TestData/asr/manifests/canary/an4_canary_train.json \
624+
++model.train_ds.is_tarred=false \
625+
model.train_ds.batch_duration=60 \
626+
+model.train_ds.text_field="answer" \
627+
+model.train_ds.lang_field="target_lang" \
628+
model.validation_ds.manifest_filepath=/home/TestData/asr/manifests/canary/an4_canary_val.json \
629+
+model.validation_ds.text_field="answer" \
630+
+model.validation_ds.lang_field="target_lang" \
631+
model.test_ds.manifest_filepath=/home/TestData/asr/manifests/canary/an4_canary_val.json \
632+
+model.test_ds.text_field="answer" \
633+
+model.test_ds.lang_field="target_lang" \
634+
model.tokenizer.langs.spl_tokens.dir=/home/TestData/asr_tokenizers/canary/canary_spl_tokenizer_v32 \
635+
model.tokenizer.langs.spl_tokens.type="bpe" \
636+
model.tokenizer.langs.en.dir=/home/TestData/asr_tokenizers/canary/en/tokenizer_spe_bpe_v1024_max_4 \
637+
model.tokenizer.langs.en.type=bpe \
638+
++model.tokenizer.langs.es.dir=/home/TestData/asr_tokenizers/canary/es/tokenizer_spe_bpe_v1024_max_4 \
639+
++model.tokenizer.langs.es.type=bpe \
640+
trainer.devices=[0] \
641+
trainer.accelerator="gpu" \
642+
+trainer.use_distributed_sampler=false \
643+
+trainer.fast_dev_run=True \
644+
exp_manager.exp_dir=examples/asr/speech_to_text_aed_results'
645+
sh 'rm -rf examples/asr/speech_to_text_results'
646+
}
647+
648+
}
649+
608650
stage('L2: Speaker dev run') {
609651
when {
610652
anyOf {

examples/asr/conf/speech_multitask/fast-conformer_aed.yaml

+1-1
Original file line numberDiff line numberDiff line change
@@ -38,7 +38,7 @@ model:
3838
# https://github.com/NVIDIA/NeMo/blob/main/docs/source/asr/datasets.rst#lhotse-dataloading
3939
# You can also check the following configuration dataclass:
4040
# https://github.com/NVIDIA/NeMo/blob/main/nemo/collections/common/data/lhotse/dataloader.py#L36
41-
batch_size: None
41+
batch_size: null
4242
batch_duration: 360
4343
quadratic_duration: 15
4444
use_bucketing: True

0 commit comments

Comments
 (0)