Update README.rst #4
Closed
No description provided.

Conversation
yzhang123 pushed a commit to yzhang123/NeMo that referenced this pull request on May 1, 2023:

  allow as many tokens to be generated as max target
lhb8125 added a commit to lhb8125/NeMo that referenced this pull request on Jul 19, 2023:

  * Create README.rst
  * Update and rename README.rst to README.md
  * Update README.md
zhehuaichen added a commit that referenced this pull request on Oct 9, 2023:

  * add initial impl of ModularizedSpeechGPTModel and integration test
  * fix typo in the test name (#1) approve the nit change
  * clean a initial version of example config; make sure it works by test (#2) approve as no need to review
  * add the test for training_step and fix the code correspondingly (test passed now) (#3)
  * add test for validation_step (#4)
  * mv audio and text emb concat to prepare_llm_input so as to write test to guard the llm input
  * Merge heh and zhehuai's initial version of frozen am+llm (#5)
    The previous differences are summarized here: https://docs.google.com/document/d/1zNI4hC6vJtUfcHbrUSPaMuYWRBQdN_36H0P2NiBiuPY/edit
    This PR includes:
    1. Finish merging the model, dataset, and config code
    2. Previous tests are still enabled and passed (prepare_llm_input, training_step, validation_step)
    3. the example training script with LS960 has been run to make sure the training pipeline works
    The major remaining works are listed here: https://docs.google.com/document/d/1o0AM7v4gcTQkPZjE0Vl9TTX4vYnGTrbXEFGWh0UhGlk/edit#bookmark=id.pzvdadt5oxyw
    ---------
    Co-authored-by: He Huang (Steve) <[email protected]>
  * fix a nit init bug broke test (#6) Signed-off-by: zhehuaichen <[email protected]>
  * Clean up implementation for SALM paper and sync to NEMO v1.20.0 (#18)
    * wip Signed-off-by: zhehuaichen <[email protected]>
    * fix data Signed-off-by: zhehuaichen <[email protected]>
    * fix consumed_samples Signed-off-by: zhehuaichen <[email protected]>
    * fix the training restart problem by storing adapter+perception model and init them from the ckpt Signed-off-by: zhehuaichen <[email protected]>
    * refix state dict Signed-off-by: zhehuaichen <[email protected]>
    * support wer and inf Signed-off-by: zhehuaichen <[email protected]>
    * nan guard Signed-off-by: zhehuaichen <[email protected]>
    * reimpl inf and bug fix Signed-off-by: zhehuaichen <[email protected]>
    * multi loader Signed-off-by: zhehuaichen <[email protected]>
    * unfreeze lm Signed-off-by: zhehuaichen <[email protected]>
    * flag for load am Signed-off-by: zhehuaichen <[email protected]>
    * tokenizer Signed-off-by: zhehuaichen <[email protected]>
    * overwrite vocab size Signed-off-by: zhehuaichen <[email protected]>
    * support bpe dropout Signed-off-by: zhehuaichen <[email protected]>
    * add tarred datasets Signed-off-by: stevehuang52 <[email protected]>
    * fix sample_alpha Signed-off-by: stevehuang52 <[email protected]>
    * fix bpe dropout bugs in the mismatched context in tokenization Signed-off-by: zhehuaichen <[email protected]>
    * add bleu metric Signed-off-by: stevehuang52 <[email protected]>
    * update metrics Signed-off-by: stevehuang52 <[email protected]>
    * support inference and fix a bug in wer calculation Signed-off-by: zhehuaichen <[email protected]>
    * fix bucketing dataset Signed-off-by: stevehuang52 <[email protected]>
    * fix bleu implementation Signed-off-by: zhehuaichen <[email protected]>
    * support question set file per dataset/data loader in preparation for multitask understanding; also fix bleu implementation Signed-off-by: zhehuaichen <[email protected]>
    * support simple random context for word boosting Signed-off-by: zhehuaichen <[email protected]>
    * use sacrebleu.corpus_bleu to be consistent with the rest Signed-off-by: zhehuaichen <[email protected]>
    * make audio_file optional in the data loader Signed-off-by: zhehuaichen <[email protected]>
    * add a tool to materialize mt and text data Signed-off-by: zhehuaichen <[email protected]>
    * compatible with tar dataset Signed-off-by: zhehuaichen <[email protected]>
    * temp fix for metric and speed up materialization Signed-off-by: zhehuaichen <[email protected]>
    * make num of context configurable Signed-off-by: zhehuaichen <[email protected]>
    * val_check_interval fix; make manifest dumping consistent with speech models Signed-off-by: zhehuaichen <[email protected]>
    * random_context_positive_ratio configurable to control precision Signed-off-by: zhehuaichen <[email protected]>
    * bug fix: freeze_llm flag is not passed to the model cfg Signed-off-by: zhehuaichen <[email protected]>
    * overwrite tensor_model_parallel_size Signed-off-by: zhehuaichen <[email protected]>
    * support both stt and ssl models for loading audio encoder Signed-off-by: zhehuaichen <[email protected]>
    * fix the inference config so as to use sampling; allow inference config update in training Signed-off-by: zhehuaichen <[email protected]>
    * refactorize and clean up code for preprocessing collections, dataset interface, model inference and rename some classes to be consistent with salm paper. also make sure test passed Signed-off-by: zhehuaichen <[email protected]>
    * Undo changes in megatron_gpt_peft_models.py and move them to speechllm_models.py; make sure the correctness by test_speechllm_models.py::TestModularizedAudioGPTModel::test_predict_step Signed-off-by: zhehuaichen <[email protected]>
    * update default inference config and test golden value accordingly Signed-off-by: zhehuaichen <[email protected]>
    * integration test and minor fix Signed-off-by: zhehuaichen <[email protected]>
    * nit bug fix on manifest_filepath introduced by code cleanup Signed-off-by: zhehuaichen <[email protected]>
    * update workspace/ files; consider moving to examples later Signed-off-by: zhehuaichen <[email protected]>
    * further remove unnecessary stuff in the inference implementation Signed-off-by: zhehuaichen <[email protected]>
    * revert the update in default end_string to be compatible with legacy models Signed-off-by: zhehuaichen <[email protected]>
    ---------
    Signed-off-by: zhehuaichen <[email protected]>
    Signed-off-by: stevehuang52 <[email protected]>
    Co-authored-by: stevehuang52 <[email protected]>
    Co-authored-by: He Huang (Steve) <[email protected]>
  * rename 'ModularizedAudioGPTModel' to 'ModularAudioGPTLoRAModel'; move speechllm stuff under nemo/collections/multimodal/speechllm Signed-off-by: zhehuaichen <[email protected]>
  * update copyright; remove workspace/scripts and workspace/tools folders since the main branch has LLaMA support Signed-off-by: zhehuaichen <[email protected]>
  ---------
  Signed-off-by: zhehuaichen <[email protected]>
  Signed-off-by: stevehuang52 <[email protected]>
  Co-authored-by: Zhehuai Chen <[email protected]>
  Co-authored-by: He Huang (Steve) <[email protected]>
  Co-authored-by: stevehuang52 <[email protected]>
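One bullet in the commit message above switches BLEU scoring to sacrebleu.corpus_bleu "to be consistent with the rest". For reference, here is a minimal standalone sketch of that call; the hypothesis and reference sentences are invented for illustration, and this is not NeMo's actual metric wrapper.

```python
# Minimal sketch of corpus-level BLEU via sacrebleu.corpus_bleu, the call
# the commit message refers to. Standalone illustration only: the example
# sentences below are made up, and NeMo's real code wraps this differently.
import sacrebleu

hypotheses = [
    "the cat sat on the mat",
    "there is a dog in the park",
]
# sacrebleu expects a list of reference *streams*: references[i][j] is the
# i-th reference translation for the j-th hypothesis.
references = [
    ["the cat is sitting on the mat", "a dog is in the park"],
]

bleu = sacrebleu.corpus_bleu(hypotheses, references)
print(f"corpus BLEU: {bleu.score:.2f}")
```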
zhehuaichen added a commit that referenced this pull request on Oct 13, 2023, with the same commit message as the Oct 9, 2023 commit above.
pzelasko pushed a commit to pzelasko/NeMo that referenced this pull request on Nov 29, 2023.
pzelasko pushed a commit to pzelasko/NeMo that referenced this pull request on May 8, 2024.