Skip to content

Commit 447de7d

Browse files
authored
Merge pull request #3 from moconnor725/beta-candidate
Beta candidate
2 parents 8ea92eb + c17d0a0 commit 447de7d

File tree

244 files changed

+29294
-10064
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

244 files changed

+29294
-10064
lines changed

Diff for: .gitignore

+2
Original file line numberDiff line numberDiff line change
@@ -27,6 +27,7 @@ eggs/
2727
.eggs/
2828
lib/
2929
lib64/
30+
#parts/
3031
sdist/
3132
var/
3233
wheels/
@@ -143,3 +144,4 @@ examples/*/outputs
143144
examples/*/wandb
144145
examples/*/data
145146
wandb
147+
dump.py

Diff for: LICENSE

+1-201
Original file line numberDiff line numberDiff line change
@@ -1,201 +1 @@
1-
Apache License
2-
Version 2.0, January 2004
3-
http://www.apache.org/licenses/
4-
5-
TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
6-
7-
1. Definitions.
8-
9-
"License" shall mean the terms and conditions for use, reproduction,
10-
and distribution as defined by Sections 1 through 9 of this document.
11-
12-
"Licensor" shall mean the copyright owner or entity authorized by
13-
the copyright owner that is granting the License.
14-
15-
"Legal Entity" shall mean the union of the acting entity and all
16-
other entities that control, are controlled by, or are under common
17-
control with that entity. For the purposes of this definition,
18-
"control" means (i) the power, direct or indirect, to cause the
19-
direction or management of such entity, whether by contract or
20-
otherwise, or (ii) ownership of fifty percent (50%) or more of the
21-
outstanding shares, or (iii) beneficial ownership of such entity.
22-
23-
"You" (or "Your") shall mean an individual or Legal Entity
24-
exercising permissions granted by this License.
25-
26-
"Source" form shall mean the preferred form for making modifications,
27-
including but not limited to software source code, documentation
28-
source, and configuration files.
29-
30-
"Object" form shall mean any form resulting from mechanical
31-
transformation or translation of a Source form, including but
32-
not limited to compiled object code, generated documentation,
33-
and conversions to other media types.
34-
35-
"Work" shall mean the work of authorship, whether in Source or
36-
Object form, made available under the License, as indicated by a
37-
copyright notice that is included in or attached to the work
38-
(an example is provided in the Appendix below).
39-
40-
"Derivative Works" shall mean any work, whether in Source or Object
41-
form, that is based on (or derived from) the Work and for which the
42-
editorial revisions, annotations, elaborations, or other modifications
43-
represent, as a whole, an original work of authorship. For the purposes
44-
of this License, Derivative Works shall not include works that remain
45-
separable from, or merely link (or bind by name) to the interfaces of,
46-
the Work and Derivative Works thereof.
47-
48-
"Contribution" shall mean any work of authorship, including
49-
the original version of the Work and any modifications or additions
50-
to that Work or Derivative Works thereof, that is intentionally
51-
submitted to Licensor for inclusion in the Work by the copyright owner
52-
or by an individual or Legal Entity authorized to submit on behalf of
53-
the copyright owner. For the purposes of this definition, "submitted"
54-
means any form of electronic, verbal, or written communication sent
55-
to the Licensor or its representatives, including but not limited to
56-
communication on electronic mailing lists, source code control systems,
57-
and issue tracking systems that are managed by, or on behalf of, the
58-
Licensor for the purpose of discussing and improving the Work, but
59-
excluding communication that is conspicuously marked or otherwise
60-
designated in writing by the copyright owner as "Not a Contribution."
61-
62-
"Contributor" shall mean Licensor and any individual or Legal Entity
63-
on behalf of whom a Contribution has been received by Licensor and
64-
subsequently incorporated within the Work.
65-
66-
2. Grant of Copyright License. Subject to the terms and conditions of
67-
this License, each Contributor hereby grants to You a perpetual,
68-
worldwide, non-exclusive, no-charge, royalty-free, irrevocable
69-
copyright license to reproduce, prepare Derivative Works of,
70-
publicly display, publicly perform, sublicense, and distribute the
71-
Work and such Derivative Works in Source or Object form.
72-
73-
3. Grant of Patent License. Subject to the terms and conditions of
74-
this License, each Contributor hereby grants to You a perpetual,
75-
worldwide, non-exclusive, no-charge, royalty-free, irrevocable
76-
(except as stated in this section) patent license to make, have made,
77-
use, offer to sell, sell, import, and otherwise transfer the Work,
78-
where such license applies only to those patent claims licensable
79-
by such Contributor that are necessarily infringed by their
80-
Contribution(s) alone or by combination of their Contribution(s)
81-
with the Work to which such Contribution(s) was submitted. If You
82-
institute patent litigation against any entity (including a
83-
cross-claim or counterclaim in a lawsuit) alleging that the Work
84-
or a Contribution incorporated within the Work constitutes direct
85-
or contributory patent infringement, then any patent licenses
86-
granted to You under this License for that Work shall terminate
87-
as of the date such litigation is filed.
88-
89-
4. Redistribution. You may reproduce and distribute copies of the
90-
Work or Derivative Works thereof in any medium, with or without
91-
modifications, and in Source or Object form, provided that You
92-
meet the following conditions:
93-
94-
(a) You must give any other recipients of the Work or
95-
Derivative Works a copy of this License; and
96-
97-
(b) You must cause any modified files to carry prominent notices
98-
stating that You changed the files; and
99-
100-
(c) You must retain, in the Source form of any Derivative Works
101-
that You distribute, all copyright, patent, trademark, and
102-
attribution notices from the Source form of the Work,
103-
excluding those notices that do not pertain to any part of
104-
the Derivative Works; and
105-
106-
(d) If the Work includes a "NOTICE" text file as part of its
107-
distribution, then any Derivative Works that You distribute must
108-
include a readable copy of the attribution notices contained
109-
within such NOTICE file, excluding those notices that do not
110-
pertain to any part of the Derivative Works, in at least one
111-
of the following places: within a NOTICE text file distributed
112-
as part of the Derivative Works; within the Source form or
113-
documentation, if provided along with the Derivative Works; or,
114-
within a display generated by the Derivative Works, if and
115-
wherever such third-party notices normally appear. The contents
116-
of the NOTICE file are for informational purposes only and
117-
do not modify the License. You may add Your own attribution
118-
notices within Derivative Works that You distribute, alongside
119-
or as an addendum to the NOTICE text from the Work, provided
120-
that such additional attribution notices cannot be construed
121-
as modifying the License.
122-
123-
You may add Your own copyright statement to Your modifications and
124-
may provide additional or different license terms and conditions
125-
for use, reproduction, or distribution of Your modifications, or
126-
for any such Derivative Works as a whole, provided Your use,
127-
reproduction, and distribution of the Work otherwise complies with
128-
the conditions stated in this License.
129-
130-
5. Submission of Contributions. Unless You explicitly state otherwise,
131-
any Contribution intentionally submitted for inclusion in the Work
132-
by You to the Licensor shall be under the terms and conditions of
133-
this License, without any additional terms or conditions.
134-
Notwithstanding the above, nothing herein shall supersede or modify
135-
the terms of any separate license agreement you may have executed
136-
with Licensor regarding such Contributions.
137-
138-
6. Trademarks. This License does not grant permission to use the trade
139-
names, trademarks, service marks, or product names of the Licensor,
140-
except as required for reasonable and customary use in describing the
141-
origin of the Work and reproducing the content of the NOTICE file.
142-
143-
7. Disclaimer of Warranty. Unless required by applicable law or
144-
agreed to in writing, Licensor provides the Work (and each
145-
Contributor provides its Contributions) on an "AS IS" BASIS,
146-
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
147-
implied, including, without limitation, any warranties or conditions
148-
of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
149-
PARTICULAR PURPOSE. You are solely responsible for determining the
150-
appropriateness of using or redistributing the Work and assume any
151-
risks associated with Your exercise of permissions under this License.
152-
153-
8. Limitation of Liability. In no event and under no legal theory,
154-
whether in tort (including negligence), contract, or otherwise,
155-
unless required by applicable law (such as deliberate and grossly
156-
negligent acts) or agreed to in writing, shall any Contributor be
157-
liable to You for damages, including any direct, indirect, special,
158-
incidental, or consequential damages of any character arising as a
159-
result of this License or out of the use or inability to use the
160-
Work (including but not limited to damages for loss of goodwill,
161-
work stoppage, computer failure or malfunction, or any and all
162-
other commercial damages or losses), even if such Contributor
163-
has been advised of the possibility of such damages.
164-
165-
9. Accepting Warranty or Additional Liability. While redistributing
166-
the Work or Derivative Works thereof, You may choose to offer,
167-
and charge a fee for, acceptance of support, warranty, indemnity,
168-
or other liability obligations and/or rights consistent with this
169-
License. However, in accepting such obligations, You may act only
170-
on Your own behalf and on Your sole responsibility, not on behalf
171-
of any other Contributor, and only if You agree to indemnify,
172-
defend, and hold each Contributor harmless for any liability
173-
incurred by, or claims asserted against, such Contributor by reason
174-
of your accepting any such warranty or additional liability.
175-
176-
END OF TERMS AND CONDITIONS
177-
178-
APPENDIX: How to apply the Apache License to your work.
179-
180-
To apply the Apache License to your work, attach the following
181-
boilerplate notice, with the fields enclosed by brackets "[]"
182-
replaced with your own identifying information. (Don't include
183-
the brackets!) The text should be enclosed in the appropriate
184-
comment syntax for the file format. We also recommend that a
185-
file or class name and description of purpose be included on the
186-
same "printed page" as the copyright notice for easier
187-
identification within third-party archives.
188-
189-
Copyright [yyyy] [name of copyright owner]
190-
191-
Licensed under the Apache License, Version 2.0 (the "License");
192-
you may not use this file except in compliance with the License.
193-
You may obtain a copy of the License at
194-
195-
http://www.apache.org/licenses/LICENSE-2.0
196-
197-
Unless required by applicable law or agreed to in writing, software
198-
distributed under the License is distributed on an "AS IS" BASIS,
199-
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
200-
See the License for the specific language governing permissions and
201-
limitations under the License.
1+
Please refer to per-package licenses

Diff for: README.rst

+5-6
Original file line numberDiff line numberDiff line change
@@ -15,6 +15,10 @@ Neural Modules’ inputs and outputs have Neural Type for semantic checking.
1515

1616
An application built with NeMo application is a Directed Acyclic Graph(DAG) of connected modules enabling researchers to define and build new speech and nlp networks easily through API Compatible modules.
1717

18+
**Documentation and Tutorials**
19+
20+
Please refer to the HTML documentation in the `docs` folder
21+
1822

1923
**VIDEO**
2024

@@ -31,11 +35,6 @@ An application built with NeMo application is a Directed Acyclic Graph(DAG) of c
3135
* **Collections** - NeMo comes with collections - related group of modules such as `nemo_asr` (for Speech Recognition) and `nemo_nlp` for NLP
3236

3337

34-
**Documentation**
35-
36-
Please refer to the HTML documentation in the `docs` folder
37-
38-
3938
**Requirements**
4039

4140
1) Python 3.6 or 3.7
@@ -60,7 +59,7 @@ Run this:
6059
2) Go to `nemo` folder and do: `python setup.py install`
6160
3) Install collections:
6261
a) ASR collection from `collections/nemo_asr` do: `python setup.py install`
63-
b) NLP collection coming soon ...
62+
b) NLP collection from `collections/nemo_nlp` do: `python setup.py install`
6463

6564
4) For development do: `python setup.py develop` instead of `python setup.py install` in Step (3) above
6665
5) Go to `examples/start_here` to get started with few simple examples

Diff for: collections/nemo_asr/nemo_asr/__init__.py

+15-1
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,17 @@
1-
# Copyright (c) 2019 NVIDIA Corporation
1+
# Copyright 2019 AI Applications Design Team at NVIDIA. All Rights Reserved.
2+
#
3+
# Licensed under the Apache License, Version 2.0 (the "License");
4+
# you may not use this file except in compliance with the License.
5+
# You may obtain a copy of the License at
6+
#
7+
# http://www.apache.org/licenses/LICENSE-2.0
8+
#
9+
# Unless required by applicable law or agreed to in writing, software
10+
# distributed under the License is distributed on an "AS IS" BASIS,
11+
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
12+
# See the License for the specific language governing permissions and
13+
# limitations under the License.
14+
# ==============================================================================
215
from nemo.core import Backend
316

417
from .data_layer import AudioToTextDataLayer, AudioPreprocessing, \
@@ -11,3 +24,4 @@
1124

1225
name = "nemo_asr"
1326
backend = Backend.PyTorch
27+
__version__ = "0.1"

Diff for: collections/nemo_asr/nemo_asr/data_layer.py

+8-9
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@
66
import torch
77
from apex import amp
88

9-
from nemo.backends.pytorch.nm import DataLayerNM, NonTrainableNM
9+
from nemo.backends.pytorch.nm import DataLayerNM, TrainableNM, NonTrainableNM
1010
from nemo.core import Optimization, DeviceType
1111
from nemo.core.neural_types import *
1212
from .parts.dataset import AudioDataset, seq_collate_fn
@@ -112,13 +112,12 @@ def __init__(
112112
labels=labels,
113113
featurizer=self._featurizer, max_duration=max_duration,
114114
min_duration=min_duration, normalize=normalize_transcripts,
115-
trim=trim_silence, verbose=self._master_process,
115+
trim=trim_silence, logger=self._logger,
116116
eos_id=eos_id, load_audio=load_audio
117117
)
118118

119119
if self._placement == DeviceType.AllGpu:
120-
if self._master_process:
121-
print('Parallelizing DATALAYER')
120+
self._logger.info('Parallelizing DATALAYER')
122121
sampler = torch.utils.data.distributed.DistributedSampler(
123122
self._dataset)
124123
else:
@@ -146,7 +145,7 @@ def data_iterator(self):
146145
return self._dataloader
147146

148147

149-
class AudioPreprocessing(NonTrainableNM):
148+
class AudioPreprocessing(TrainableNM):
150149
"""
151150
Neural Module that does batch processing of audio files and converts them
152151
to spectrogram representations
@@ -232,7 +231,7 @@ def __init__(
232231
raise NotImplementedError("AudioPreprocessing currently only "
233232
"accepts 'fbank' or 'logfbank' as "
234233
"feat_type")
235-
NonTrainableNM.__init__(self, **kwargs)
234+
TrainableNM.__init__(self, **kwargs)
236235

237236
self.featurizer = FilterbankFeatures(
238237
sample_rate=sample_rate,
@@ -248,14 +247,14 @@ def __init__(
248247
dither=dither,
249248
pad_to=pad_to,
250249
frame_splicing=frame_splicing,
251-
stft_conv=stft_conv
250+
stft_conv=stft_conv,
251+
logger=self._logger
252252
)
253253
# _pre_procesing_config = self.local_parameters
254254
# self.featurizer = FeatureFactory.from_config(_pre_procesing_config)
255255
self.featurizer.to(self._device)
256256

257-
stft_conv = kwargs.get("stft_conv", False)
258-
self.disable_casts = (self._opt_level != Optimization.nothing and
257+
self.disable_casts = (self._opt_level == Optimization.mxprO1 and
259258
not stft_conv)
260259

261260
def forward(self, input_signal, length):

0 commit comments

Comments
 (0)