-
Notifications
You must be signed in to change notification settings - Fork 2.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ranking merge #3906
Ranking merge #3906
Conversation
Signed-off-by: Yang Zhang <[email protected]>
This pull request introduces 5 alerts when merging 8423be4 into ff91628 - view on LGTM.com new alerts:
|
8423be4
to
992a4f5
Compare
This pull request introduces 5 alerts when merging ab5cab3 into ff91628 - view on LGTM.com new alerts:
|
This pull request introduces 5 alerts when merging 4f778e8 into f68c924 - view on LGTM.com new alerts:
|
This pull request introduces 5 alerts when merging 8d6c449 into 60f4c6c - view on LGTM.com new alerts:
|
Signed-off-by: ekmb <[email protected]>
This pull request introduces 4 alerts when merging be732af into ca8a7e0 - view on LGTM.com new alerts:
|
|
||
|
||
class RangeFst(GraphFst): | ||
""" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
dosctings
Signed-off-by: ekmb <[email protected]>
Signed-off-by: ekmb <[email protected]>
This pull request introduces 1 alert when merging 7b19452 into b1b6e5e - view on LGTM.com new alerts:
|
Signed-off-by: Yang Zhang <[email protected]>
This pull request introduces 1 alert when merging be8192a into 5c88c8d - view on LGTM.com new alerts:
|
Signed-off-by: ekmb <[email protected]>
Signed-off-by: ekmb <[email protected]>
Signed-off-by: ekmb <[email protected]>
Signed-off-by: ekmb <[email protected]>
This pull request introduces 1 alert when merging d3d8437 into 087de54 - view on LGTM.com new alerts:
|
Signed-off-by: ekmb <[email protected]>
nemo_text_processing/text_normalization/en/data/address/address_words.tsv
Show resolved
Hide resolved
nemo_text_processing/text_normalization/en/data/measurements.tsv
Outdated
Show resolved
Hide resolved
@@ -129,32 +132,80 @@ def normalize(self, text: str, n_tagged: int, punct_post_process: bool = True, v | |||
len(text.split()) < 500 | |||
), "Your input is too long. Please split up the input into sentences, or strings with fewer than 500 words" | |||
original_text = text | |||
|
|||
if self.lang == "en": | |||
if self.lang in ["en", "de"]: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ekmb only needed for english or can this be applied to all langauges?
nemo_text_processing/text_normalization/normalize_with_audio.py
Outdated
Show resolved
Hide resolved
@@ -0,0 +1,20 @@ | |||
`female.tsv` - List of common female names. Copyright (c) January 1991 by Mark Kantrowitz, 4987 names, Version 1.3 (29-MAR-94) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
will this file type be added to pip package?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes, it's needed to build the roman graph
roman_dict = load_labels(get_abs_path("data/roman/roman_to_spoken.tsv")) | ||
default_graph = pynini.string_map(roman_dict).optimize() | ||
default_graph = pynutil.insert("integer: \"") + default_graph + pynutil.insert("\"") | ||
graph_teens = pynini.string_map([x[0] for x in roman_dict[:19]]).optimize() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ekmb shall we relax this too for audio based without lm?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
are there any confusing cases like abbreviations?
nemo_text_processing/text_normalization/en/taggers/tokenize_and_classify_with_audio.py
Outdated
Show resolved
Hide resolved
Signed-off-by: Yang Zhang <[email protected]>
nemo_text_processing/text_normalization/en/data/address/address_words.tsv
Show resolved
Hide resolved
@@ -0,0 +1,20 @@ | |||
`female.tsv` - List of common female names. Copyright (c) January 1991 by Mark Kantrowitz, 4987 names, Version 1.3 (29-MAR-94) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes, it's needed to build the roman graph
nemo_text_processing/text_normalization/en/taggers/tokenize_and_classify_with_audio.py
Outdated
Show resolved
Hide resolved
roman_dict = load_labels(get_abs_path("data/roman/roman_to_spoken.tsv")) | ||
default_graph = pynini.string_map(roman_dict).optimize() | ||
default_graph = pynutil.insert("integer: \"") + default_graph + pynutil.insert("\"") | ||
graph_teens = pynini.string_map([x[0] for x in roman_dict[:19]]).optimize() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
are there any confusing cases like abbreviations?
Signed-off-by: Yang Zhang <[email protected]>
Signed-off-by: Yang Zhang <[email protected]>
Signed-off-by: Yang Zhang <[email protected]>
Signed-off-by: Yang Zhang <[email protected]>
Signed-off-by: ekmb <[email protected]>
This pull request introduces 2 alerts when merging 668eafc into afc7b71 - view on LGTM.com new alerts:
|
Signed-off-by: Yang Zhang <[email protected]>
This pull request introduces 2 alerts when merging 715f95c into c7a5a33 - view on LGTM.com new alerts:
|
This pull request introduces 2 alerts when merging 89e5dff into c7a5a33 - view on LGTM.com new alerts:
|
This pull request introduces 2 alerts when merging 1702604 into 4aba4b2 - view on LGTM.com new alerts:
|
What does this PR do ?
Add a one line overview of what this PR aims to accomplish.
Collection: [Note which collection this PR will affect]
Changelog
Usage
# Add a code snippet demonstrating how to use this
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information