Skip to content

Commit 57a68aa

Browse files
github-actions[bot]nithinraok
authored andcommitted
updated link to pubmed (NVIDIA#8402) (NVIDIA#8407)
Signed-off-by: Nithin Rao Koluguri <nithinraok> Co-authored-by: Nithin Rao <[email protected]> Signed-off-by: Sasha Meister <[email protected]>
1 parent 1638156 commit 57a68aa

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

tutorials/nlp/SpellMapper_English_ASR_Customization.ipynb

+4-4
Original file line numberDiff line numberDiff line change
@@ -214,7 +214,7 @@
214214
"source": [
215215
"## Download data\n",
216216
"\n",
217-
"File `pubmed23n0009.xml` taken from public ftp server of https://www.ncbi.nlm.nih.gov/pmc/ contains information about 5593 medical papers, from which we extract only their abstracts. We will feed sentences from there to TTS + ASR to get initial ASR results.\n",
217+
"File `pubmed24n0009.xml` taken from public ftp server of https://www.ncbi.nlm.nih.gov/pmc/ contains information about 5593 medical papers, from which we extract only their abstracts. We will feed sentences from there to TTS + ASR to get initial ASR results.\n",
218218
"\n",
219219
"File `wordlist.txt` contains 100k **single-word** medical terms.\n",
220220
"\n",
@@ -231,9 +231,9 @@
231231
},
232232
"outputs": [],
233233
"source": [
234-
"!wget https://ftp.ncbi.nlm.nih.gov/pubmed/baseline/pubmed23n0009.xml.gz\n",
235-
"!gunzip pubmed23n0009.xml.gz\n",
236-
"!grep \"AbstractText\" pubmed23n0009.xml > abstract.txt\n",
234+
"!wget https://ftp.ncbi.nlm.nih.gov/pubmed/baseline/pubmed24n0009.xml.gz\n",
235+
"!gunzip pubmed24n0009.xml.gz\n",
236+
"!grep \"AbstractText\" pubmed24n0009.xml > abstract.txt\n",
237237
"\n",
238238
"!wget https://raw.githubusercontent.com/McGill-NLP/medal/master/toy_data/valid_adam.txt\n",
239239
"!wget https://raw.githubusercontent.com/glutanimate/wordlist-medicalterms-en/master/wordlist.txt\n",

0 commit comments

Comments
 (0)