Merged
2 changes: 1 addition & 1 deletion egs/madcat_ar/v1/local/prepare_data.sh
@@ -29,7 +29,7 @@ mkdir -p data/{train,test,dev}
if [ $stage -le 1 ]; then
echo "$0: Processing dev, train and test data..."
echo "Date: $(date)."
-local/process_data.py $download_dir1 $download_dir2 $download_dir3 $data_splits/madcat.train.raw.lineid data/dev data/local/dev/images.scp || exit 1
+local/process_data.py $download_dir1 $download_dir2 $download_dir3 $data_splits/madcat.dev.raw.lineid data/dev data/local/dev/images.scp || exit 1
local/process_data.py $download_dir1 $download_dir2 $download_dir3 $data_splits/madcat.test.raw.lineid data/test data/local/test/images.scp || exit 1
local/process_data.py $download_dir1 $download_dir2 $download_dir3 $data_splits/madcat.train.raw.lineid data/train data/local/train/images.scp || exit 1
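
The three calls above differ only in the split name, which appears in the line-ID file, the output directory, and the images.scp path. A minimal sketch of deriving all three from one split name, so the line-ID file and the output directory cannot drift apart; the helper name `build_process_data_cmd` and the placeholder directory values are hypothetical and not part of the recipe:

```python
def build_process_data_cmd(split, download_dirs, data_splits_dir):
    """Build the argument list for one local/process_data.py call.

    Hypothetical helper: every split-dependent path is derived from the
    single `split` argument, mirroring the argument order used in
    prepare_data.sh.
    """
    return [
        "local/process_data.py",
        *download_dirs,                                  # the three corpus directories
        f"{data_splits_dir}/madcat.{split}.raw.lineid",  # line-ID file for this split
        f"data/{split}",                                 # output data directory
        f"data/local/{split}/images.scp",                # image list for this split
    ]


if __name__ == "__main__":
    # Placeholder values standing in for the shell variables in prepare_data.sh.
    dirs = ["download_dir1", "download_dir2", "download_dir3"]
    for split in ("dev", "test", "train"):
        print(" ".join(build_process_data_cmd(split, dirs, "data_splits")))
```

The same idea could be written as a `for` loop over the split names directly in the shell script; the Python form is used here only to keep the sketch self-contained.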

2 changes: 1 addition & 1 deletion egs/madcat_ar/v1/local/process_data.py
@@ -5,7 +5,7 @@
""" This script reads MADCAT files and creates the following files (for the
data subset selected via --dataset) :text, utt2spk, images.scp.
Eg. local/process_data.py data/local /export/corpora/LDC/LDC2012T15 /export/corpora/LDC/LDC2013T09
-/export/corpora/LDC/LDC2013T15 /home/kduh/proj/scale2018/data/madcat_datasplit/ar-en/madcat.train.raw.lineid
+/export/corpora/LDC/LDC2013T15 data/download/data_splits/madcat.train.raw.lineid
data/dev data/local/lines/images.scp
Eg. text file: LDC0001_000404_NHR_ARB_20070113.0052_11_LDC0001_00z2 وجه وعقل غارق حتّى النخاع
utt2spk file: LDC0001_000397_NHR_ARB_20070113.0052_11_LDC0001_00z1 LDC0001