Skip to content
Merged

Reverb #2753

Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
46 commits
Select commit Hold shift + click to select a range
43a3921
add and delete files
Aug 31, 2018
2bc97b9
add files for TDNN training and modify some files
Sep 13, 2018
d816ad1
modified run.sh and add local/score.sh, get_reslts.sh
Sep 24, 2018
13ce9d0
update and clean up scripts
Oct 2, 2018
53fc524
minor modification
Oct 2, 2018
3a648fc
minor modification
Oct 2, 2018
078137e
Merge branch 'master' into reverb
Szu-JuiChen Oct 3, 2018
2b112e8
parameter update
Oct 3, 2018
04b42f4
Merge branch 'reverb' of https://github.com/Szu-JuiChen/kaldi into re…
Oct 3, 2018
8145d3c
Update run.sh
Szu-JuiChen Oct 3, 2018
37960b2
Update run_tdnn_1a.sh
Szu-JuiChen Oct 3, 2018
cb026a1
Update run_ivector_common.sh
Szu-JuiChen Oct 3, 2018
e775685
fix nan value issue
Oct 4, 2018
6d04afc
Added WPE
sas91 Oct 4, 2018
147cade
Merge pull request #1 from sas91/reverb
Szu-JuiChen Oct 4, 2018
c9e5f6a
Remove partial tag
Oct 6, 2018
327aabb
change naming chime5 to reverb
Oct 9, 2018
623a212
Added Beamformit
sas91 Oct 11, 2018
ccebba1
Updated GMM WPE and Beamformit Results
sas91 Oct 11, 2018
a4b6fb7
Included the beamforming script
sas91 Oct 11, 2018
c9e29fe
Added Beamformit config file
sas91 Oct 11, 2018
1797446
Merge pull request #2 from sas91/reverb
Szu-JuiChen Oct 11, 2018
1c771b7
Store 1ch and 2ch WPE wavefiles in separate directories
sas91 Oct 15, 2018
c2108ce
rm check_tools.sh and bug fixed in run.sh
Oct 15, 2018
d1e5998
remove clean room in recog lists, add wpe only in recog set
sas91 Oct 16, 2018
7b72e54
Merge branch 'reverb' of https://github.com/Szu-JuiChen/kaldi into re…
sas91 Oct 16, 2018
3dc5bb8
added wpe recog sets in run.sh
sas91 Oct 16, 2018
ecda4c1
Added 1ch without WPE also to recog sets
sas91 Oct 16, 2018
d1947df
bug fix for code refactoring in previous commit
sas91 Oct 16, 2018
21f2337
change the data storage place on the grid
Oct 17, 2018
3afa9ed
Merge pull request #3 from sas91/reverb
Szu-JuiChen Oct 19, 2018
31f82f1
Added dereverberation measures, cln evaluation and updated RESULTS
sas91 Nov 7, 2018
3fb2981
Minor modification in scoring script
sas91 Nov 7, 2018
03ffe53
Merge pull request #4 from sas91/reverb
Szu-JuiChen Nov 7, 2018
f453337
Added patch files
sas91 Nov 7, 2018
9c0887d
Merge pull request #5 from sas91/reverb
Szu-JuiChen Nov 8, 2018
53b3259
Updated RESULTS according to Shinji's comments
sas91 Nov 8, 2018
bc0e0f7
Merge pull request #6 from sas91/reverb
Szu-JuiChen Nov 8, 2018
10d4713
Enabled SE computation by default and added flag to enable PESQ
sas91 Nov 15, 2018
7734371
Merge pull request #7 from sas91/reverb
Szu-JuiChen Nov 15, 2018
8d23156
update RESULTS and fix error in compute_se_scores.sh
Nov 18, 2018
5caf1ca
minor fix
Nov 18, 2018
fc0edd5
update RESULTS and fix error in compute_se_scores.sh
Nov 18, 2018
69659e6
remove some useless comment lines
Nov 18, 2018
135494b
1) removed unnecessary files 2) Add the shebang header 3) Add option…
sw005320 Nov 20, 2018
447cdea
delete unused config files
sw005320 Nov 22, 2018
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
447 changes: 298 additions & 149 deletions egs/reverb/s5/RESULTS

Large diffs are not rendered by default.

2 changes: 0 additions & 2 deletions egs/reverb/s5/conf/decode_dnn.config

This file was deleted.

2 changes: 0 additions & 2 deletions egs/reverb/s5/conf/fbank.conf

This file was deleted.

10 changes: 10 additions & 0 deletions egs/reverb/s5/conf/mfcc_hires.conf
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
# config for high-resolution MFCC features, intended for neural network training.
# Note: we keep all cepstra, so it has the same info as filterbank features,
# but MFCC is more easily compressible (because less correlated) which is why
# we prefer this method.
--use-energy=false # use average of log energy, not energy.
--sample-frequency=16000
--num-mel-bins=40
--num-ceps=40
--low-freq=40
--high-freq=-400
1 change: 1 addition & 0 deletions egs/reverb/s5/conf/online_cmvn.conf
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
# configuration file for apply-cmvn-online, used in the script ../local/run_online_decoding.sh
50 changes: 50 additions & 0 deletions egs/reverb/s5/conf/reverb_beamformit.cfg
Original file line number Diff line number Diff line change
@@ -0,0 +1,50 @@
#BeamformIt sample configuration file for AMI data (http://groups.inf.ed.ac.uk/ami/download/)

# scrolling size to compute the delays
scroll_size = 250

# cross correlation computation window size
window_size = 500

#amount of maximum points for the xcorrelation taken into account
nbest_amount = 4

#flag wether to apply an automatic noise thresholding
do_noise_threshold = 1

#Percentage of frames with lower xcorr taken as noisy
noise_percent = 10

######## acoustic modelling parameters

#transition probabilities weight for multichannel decoding
trans_weight_multi = 25
trans_weight_nbest = 25

###

#flag wether to print the feaures after setting them, or not
print_features = 1

#flag wether to use the bad frames in the sum process
do_avoid_bad_frames = 1

#flag to use the best channel (SNR) as a reference
#defined from command line
do_compute_reference = 1

#flag wether to use a uem file or not(process all the file)
do_use_uem_file = 0

#flag wether to use an adaptative weights scheme or fixed weights
do_adapt_weights = 1

#flag wether to output the sph files or just run the system to create the auxiliary files
do_write_sph_files = 1

####directories where to store/retrieve info####
#channels_file = ./cfg-files/channels

#show needs to be passed as argument normally, here a default one is given just in case
#show_id = Ttmp

23 changes: 10 additions & 13 deletions egs/reverb/s5/local/Generate_mcTrainData_cut.m
Original file line number Diff line number Diff line change
@@ -1,13 +1,13 @@
function Generate_mcTrainData_cut(WSJ_dir_name, save_dir)
%
% Input variables:
% WSJ_dir_name: string name of user's clean wsjcam0 corpus directory
% (*Directory structure for wsjcam0 corpushas to be kept as it is after obtaining it from LDC.
% WSJ_dir_name: string name of WAV file directory converted from original wsjcam0 SPHERE files
% (*Directory structure for wsjcam0 corpus to be kept as it is after obtaining it from LDC.
% Otherwise this script does not work.)
%
% This function generates multi-condition traiing data
% based on the following items:
% 1. wsjcam0 corpus (distributed from the LDC)
% 1. wsjcam0 corpus (WAV files)
% 2. room impulse responses (ones under ./RIR/)
% 3. noise (ones under ./NOISE/).
% Generated data has the same directory structure as original wsjcam0 corpus.
Expand All @@ -26,8 +26,6 @@ function Generate_mcTrainData_cut(WSJ_dir_name, save_dir)

display(['Name of directory for original wsjcam0: ',WSJ_dir_name])
display(['Name of directory to save generated multi-condition training data: ',save_dir])
unix(['chmod u+x sphere_to_wave.csh']);
unix(['chmod u+x bin/*']);

% Parameters related to acoustic conditions
SNRdB=20;
Expand Down Expand Up @@ -89,7 +87,6 @@ function Generate_mcTrainData_cut(WSJ_dir_name, save_dir)
save_dir_tr=[save_dir,'/data/mc_train/'];
end
mkdir([save_dir_tr]);
%mkdir([save_dir,'/taskfiles/'])

mic_idx=['A';'B';'C';'D';'E';'F';'G';'H'];
prev_fname='dummy';
Expand All @@ -114,13 +111,12 @@ function Generate_mcTrainData_cut(WSJ_dir_name, save_dir)
end
prev_fname=fname(1:idx1(end));

% load (sphere format) speech signal
x=read_sphere([WSJ_dir_name,'/data/', fname]);
x=x/(2^15); % conversion from short-int to float
% load speech signal
x=audioread([WSJ_dir_name, '/data/', fname, '.wav'])';

% load RIR and noise for "THIS" utterance
eval(['RIR=wavread(RIR_sim',num2str(rcount),');']);
eval(['NOISE=wavread([noise_sim',num2str(ceil(rcount/4)),',''_',num2str(ncount),'.wav'']);']);
eval(['RIR=audioread(RIR_sim',num2str(rcount),');']);
eval(['NOISE=audioread([noise_sim',num2str(ceil(rcount/4)),',''_',num2str(ncount),'.wav'']);']);

% Generate 8ch noisy reverberant data
y=gen_obs(x,RIR,NOISE,SNRdB);
Expand All @@ -138,8 +134,9 @@ function Generate_mcTrainData_cut(WSJ_dir_name, save_dir)
y=y/4; % common normalization to all the data to prevent clipping
% denominator was decided experimentally

for ch=1:8
eval(['wavwrite(y(:,',num2str(ch),'),16000,''',save_dir_tr fname,'_ch',num2str(ch),'.wav'');']);
for ch=1:8
outfilename = [save_dir_tr, fname, '_ch', num2str(ch), '.wav'];
eval(['audiowrite(outfilename, y(:,',num2str(ch),'), 16000);']);
end

display(['sentence ',num2str(fcount),' (out of 7861) finished! (Multi-condition training data)'])
Expand Down
74 changes: 0 additions & 74 deletions egs/reverb/s5/local/REVERB_create_mcdata.sh

This file was deleted.

165 changes: 0 additions & 165 deletions egs/reverb/s5/local/REVERB_mcwsjav_data_prep.sh

This file was deleted.

Loading