# macOS
.DS_Store
# Python
__pycache__/
# development
config.sh
*.bbprojectd/
TODO
TODO/
/zz-*/
/00-*/
/results-2023-11-19/
/results-2023-12/
/clean-*-test*/
/tubelex-en-V0/
/tubelex-en-remote/
/*.ftz
# jtubespeech & tubelex output
/jtubespeech
/jtubespeech-subtitles
/tubelex-ja-channels.tsv
/*.xz
/*.zip
/*.tsv
/*.out
/tubelex-frequency-2024-07-15
/tubelex-frequency-2024-07-23-es
tubelex-*-removed-addresses.json
# The corpus is not public due to copyright...
/corpus/*
# ...except video IDs and metadata:
!/corpus/videos-*.csv
# SubIMDB data, LaboroTVSpeech, HKUST-MTCS data
/SubIMDB_All_Individual.tar.xz
/SubIMDB_All_Individual.tar
/SubIMDB_All_Individual/
/laborotvspeech.zip
/laborotvspeech
/LDC2005T32
# experiments/LCP data
/data/*
/data/downloads/*
!/data/subtlex-es.tsv.xz
!/data/chinese-familiarity.tsv.xz
!/data/indonesian-subjective-frequency.csv.xz
!/data/os*.tsv.xz
!/data/os_README.txt
!/data/es-words.*.txt
!/data/mrc.csv
!/data/mrc_README.txt
!/data/github-images
/experiments/zz-*
/experiments/models/*
/experiments/output/*
/experiments/output.tsv
/experiments/cache/*
# large files
/fasttext/
/kenlm/