Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: normalize patho-mms.bed => patho_mms.bed #315

Merged
merged 2 commits into from
Feb 27, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
48 changes: 26 additions & 22 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -305,34 +305,38 @@ $ varfish-server-worker strucvars query \
--path-output OUT.jsonl
```

The worker database has the following structure:
The worker database has the following structure.
Note that also mehari transcripts are read, thus the `mehari/` directory is included.

```
$ROOT/
mehari/
{genome_release}/
txs.bin.zst
worker/
noref/
genes/
acmg.tsv -- ACMG SF list genes
mim2gene.tsv -- OMIM to NCBI mapping from clingen
xlink.bin -- gene crosslinks
{genome_release}/ -- one per genome release
acmg.tsv -- ACMG SF list genes
mim2gene.tsv -- OMIM to NCBI mapping from clingen
xlink.bin -- gene crosslinks
{genome_release}/ -- one per genome release
mehari/
txs.bin.zstd -- mehari transcripts
features/ -- features important for annotation
masked_repeat.bin -- masked repeats
masked_seqdup.bin -- masked segmental duplications
strucvars/ -- structural variant specific
bgdbs/ -- background databases
dbvar.bin -- dbVar
dgv.bin -- DGV
dgv-gs.bin -- DGV gold standard
exac.bin -- ExAC CNVs
g1k.bin -- 1000 genomes CNVs
gnomad.bin -- gnomAD-SVs
clinvar.bin -- ClinVar SVs
inhouse.bin -- inhouse SV database
patho-mms.bed -- well-known pathogenic DELs/DUPs
txs.bin.zstd -- mehari transcripts
features/ -- features important for annotation
masked_repeat.bin -- masked repeats
masked_seqdup.bin -- masked segmental duplications
strucvars/ -- structural variant specific
bgdbs/ -- background databases
dbvar.bin -- dbVar
dgv.bin -- DGV
dgv_gs.bin -- DGV gold standard
g1k.bin -- 1000 genomes CNVs
gnomad_exomes.bin -- gnomAD-exomes/ExAC SVs
gnomad_genomes.bin -- gnomAD-genomes SVs
clinvar.bin -- ClinVar SVs
inhouse.bin -- inhouse SV database
patho_mms.bed -- well-known pathogenic DELs/DUPs
tads/
hesc.bed -- hESC TAD definitions
hesc.bed -- hESC TAD definitions
```

# Developer Information
Expand Down
2 changes: 1 addition & 1 deletion src/strucvars/query/pathogenic.rs
Original file line number Diff line number Diff line change
Expand Up @@ -178,7 +178,7 @@ pub fn load_patho_dbs(
let result = PathoDbBundle {
mms: load_patho_db_records(
Path::new(path_db)
.join(format!("{}/strucvars/patho-mms.bed", genome_release))
.join(format!("{}/strucvars/patho_mms.bed", genome_release))
.as_path(),
)?,
};
Expand Down
2 changes: 1 addition & 1 deletion tests/strucvars/query/db/bootstrap.sh
Git LFS file not shown
Loading