Skip to content

Commit

Permalink
adding gene order and disjoint-exon-per-gene for the default annotati…
Browse files Browse the repository at this point in the history
…on resum check
  • Loading branch information
nmra-cwilks committed Feb 19, 2022
1 parent 68457fb commit 646c591
Showing 1 changed file with 21 additions and 0 deletions.
21 changes: 21 additions & 0 deletions get_unify_refs.sh
Original file line number Diff line number Diff line change
Expand Up @@ -48,4 +48,25 @@ fi
#need to add a header to the exons file and gzip it
#slight misnomer in the header, "gene" is really "chromosome" but leave for backwards compatibility
cat <(echo "gene start end name score strand") ../${org}/gtf/exons.bed | gzip > exons.w_header.bed.gz

#finally, grab per-annotation ordering and default annotation disjoin-exon-per-gene BED file for post-run resum check
annotations="G026 G029 R109 F006 ERCC SIRV"
default="G026"
if [[ $org == "grcm38" ]]; then
annotations="M023 ERCC SIRV"
default="M023"
fi
if [[ ! -e disjoint2exons2genes.${default}.sorted.cut.bed ]]; then
wget https://recount-ref.s3.amazonaws.com/${org}_unify/disjoint2exons2genes.${default}.sorted.cut.bed.gz
gunzip disjoint2exons2genes.${default}.sorted.cut.bed.gz
fi
for f in $annotations; do
f="${f}.gene_sums.gene_order.tsv.gz"
unzipped=$(echo $f | sed 's/\.gz$//')
if [[ ! -e "$unzipped" ]]; then
wget https://recount-ref.s3.amazonaws.com/${org}_unify/$f
gunzip $f
fi
done

popd

0 comments on commit 646c591

Please sign in to comment.