PULpy

Open prediction of Polysaccharide Utilisation Loci (PUL)

Create conda env

conda env create -f envs/PULpy.yaml
source activate PULpy

Get Pfam data

# Pfam

mkdir pfam_data && cd pfam_data
wget ftp://ftp.ebi.ac.uk/pub/databases/Pfam/current_release/Pfam-A.hmm.gz
wget ftp://ftp.ebi.ac.uk/pub/databases/Pfam/current_release/Pfam-A.hmm.dat.gz
wget ftp://ftp.ebi.ac.uk/pub/databases/Pfam/current_release/active_site.dat.gz
gunzip Pfam-A.hmm.gz Pfam-A.hmm.dat.gz active_site.dat.gz
hmmpress Pfam-A.hmm
cd ..

Get DBCAN data

mkdir dbcan_data && cd dbcan_data
wget http://bcb.unl.edu/dbCAN2/download/Databases/dbCAN-old@UGA/hmmscan-parser.sh
wget http://bcb.unl.edu/dbCAN2/download/Databases/dbCAN-old@UGA/dbCAN-fam-HMMs.txt
hmmpress dbCAN-fam-HMMs.txt
chmod 755 hmmscan-parser.sh
cd ..

Edit config.json if you need to....

Make scripts executable

chmod -R 755 scripts

Input file

PULpy is designed to process genomes downloaded from NCBI, therefore expects them in a format:

"genomes/{id}_genomic.fna.gz")

That is, in a directory called "genomes", with a unique_id followed by "_genomic.fna.gz"

Run it

snakemake --use-conda

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
envs		envs
scripts		scripts
README.md		README.md
Snakefile		Snakefile
config.json		config.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PULpy

Create conda env

Get Pfam data

Get DBCAN data

Edit config.json if you need to....

Make scripts executable

Input file

Run it

About

Releases

Packages

Languages

WatsonLab/PULpy

Folders and files

Latest commit

History

Repository files navigation

PULpy

Create conda env

Get Pfam data

Get DBCAN data

Edit config.json if you need to....

Make scripts executable

Input file

Run it

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages