git clone https://github.com/marcottelab/MSblender.git
git checkout consistent_search_params
Start with a folder for each experiment containing a folder called mzXML containing its mzxml files and a folder called DB containing formatted fasta database. See below for how to make a formatted fasta
Ce1104_example
-----mzXML
---------- WAN1100427_OT2_Celegans_HCW_P1A01.mzXML
---------- WAN1100427_OT2_Celegans_HCW_P1A02.mzXML
-----DB
---------- uniprot-proteome%3AUP000001940_caeel_contam.combined.fasta
Write each experiment you want to process as one line in a list of experiment IDs
This can be a one line file with one experiment or multiline with multiple experiments
For example:
$ cat exp_list_example.txt
Ce1104_example
$ bash scripts/create_directory.sh $SCRATCH/metazoans/exp_list_example.txt
Creates file for each experiment called [experimentID]_commands.txt
$ bash scripts/create_commands.sh exp_list_example.txt $WORK/MSblender/runMS2.sh $HOME/searchgui
Need to manually edit header in template_run_MS.sbatch
Creates file for each experiment called [experimentid]_run_MS.sbatch
$ bash scripts/create_sbatch.sh exp_list_example.txt
$ sbatch Ce_1104_example_run_MS.sbatch
$ parallel -j4 :::: Ce1104_COMMANDS.txt
This goes in /DB folder of each experiment
Can also format ahead of time for a proteome and just cp in to the DB folder
$ bash $WORK/MSblender/setup/setup_database_only.sh uniprot-triticum%3AUP000019116.fasta
$ formatdb -i uniprot-triticum%3AUP000019116_contam.combined.fasta