Skip to content
/ InFuse Public

Software for detecting gene fusions and exon-skippings from long read sequencing

Notifications You must be signed in to change notification settings

WGLab/InFuse

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 

Repository files navigation

InFuse

InFuse: gene fusion and exon-skipping detection from long read sequencing (under construction) overview

Installation

git clone https://github.com/WGLab/InFuse.git
pip install numpy pandas numba sklearn pysam ncls numpy_indexed parasail
python InFuse/InFuse.py --help

Usage

For gene fusion detection, please run the following command:

python PATH_TO_INFUSE_REPO/InFuse.py --bam BAM_FILE --output OUTPUT_FOLDER --prefix PREFIX --gff GFF_ANNOTATION_FILE --transcripts TRANSCRIPT_FASTA_FILE --threads NUM_THREADS --min_support MIN_READS --distance_threshold DISTANCE_THRESHOLD --gf_only

where BAM_FILE is an aligned BAM file, GFF_ANNOTATION_FILE is genome annotation GFF3 file, TRANSCRIPT_FASTA_FILE is a FASTA file containing spliced transcript sequences from the GFF3 file, NUM_THREADS is the number of threads to use for multiprocessing speedup, MIN_READS specifies the minimum read support needed for reporting a gene fusion, and DISTANCE_THRESHOLD specifies the distance in bp for merging gene fusion breakpoints.

The output folder will contain a tab seperated file PREFIX.final_gf_double_bp containing all gene fusions, as well as a python pickle file PREFIX.final_gf_double_bp.pickle containing detailed read-level information for all gene fusions.

About

Software for detecting gene fusions and exon-skippings from long read sequencing

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages