Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

filterCRs IndexError:list index out of range #88

Open
bio-xtt opened this issue Apr 24, 2024 · 7 comments
Open

filterCRs IndexError:list index out of range #88

bio-xtt opened this issue Apr 24, 2024 · 7 comments

Comments

@bio-xtt
Copy link

bio-xtt commented Apr 24, 2024

Hello!
When I run the software according to the following code, there is an index error. How can I solve this problem?
Here's my code:

step1:build index

scTE_build -g panTro6

step2: filter bam

samtools view possorted_genome_bam.bam -h | awk '/^@/ || /CB:/' | samtools view -h -b > possorted_genome_bam_CB.bam
samtools view possorted_genome_bam.bam -h | awk '/^@/ || /UB:/' | samtools view -h -b > possorted_genome_bam_UB.bam

step3: TE count

scTE -i /work/xtt/data_input/fastq_out/Pan_troglodytes_C1_out/outs/possorted_genome_bam_UB.bam
-o Pan_troglodytes_C1
-x /work/xtt/biosoft/scTE_ref/panTro6.exclusive.idx
--hdf5 True -CB CB -UMI UB -p 40
echo "Pan_troglodytes_C1 done!"

The possorted_genome_bam_UB.bam file format is as follows:
2024-04-24_08-45

The error message is as follows:
2024-04-24_08-45_1

@jphe
Copy link
Contributor

jphe commented Apr 24, 2024

For step2, scTE requires the reads has both CB and UB tags.

How dose the Pan_troglodytes_C1_scTEtmp/o1 and Pan_troglodytes_C1_scTEtmp/o2 folder looks like, seems the format of *count.gz file under the Pan_troglodytes_C1_scTEtmp/o2 are incorrect.

@bio-xtt
Copy link
Author

bio-xtt commented Apr 24, 2024

Hello, I used the above command to perform UB,CB filtering on bam, so my bam file should contain these two tags, o1,o2 directory structure is as follows:
image

@jphe
Copy link
Contributor

jphe commented Apr 26, 2024

What's the *count.gz file looks like?

@bio-xtt
Copy link
Author

bio-xtt commented Apr 28, 2024

Hello, my count.gz and bed.gz formats are as follows:
微信截图_20240428090941

@jphe
Copy link
Contributor

jphe commented May 10, 2024

I can't see any problem. Could you try using a different file? Maybe this error is specific to this file. Can you run scTE succssufully for the test.bam file under the scTE folder?

@bio-xtt
Copy link
Author

bio-xtt commented May 14, 2024

I have a total of four samples, then all of them were quantified using cellranger atac, scTE for TE count, the other three were normal, only this sample data showed the following error

@jphe
Copy link
Contributor

jphe commented May 16, 2024

Yes, there are some hidden problems with this input file, I'm also not aware of what issues there might be

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants