Error in code #9

Open
Danigore25 opened this issue Jan 15, 2023 · 5 comments

Comments

@Danigore25

Hi, I tried to run ClassifyCNV with Python 3 from my Linux terminal, but the output reported a decoding error for byte 0xa8 at position 4. I searched the web but couldn't solve the problem. Could you help me with this?

@tgurbich
Collaborator

Hi! Could you please paste the full command you were running and the entire error message (full output from the terminal)?

@Danigore25
Author

Of course. The full command is:
python3 ClassifyCNV.py --infile ../../MxGDAR_Nofilters.bed --GenomeBuild hg38 --precise

And the error message is:
ClassifyCNV.py Version 1.1.1
Traceback (most recent call last):
  File "ClassifyCNV.py", line 821, in <module>
    cnv_list = parse_infile(infile_path)  # save each CNV as chr_start_end_type and print a new file for BEDTools
  File "ClassifyCNV.py", line 87, in parse_infile
    for line in bed_infile:
  File "/usr/lib/python3.6/codecs.py", line 321, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa8 in position 4: invalid start byte
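
The traceback shows that Python hit a byte it could not interpret as UTF-8 while reading the input file as text, only a few bytes into the file. A minimal sketch for locating such a byte independently of ClassifyCNV follows; the helper names `first_invalid_utf8` and `check_file` are hypothetical and are not part of ClassifyCNV.

```python
def first_invalid_utf8(data: bytes):
    """Return (offset, byte value) of the first byte that breaks UTF-8 decoding, or None."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        # err.start is the offset of the offending byte within `data`
        return err.start, data[err.start]
    return None


def check_file(path: str):
    """Read the whole file in binary mode and report the first invalid byte, if any.

    Reading everything at once keeps the sketch simple; it assumes the
    file is small enough to fit in memory.
    """
    with open(path, "rb") as handle:
        return first_invalid_utf8(handle.read())
```

Calling `check_file` on the input path returns `None` for a file that is valid UTF-8 text. A result such as `(4, 168)` (168 is 0xa8) would match the error above and would suggest that the file is not plain text.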

@tgurbich
Collaborator

Thank you. Would you mind also sharing the input file you are using (MxGDAR_Nofilters.bed)?

@Danigore25
Author

I'm sorry, but I'm not able to share the input file. However, I can describe its structure.
Using the command xxd -b, I could view my .bed file, and I saw a series of eight columns. The first contains eight digits (some of which can be letters) followed by a colon. Next come six columns, each with a code made up of 0s and 1s, and a final column with a security code (with dots and other characters).

@tgurbich
Collaborator

Could you please check that, when you view your input file as text rather than as bit strings, the only characters in the file (besides whitespace) are digits and the words "chr", "DEL", and "DUP"? The input file should look like this: https://raw.githubusercontent.com/Genotek/ClassifyCNV/master/Examples/1000Genomes.hg38.bed
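
The check described above can also be done programmatically. The sketch below is not part of ClassifyCNV. It assumes, based on the linked example file, that each non-empty line has four whitespace-separated fields: a chromosome name starting with "chr", a start coordinate, an end coordinate, and a CNV type of DEL or DUP. The function name `find_bad_lines` is hypothetical.

```python
import re

DIGITS = re.compile(r"[0-9]+")
CNV_TYPES = {"DEL", "DUP"}


def find_bad_lines(lines):
    """Return (line number, text) for every non-empty line that does not
    match the assumed layout: chr<name>, start, end, DEL or DUP."""
    bad = []
    for number, line in enumerate(lines, start=1):
        fields = line.split()
        if not fields:
            continue  # blank lines are ignored in this sketch
        looks_ok = (
            len(fields) == 4
            and fields[0].startswith("chr")
            and DIGITS.fullmatch(fields[1]) is not None
            and DIGITS.fullmatch(fields[2]) is not None
            and fields[3] in CNV_TYPES
        )
        if not looks_ok:
            bad.append((number, line.rstrip("\n")))
    return bad
```

To run it on a file that may contain non-text bytes, the file can be opened with `open(path, encoding="utf-8", errors="replace")` so that undecodable bytes are replaced instead of raising an exception; such lines would then show up in the returned list.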
