Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error in determining bac_sequence #8

Open
peng-ye opened this issue May 12, 2022 · 1 comment
Open

Error in determining bac_sequence #8

peng-ye opened this issue May 12, 2022 · 1 comment

Comments

@peng-ye
Copy link

peng-ye commented May 12, 2022

Dear authors,

I found some IDs have no sequence in the resulting fna file. From what I saw, all those sequences should start at "0". I.e., the corresponding IDs look like xxxxx|0:\d+|DBSCAN-SWA (see below). I think there is sth wrong in determining the boundary for bac_sequence.

Another observation supporting this is that many sequences start with "[T|G|C]ATG", but not "ATG". It seems like the window should slide to the right by one base.

Would you please help check it out? Thanks.

Screen Shot 2022-05-13 at 00 16 44

@gancao
Copy link

gancao commented May 14, 2022

I am sorry for the miss. I parsed protein locations using python package "Bio". The start location added 1 base automatically . Now I have updated dbscan-swa.py on https://github.com/gancao/DBSCAN-SWA-1

Thanks for your interest in DBSCAN-SWA. If you have any other questions, please comment on github

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants