Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

query_align = next(iter(sequence_dict.values())) && StopIteration #6

Closed
jixing475 opened this issue Mar 6, 2023 · 7 comments
Closed

Comments

@jixing475
Copy link

Hello, I encountered an error during the configuration and usage process.

I was able to run the original version of AlphaFold successfully after configuring the environment.

However, when attempting to predict the GPCR state, I encountered an error.

query_align = next(iter(sequence_dict.values()))
StopIteration

image

Additionally, the following files were generated but were empty: "uniref90_hits.sto" and "mgnify_hits.sto".

@huhlim
Copy link
Owner

huhlim commented Mar 6, 2023

As far as I understood, you tried to run our scripts locally, and running the original AF protocol using our scripts was successful. If so, it is strange because our protocol runs the exactly same input preparation procedure and modifies MSA features afterward before running AF2's model. So, can you try it again and check if the error is persistent?

@jixing475
Copy link
Author

Hi,

I noticed that the two generated files in the red box appear to be empty(In the below screenshot). Could it be possible that the MSA comparison was not done by default?

Furthermore, I would like to bring to your attention that I may have misrepresented this error by failing to predict the state accurately. running the original AF protocol using your scripts, i get the same error information.

Lastly, I have attached the PBS script that I ran for your reference.

image

Please let me know if you require any further information.

Best regards, Thanks

@huhlim
Copy link
Owner

huhlim commented Mar 14, 2023

It seems like there were empty ".sto" files in the directory before running the AF script. AF2 attempts to reuse those files by default. You need to remove the directory and try it again. The image is log messages when I run the script.
image
If the error persists, you can check if jackhmmer and related databases (uniref90.fasta and mgy_clusters.fa) are configured correctly.

@jixing475
Copy link
Author

Subject: Assistance with AlphaFold Query and Script Issues

Hello, Thank you for your prompt response.

I have thoroughly inspected the input files and data files, and have deleted the previously generated .sto files. After rerunning the script, I discovered an issue with the query step in the program. The query time seems to be too short, as evidenced in the attached screenshot.

I have made a comparison between the jackhmmer.py script provided by the official AlphaFold and the one I am currently using (specifically, the Multi-state GPCR version). As shown in the screenshots below, there are two notable differences. The left image represents the official script, while the right image represents the one I am currently utilizing. Can you please confirm if this is normal?

Additionally, I have scrutinized the input CXCR2.fasta file from the following perspectives and found no issues:

  1. Short input sequence
  2. Presence of non-standard amino acids
  3. Formatting issues

Thank you for your assistance. Should you require any further information, please do not hesitate to contact me(/[email protected]).

@huhlim
Copy link
Owner

huhlim commented Mar 16, 2023

I found something strange from your log message that jackhmmer.py is called instead of an executable binary file jackhmmer. I suspect that you might not have installed the HMMER package or incorrectly configured it. In libconfig_af.py, binary_paths should be directed to executable binary files rather than scripts in alphafold/data/tools directory.
You may refer to an issue or our AF2 local installation script.

I updated the installation script and added a missing file, requirements.txt, which reflect version updates in dependent libraries such as jax, jaxlib, dm-tree, haiku, and etc. Consequently, I also updated the version of alphafold.
I think the difference in jackhmmer.py is okay because the difference originated from a recent (Dec. 15, 2022) commit, and my scripts are based on a version of AF2 before May, 2022. But, it was also updated as I updated the version of alphafold.

@jixing475
Copy link
Author

jixing475 commented Mar 18, 2023

@huhlim

Hello, I wanted to express my gratitude for your invaluable assistance in resolving the issues I was experiencing.

The problem has been successfully resolved. As it turns out, it was indeed a problem with the binary file. However, through reset the environment and adjusting the binary path, I was able to run it successfully.

Once again, I would like to extend my heartfelt appreciation for your support.

@huhlim
Copy link
Owner

huhlim commented Mar 18, 2023

I am glad that the issue is resolved! I hope you find our work helpful :)

@huhlim huhlim closed this as completed Mar 18, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants