Initiated integration of ColabFold-AF3 pipeline. Draft state of AF3 json input generation. #680
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Major
batch.py
--af3-json
to specify AF3 json generation.--fasta-type
to handle non-protein molecules in fasta file.The header should be
> entityID | entityType | numberOfEntity
e.g. FASTA file with 1 homodimer and 3 ligand
generate_af3_input
processes fasta and a3m files and produce json files if specified--af3-json
. (This will not proceed with structure inference.)get_queries_af3
handles fasta files containing complex with non-protein molecules if--fasta-type
is 1.parse_fasta_af3
is called byget_qureries_af3
to parse single fasta file .utils.py
AF3Utils
to fill out elements in json file.Subsidiary
msa2json.py
This code was adopted from Yoshi's code. It generates json inputs from given a3m file / directgory contains a3m files.
af3json.py
Adaptation of msa2json.py to generate default AF3 json format. This json will make AF3 to produce MSAs using their own pipeline.
Progress
(each molecule types, msa-pairing, template, custom ccd, RNA MSA, modifications, etc.)