SchNetESM

SchNetESM adapts SchNet, a model for learning quantum interactions, by incorporating ESM embeddings for residues. Here, the SchNet model is trained to make predictions for whole proteins rather than for individual small molecules.

Inputs

PDB files for each protein of interest
ESM embeddings for each protein of interest. For the example data provided, the ESM embeddings were generated from the individual chain sequences.
The labels for each protein of interest. For a SchNet-based prediction, these should correspond to potential energies.

Modifications

This model makes the following modifications to, and assumptions about, the original SchNet model:

1. Because large proteins are likely to come with few training samples, we do not use every atom in the protein for prediction. Instead, each residue is represented by its C-alpha atom, which should correlate well with residue-level potential energy while reducing the risk of overfitting.
2. Because of (1), and because we use ESM embeddings, we do not use nuclear charge information when computing potentials. It is probable that most energetic interactions between proteins can be learned at the residue level.
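To illustrate the C-alpha reduction described above, here is a minimal sketch that keeps one C-alpha atom per residue from PDB text. It is not the repository's code: the function name `parse_ca_atoms` is hypothetical, and it reads the fixed-column ATOM records directly with the standard library so that it runs without dependencies. In this project the same step would more likely be done with BioPython.

```python
def parse_ca_atoms(pdb_text):
    """Return (chain_id, residue_number, residue_name, (x, y, z)) per C-alpha atom.

    Hypothetical helper for illustration; relies on the fixed-column layout
    of PDB ATOM records.
    """
    ca_atoms = []
    for line in pdb_text.splitlines():
        # Skip everything except ATOM records (HETATM, TER, etc. are ignored),
        # and skip lines too short to hold coordinates.
        if not line.startswith("ATOM") or len(line) < 54:
            continue
        # Columns 13-16 hold the atom name; the C-alpha atom is named "CA".
        if line[12:16].strip() != "CA":
            continue
        # Column 17 is the alternate-location flag; keep only the first one
        # so each residue contributes a single C-alpha atom.
        if line[16] not in (" ", "A"):
            continue
        chain_id = line[21]
        residue_number = int(line[22:26])
        residue_name = line[17:20].strip()
        xyz = (float(line[30:38]), float(line[38:46]), float(line[46:54]))
        ca_atoms.append((chain_id, residue_number, residue_name, xyz))
    return ca_atoms
```

Calling `parse_ca_atoms(open(path).read())` on a PDB file yields one entry per residue, which can then be paired with that residue's ESM embedding.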

Dependencies

This model depends on several packages:

BioPython
PyTorch
PyTorch Geometric

Running/Training

The run.py script runs and trains this model. Its usage is as follows:

usage: python run.py [-h]
                  [--hidden_channels NUMBER_OF_HIDDEN_CHANNELS]
                  [--num_filters NUM_FILTERS]
                  [--cutoff INTERACTION_DISTANCE_CUTOFF]
                  [--num_interactions NUMBER_OF_INTERACTION_BLOCKS]
                  [--max_neighbors MAXIMUM_NUMBER_OF_NEIGHBORS]
                  [--readout AGGREGATION_READOUT]
                  [--batch_size]
                  [--epochs]
                  [--log_interval]
                  [--esm_embed_path]
                  [--pdb_path]
                  [--labels_file]

required arguments:
  esm_embed_path        Path to the directory of ESM embeddings
  pdb_path              Path to the directory of PDB files
  labels_file           Path to the file containing labels
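
As a rough illustration of what the `--cutoff` and `--max_neighbors` options control, the sketch below builds a radius graph over C-alpha coordinates: each residue is connected to the residues within the cutoff distance, capped at a maximum number of neighbors. This assumes the options carry the same meaning as the `cutoff` and `max_num_neighbors` parameters of the SchNet model in PyTorch Geometric. The function name `radius_graph_sketch` is hypothetical, and keeping the nearest neighbors when the cap is exceeded is a choice made here that may differ from what the library does.

```python
import math


def radius_graph_sketch(coords, cutoff, max_neighbors):
    """Return directed edges (source, target) between nearby points.

    Hypothetical pure-Python illustration, quadratic in the number of points.
    """
    edges = []
    for target, p in enumerate(coords):
        candidates = []
        for source, q in enumerate(coords):
            if source == target:
                continue  # no self-loops
            distance = math.dist(p, q)
            if distance <= cutoff:
                candidates.append((distance, source))
        # Assumption: when more than max_neighbors points fall inside the
        # cutoff, keep the closest ones.
        candidates.sort()
        for _, source in candidates[:max_neighbors]:
            edges.append((source, target))
    return edges
```

A larger cutoff or neighbor cap gives each residue more interaction partners per interaction block, at the cost of more edges to process.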
