Substrate Enzyme Interaction Prediction

The repository includes codes for reproducing work in paper Enzyme Activity Prediction of Sequence Variants onNovel Substrates using Improved Substrate Encodings and Convolutional Pooling, (https://proceedings.mlr.press/v165/xu22a.html). In this work, a new compound protein interaction prediction pipeline is proposed with performance tested on datasets obtained from Machine learning modeling of family wide enzyme-substrate specificity screens (arXiv:2109.03900v1, by S. Goldman and C. W. Coley). The pipeline is based on sequence embeddings generated by protein language models and count encodings of molecule fingerprints.

The figure below shows the prediction model's architechture,

We were able to show a substantial improvements with the new pipeline as we tested the predictions on multiple enzyme-substrate-activity datasets (i.e. aminotransferase, kinase, halogenase, phosphatase, etc. ) as shown in the table below.

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
X_DataProcessing/X00_enzyme_datasets_processed		X_DataProcessing/X00_enzyme_datasets_processed
CDKImpl.class		CDKImpl.class
ModifiedModels.py		ModifiedModels.py
README.md		README.md
X00_Data_Preprocessing.py		X00_Data_Preprocessing.py
X01_TAPE_FineTuning.py		X01_TAPE_FineTuning.py
X02_Generate_Seq_Split_Index.py		X02_Generate_Seq_Split_Index.py
X03_LM_Embeddings.py		X03_LM_Embeddings.py
X04B_sq_sb_y_NN.py		X04B_sq_sb_y_NN.py
X05A_sq_sb_y_CNN.py		X05A_sq_sb_y_CNN.py
X_model_architecture.py		X_model_architecture.py
chemconvert.py		chemconvert.py
chemfuncs.py		chemfuncs.py
license		license

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Substrate Enzyme Interaction Prediction

About

Releases

Packages

Languages

License

LMSE/Compound_Protein_Interac_Pred

Folders and files

Latest commit

History

Repository files navigation

Substrate Enzyme Interaction Prediction

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages