Peptide Status Detection

Files:

'create_amylnset.py': Python script used to extract positive data regions and prepare dataset for classification from the protein sequences in fasta format.
'classifier.py': Classifier written in python script using scikit-learn to predict the amyloidogenic regions of size 'n' in a protein sequence.
Data: Protein sequences, both positive and negative regions.

Make sure that all dependancies are pre-installed
Clone the repository as .zip
Extract the contents
The protein sequence for which the amyloidogenic regions are to be predicted in .fasta format is to be renamed as input.fasta and placed in the 'data' directory.
Run the 'classifier.py' script and input the window size.
Clean the 'data/temp' directory if required.

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
data		data
README.md		README.md
classifier.py		classifier.py
create_datasets.py		create_datasets.py
optimal_feature_selection.py		optimal_feature_selection.py