GitHub - nd-hung/CNN-based-Contact-Prediction: Implementation of protein residue-residue contact prediction using fully convolutional neural network

Reimplementation of CNN-based contact prediction models

DeepCov: Fully convolutional neural networks for protein residue-residue contact prediction David T. Jones and Shaun M. Kandathil - University College London

DeepCon: Dilated convolution network with dropout (best reported performing model, Fig.3d)

Datasets

Get train & test data from Bioinformatics Group, UCL DEPARTMENT OF COMPUTER SCIENCE

Data structure

DeepCov/
    setup.sh # script to compile cov21stats 
    feature_extraction.ipynb # script that calls cov21stats to extract features for a dataset
    train.py # training script
    models.py # models
    data.py   # data loader
    predict.ipynb # make prediction on test data with a trained model
    evaluate.ipynb # evaluation 
    read_result.ipynb # read evaluation results
    
    src/ 
       /cov21stats.c  # C source code for covariance stats computation
    
    bin/ 
       /cov21stats  # compiled covariance stats
    
    data/
        train/
            aln/  # contains 3456 aligments files
            21c/  # contains 3456 feature files, each in shape (441, m, m)
            map/  # ground truth
        test/
            psicov150/
                     aln/
                     21c/
                     pdb/ # ground truth
                     rr/  # predicted contact maps

Data preparation

Compile feature extractor

Get the scripts setup.sh and cov21stats at https://github.com/psipred/DeepCov Run setup.sh to compile the extractor:

./setup.sh

Run feature extraction

Run once:

feature_extraction.ipynb

Run training

For the first time training, run:

python train.py [--model=DeepCon] [--gpu=1]

In case of resume training, specified the saved checkpoint file:

python train.py [--model=DeepCon] [--gpu=1] [--resume=DeepCov_checkpoint.pth.tar]

Prediction on test data

Modify the path to prediction folder if needed (default: 'data/test/psicov150/rr')
Run

predict.ipynb

Evaluation

Modify the path to prediction folder if needed (default: 'data/test/psicov150/rr')
Run

evaluate.ipynb

Read results (precision in long-range distance: P@5, P@L/10, P@L/5, P@L/2, P@L)

Modify the result file name
Run

read_result.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
DeepCon_best_model.pt		DeepCon_best_model.pt
Readme.md		Readme.md
combined_models.py		combined_models.py
coneva-lite3.pl		coneva-lite3.pl
data.py		data.py
data_combined.py		data_combined.py
evaluate.ipynb		evaluate.ipynb
feature_extraction.ipynb		feature_extraction.ipynb
models.py		models.py
predict.ipynb		predict.ipynb
predict_combined.ipynb		predict_combined.ipynb
read_result.ipynb		read_result.ipynb
train.py		train.py
train_combined.py		train_combined.py
x_feat1.py		x_feat1.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Datasets

Data structure

Data preparation

Compile feature extractor

Run feature extraction

Run training

Prediction on test data

Evaluation

Read results (precision in long-range distance: P@5, P@L/10, P@L/5, P@L/2, P@L)

About

Uh oh!

Releases

Packages

Languages

nd-hung/CNN-based-Contact-Prediction

Folders and files

Latest commit

History

Repository files navigation

Datasets

Data structure

Data preparation

Compile feature extractor

Run feature extraction

Run training

Prediction on test data

Evaluation

Read results (precision in long-range distance: P@5, P@L/10, P@L/5, P@L/2, P@L)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages