Click here to test out the classifier yourself. (temporarily offline!)
Retinal optical coherence tomography (OCT) is an imaging technique used to capture high-resolution cross sections of the retinas of living patients. It has become a standard imaging modality for diagnosis and treatment management of disease leading to vision loss with nearly 30 million scans acquired annually (Swanson and Fujimoto, 2017). Age related diseases such as age-related macular degeneration (AMD) and diabetic macular edema (DME), choroidal neovascularization (CNV) are the leading causes of blindness worldwide (Varma et al., 2014, Wong et al., 2014), and are likely to become more prevalent with an aging population. Early diagnosis of CNV and DME are crucial, as delayed treatment could lead to irreversible loss of vision.
Figure 1. (Far left) shows choroidal neovascularization (CNV) with neovascular membrane (white arrowheads) and associated subretinal fluid (arrows). (Middle left) Diabetic macular edema (DME) with retinal-thickening-associated intraretinal fluid (arrows). (Middle right) Multiple drusen (arrowheads) present in early AMD. (Far right) Normal retina with preserved foveal contour and absence of any retinal fluid/edema.
source: http://www.cell.com/cell/fulltext/S0092-8674(18)30154-5
Optical coherence tomography (OCT) images (Spectralis OCT, Heidelberg Engineering, Germany) were selected from retrospective cohorts of adult patients from the Shiley Eye Institute of the University of California San Diego, the California Retinal Research Foundation, Medical Center Ophthalmology Associates, the Shanghai First People’s Hospital, and Beijing Tongren Eye Center between July 1, 2013 and March 1, 2017. Before training, each image went through a tiered grading system consisting of multiple layers of trained graders of increasing expertise for verification and correction of image labels.
Source: http://www.cell.com/cell/fulltext/S0092-8674(18)30154-5
We constructed an ensemble CNN classifier to diagnose the most common retinal diseases and improve upon the results presented by Kermany et. al 2018. The reported method consisted of the InceptionV3 network pretrained on the ImageNet dataset, then fine-tuned on a set of target OCT images. We extended this architecture by utilizing several classifiers in an ensemble fashion.
Our method consists of four CNNs widely used CNNs in image classification tasks (VGG16, ResNet50, Xception, InceptionV3). These networks were pretrained on the ImageNet dataset, then fine-tuned on 1000 OCT images (250 NORMAL, 250 CNV, 250 DME, 250 DRUSEN) labeled by a team of expert clinicians. An independent test set of 1000 OCT images (250 NORMAL, 250 CNV, 250 DME, 250 DRUSEN) was used to compare each model (Fig. 2). To reflect the potential clinical use of these predictions, labels were also binarized into two groups: URGET (CNV, DME) and NON-URGENT (NORMAL, DRUSEN).
Figure 2. Comparison of several ImageNet pretrained CNNs, fine tuned on OCT scans, and evaluated on the test set.
Although we were able to achieve superior performance on the test set with Xception, even higher performance was obtained by combining the predictions across all the model (Fig. 3).
Figure 3. Performance comparison of different methods of combining outputs of VGG16, ResNet50, InceptionV3, and Xception.
Our comparison found that averaging probabilities across all models as a simple yet accurate technique which improved above the performance of any individual model, as well as the previous published results. This method yielded a 4-class accuracy of 96.4% (in distinguishing CNV, DME,NORMAL, DRUSEN), which is a 3% improvement on what was reported by Kermany et. al 2018 on the same test data. For the binary classification task of URGENT vs NON-URGENT the ensemble CNN achieved an accuracy of 97.5%, sensitivity of 98.6%, specificity of 96.5%, and ROC AUC of 99.8% (Fig. 4).
Figure 4. ROC plots for proposed method (ensemble classifier) and InceptionV3 for URGENT vs NON-URGENT OCT image classification. Trained and tested on the same datasets.
Download the (Anaconda Distribution)[https://www.anaconda.com/distribution/]
Create new envornment by entering the command conda create -n <yourenvname> python=3.6
Activate new environment activate <yourenvname>
(windows) or source activate <yourenvname>
Install the packages in requirements.txt conda install --file requirements.txt
.
The data used for training and testing may be found here and here
The dataset is organized into 3 folders (train, test, val) and contains subfolders for each image category (NORMAL,CNV,DME,DRUSEN). There are 84,495 OCT images (JPEG) and 4 categories (NORMAL,CNV,DME,DRUSEN).
Images are labeled as (disease)-(randomized patient ID)-(image number by this patient) and split into 4 directories: CNV, DME, DRUSEN, and NORMAL.
From the root project directory run train.py
with the following arguments:
-iTrn
Path to the training data directory. Must contain 4 subdirectories (NORMAL,CNV,DME,DRUSEN) corresponding to the image labels. Simply use the path to the downloaded training data
-iVal
Path to the val data directory (optional). This is used for stopping the model training at the lowest validation loss. If not provided, a portion (10%) of training data will be used for validation.
-n
Number of sample to use during training (default=1000).
example:
python ./train.py -iTrn "./RawData/OCT2017/train" -iVal "./RawData/OCT2017/val"
This will automatically save the contents of the model to the modelOutput
directory.
To predict the class of a new OCT image, run predict.py
with the following arguments
-i
Path to directory of new oct images. Images must be in the root folder.
-o
Path to directory where output will be generated.
-m
Path to trained model directory.
example:
python ./predict.py -i "./RawData/testDir" -o "./modelOutput/metaClf_2019-06-22" -m "./modelOutput/metaClf_2019-06-22"
Andrei Mouraviev
Eric Chou
Data: https://data.mendeley.com/datasets/rscbjbr9sj/2
Citation: http://www.cell.com/cell/fulltext/S0092-8674(18)30154-5
Inspiration
Automated methods to detect and classify human diseases from medical images.