Abstract

Due to recent advances in computer technology and the accessibility of large datasets, deep learning has become at the forefront of artificial intelligence and on various tasks, especially those related to image classification and modelling, its performance is often equal to or even better than human appreciation. Ophthalmology has always been in an ideal position to employ one of CNN’s most popular deep learning algorithms to evaluate vast volumes of data from these tests since it is one of the health practices that focuses extensively on imaging. Glaucoma is one among the situations in which CNN can benefit from the enormous quantity of data collected by tests that assess the anatomy as well as function of the optic nerve and macula. We discussed the recommended use of CNN algorithm for specific glaucoma scenarios, such as fundus photography screening and diagnosis and detection of the course of glaucoma through OCT imaging modality. The purpose of this article is not only to critically examine and discuss the latest applications of CNN models in glaucoma but also to focus on the challenges associated with developing such models for screening, diagnosis, and progress detection. After a brief overview of the clinical practices and their comparison with conventional clinical methods, we discussed training and validation of CNN algorithm and how it was developed and why it is particularly suitable for glaucoma. The following features make our contribution worthwhile and unique among the reviews of similar kind: (i) our review classifies the existing literature with respect to detection of the glaucoma disease using conventional and nonconventional approaches; (ii) it covers a very different outlook of the glaucoma disease by providing in-depth discussions of the existing works at different granularity levels, that is, from primary to mediatory to the severe level; (iii) this state-of-the-art review covers each article in the following dimensions: the computer-based approach to tomographic model; analysis of different datasets; and summarizing the literature review in a disciplined way by mentioning the research gap concluded with discussion on future work.

1. Background

According to a study by the World Health Organization, 64.3 million people worldwide are affected by this disease and it is estimated that the disease will grow to 80 million in 2020 and to 111.8 million in 2040. These figures are expected to increase by 2050. For example, the number of blind people in Canada in 2013 was approximately 500,000 and it is estimated that 50,000 people will become blind each year [1]. Glaucoma is one of the main diseases that cause blindness. Treating blindness and compensating for an individual’s loss of ability to find meaningful work is a costly expense for individuals and for society. For example, in the UK, the burden of vision loss and blindness has increased from around 143,600 to over 154,600 disability-adjusted life years, while the disability-adjusted life years in the UK for various reasons have declined. In 2013, the total economic cost of vision loss and blindness in the UK was estimated to be between $2.4 billion and $35.5 billion [2]. The quantity of glaucoma patients within the United States is predicated to increase from 2.7 million in 2010 to 6.3 million in 2050 and by the end of 2020, the amount of glaucoma sufferers aged 40 to 80 was estimated to increase to 64.3 million and by 2040 to 111.8 million [3]. The profile of glaucoma patients in the coming decades is depicted in Figure 1.

2. Introduction

Glaucoma is the primary vision impairment diagnosed around the world. The main cause of glaucoma is the slow death of retinal ganglion cells (RGC) and increased intraocular pressure in the eye. This happens due to the increased mass of fluid in the eyeball. A raised bulge in the eye can damage nerve bundles (axons) in the nerve fiber layer of the retina (RNF). It starts with a negligible problem and the breakdown of the neural chain as shown in Figure 2 below. These malformations are so trivial that they are generally incomprehensible in the early stages of glaucoma. The most common cause of visual impairment and possible blindness is that glaucoma is difficult to diagnose and treat early on. In Singapore, around 90% of glaucoma patients are unaware of their condition. In Australia, almost 50% of glaucoma patients are still unaware of their condition. In the United States, approximately 4 million people have experienced the side effects of this disease, and half of them are conscious of their disease again. According to a seminar by Dr. Mirza Shafiq, almost half of the 1.8 million glaucoma patients in Pakistan are permanently blind, unfortunately 2% of them are children [6, 7]. Aside from early detection and treatment, there is no cure for glaucoma [8]. The high proportion of undiagnosed patients indicates the need for major changes to existing diagnostic procedures. Early detection depends on manual observation by the ophthalmologist. This is expensive and can go wrong. This is because not all ophthalmologists have the necessary experience to correctly diagnose glaucoma. Subsequently, most of the patients were either undiagnosed or misdiagnosed. Therefore, glaucoma can progress and cannot be treated until it is discovered. So, it is very important to develop automated tools that will aid in the early detection of glaucoma as it will have a positive impact on people and the economy. Therefore, it is very important to develop automated methods for the early detection of glaucoma. These methods will use the experience of various ophthalmologists to get reliable results.

2.1. Types of Glaucoma

Various sorts of glaucoma are available all over the world, while the two significant categories of glaucoma are open-angle and angle-closure glaucoma, which can be answerable for development in intraocular pressure. Open-angle glaucoma [9] that is also known as primary open-angle glaucoma (POAG) is a common type of glaucoma, which appeared to be the main leading cause of glaucoma spread around the world. According to the Glaucoma Research Centers, almost more than 90% cases are because of open-angle glaucoma. Primary open-angle glaucoma is a cutting-edge optic neuropathy and, perhaps, the most widely recognized type of glaucoma. Angle-closure glaucoma [10], which is also acknowledged as closed angle, occurs due to complete aqueous fluid blockage leading to a critical rise of IOP. The pressure rises so speedily that it may affect the vision in a swift manner, and it demands instantaneous medical attention.

Glaucoma is characteristically found out by means of an affected person after an extended period of disease progression. Over the years, the same procedures have been used by the ophthalmologists to diagnose the glaucoma. Those procedures depend exclusively on ophthalmologists’ manual interpretations and therefore 80% of glaucoma cases are not detected at their early stage. Consequently, it is crucial to process everyday eye tests on the way to find the symptoms of glaucoma and to deal with symptoms as early as possible.

3.1. Conventional Clinical Methods to Detect Glaucoma

Tonometer instrument [11] is used to measure IOP in the retina. Currently, numerous techniques of tonometry are in practice, among which Goldmann is widely adopted. Due to irregular cornea thickness, tonometry will not be able to produce proper measurements. To overcome this issue, Pachymetry test is getting used by ophthalmologists. It helps to regulate thickness cornea, as it is significant because the patient with thinner cornea and authentic IOP may be underestimated and with thicker cornea may be overestimated. If ophthalmologists are not sure about glaucoma identification even after performing tonometry and pachymetry, then other tests (i.e., gonioscopy, perimetry, and ophthalmoscopy) are conducted [12]. Perimetry is used to check patient vision by measuring the light sensitivity by detecting targets. However, this test is time-consuming because it may take three attempts before an accuracy is obtained. Thus, results may be compromised, and the accuracy of this method remains questioned. To get the clean and greater correct result and to determine the thickness of RNFL, ophthalmologists are going for 3D imaging modalities. However, these tools are not available everywhere as they are expensive and furthermore, clinics, in particular, practice these procedures even though entirely different tests specify a conceivable glaucoma.

3.2. Computer-Based Generic Methods to Detect Glaucoma
3.2.1. Fundus Image Analysis

One of the complex retinal imaging procedures to detect glaucoma is using fundus camera [13, 14]. A digital fundus camera equipped with a low-light microscope can record images of the entire retina and the posterior pole of the eye. The circular area termed as papilla is responsible for the connection of optic nerve to retina [15]. Disc region surrounding the lighter area is termed as cutting area. The disc and cup diameter of fundus and OCT are shown in Figure 3.

Singh et al. [17] extract the green channel so one can extract the cup and disc area from the fundus image. The green channel contains most of the information needed to identify the center of disc. Select the Kaiser window for all lines and maximum values of the image that takes the X-coordinate of the intervertebral disc into account. However, this type of method relies only on the intensity of the pixels to locate the disk center. If other shiny artefacts like exudate and PPA are present, they will not give accurate results. Ahmad et al. [18] employed morphological operations for the extraction of optic disc in fundus images. Extracted cup and disc region is further used to calculate cup-to-disc ratio (CDR). The relationship between the boundary regions of the retina is also extracted from the features to assist with the detection of glaucoma patients. The credibility of established system was tested on small dataset on 80 fundus images and received a 97.5% accuracy rate. Panda et al. [19] used a different technique to detect the disc center. In order to detect symmetrical lines for considering the properties of thick blood vessels, morphological operation was engaged. The second step involves the detection of optic disc center by counting the highest vessel component. In the third step, the author finalizes the exact location of optic disc center by concluding a local symmetry line along with radius. This approach is verified on nine publicly available datasets. Nevertheless, the proposed approach is not able to perform well on low-quality images or if the number of blood vessels is not sufficient. Fraga et al. [20] offered a method for the optic disc segmentation by introducing different stages. Retinex algorithm was used to normalize the retinal images to increase the process reliability and to diminish the disparity variability. The technique achieved 100% of optic disc localization. However, this approach did not involve pathologic retinal images that are affecting the optic disc overall accuracy. Salam et al. [21] proposed a unique method to classify fundus images into three categories: glaucoma patients, suspicious specimens, and nonglaucoma images. The method was tested on a local dataset containing 100 fundus images of 26 glaucoma patients and 74 nonglaucoma patients. Haleem et al. [22] proposed a methodology for segmenting and localizing the optic disc using vasculatures convergence and weighted feature maps. They have used the RIMONE and SLO datasets and obtained an average accuracy of 93.9%. The authors take the test images and minimize the distance between the normal profiles of future maps by estimating the contour and in training set by calculating the mean of the images. Table 1 shows analysis for glaucoma detection using fundus images.

3.2.2. Optical Coherence Tomography (OCT) Image Analysis

OCT is a revolutionary imaging approach that is extensively used by ophthalmologists worldwide. OCT images provide detailed information about the layers of the retina. Optical coherence tomography (SD-OCT) is often used to improve retinal assessment measures [27].The image affected by glaucoma has a larger cup diameter than the normal OCT image, as presented in Figure 4.

Siesky et al. [23] inspected progression of open angle glaucoma within 5 years of time. A total of 112 open-angle glaucoma patients were examined after every six months for the period of 5 years. Different perimetry devices like visual field, OCT, and Heidelberg retinal tomography were considered to observe the structural changes in eyes. Their findings concluded that both RNFL and ONH were associated with practical glaucomatous progression and visual field lost in patients during 5 years of examination. Rao et al. [24] tested 46 control and 61 glaucoma eyes. By measuring the thickness, the functional ability of glaucoma diagnosis was compared. They concluded that SDOCT is more suitable for measuring thickness. Gopinath et al. proposed a new method [25]. OCTA (Optical Coherence Tomography Angiography) imaging was used for glaucoma diagnosis. Capillary density (CD) and RNFL thickness are two estimated features that classify OCTA images as glaucoma or nonglaucoma images. After the capillary network is identified as a region of interest (ROI), measure the CD of eight sectors. The CD and RNFL thickness values of the evaluation parameter glaucoma detection system are lower than normal OCTA images in glaucoma patients. Nieves-Moreno et al. proposed an algorithm for detecting glaucoma on SD-OCT images of the retina [26]. Abdel-Hamid [28] proposed an automated glaucoma detection device primarily based on the frequency area discrete wavelet transform (DWT) of preprocessed OCT images. The noise of the image is removed by using median filter. After domain wavelet transform decomposition, the wavelet coefficients are the attributes extracted inside the frequency area.

3.2.3. Databases

Distinctive globally and locally accessible fundus and OCT datasets are accumulated for healthy and glaucoma patients. More information such as the names of the datasets, the images available in each dataset, the provision of the datasets, and the technology applied to them is depicted in Table 2. The HRF dataset contains 30 images including 15 healthy and 15 glaucomatous images with a resolution of 3504  2336. Pathan et al. used the HRF dataset in a real-time detection system and achieved a precision of 96.7% [29]. The Drishti database contains 101 images with a resolution of 2049  1757, which are used for analyzing glaucoma. Its source is the publicly accessible Aravind Eye Hospital in India. Patil et al. [30] used the Drishti dataset to classify retinal features and obtained an accuracy of 97.4. The glaucoma DB dataset comprised of 452 images with a resolution of 1504  1000 for analysis of glaucoma. This will contain different features like hybrid features, optic disk, and classification to classify glaucoma [31].

3.2.4. Evaluation of OCT and Fundus for Glaucoma Detection

An evaluation among fundus and OCT images for differentiating between ordinary and glaucoma sufferers is provided for further amplification. As shown in Table 1 and Figure 5 below, various methods in these two categories have been proposed and discussed. The researchers use various algorithms such as the Otsu method, the area growth algorithm, the climbing algorithm, the FCM clustering algorithm, and the optical pan segmentation method to segment the optic disc area and calculate the CDR value in the fundus image. In order to use OCT to assess glaucoma, various algorithms are provided. In order to segment the retinal nerve fiber and retinal pigment epithelium layer, segmentation algorithm is used. This layer-based segmentation allows to calculate the CDR value after segmenting optic disc and cup area. The CDR measurement of OCT imaging has been shown to be closer to the clinical outcomes of glaucoma patients. Nithya et al. [32]. A comparison was performed between fundus and OCT images to determine performance error. From experiment analysis, it is clearly indicated that glaucoma can be diagnosed more efficiently with OCT images. Due to the glaucoma, the deformity occurs in the cup-shaped area and can be seen more precisely on the OCT image, as it provides information about the retinal layer.

4. CNN-Based Methods to Detect Glaucoma

In recent years, deep learning technology has improved the technical level of classification, segmentation, and recognition of objects in medical and ophthalmological images; the summary of these is presented in Table 3 further in this section. As compared to traditional machine learning approaches that rely heavily on expert domain knowledge to employ functions, CNN is a network that maximizes the network capacity by learning functions to move between healthy and infected images. There are a variety of CNN architectures designed for image segmentation as well as classification. Each of these architectures is different in terms of number and size of functions, how these functions will be connected, and what would be the entire depth of the network. This is because different architecture is suited for different problems as per the nature of problem and it is challenging to know the priority of network, which architecture would be the right choice for a given task. The best way to make the decision in these circumstances is empirical testing.

Zilly et al. [33]. Using the CNN network with VGG-19, the specialized layer is expanded to segment the optic disc. A stereoscopic image is an image that uses multiangle images combined to represent a 3D model of the retina. They achieved a mean F-score of 96.5% and a mean error rate of 5 pixels with a maximum margin of 16 pixels. More than 50% of the images have an error rate of more than 5 pixels. However, the marginal error of the RIMONER3 image is greater than that of DRIONS dataset. This shows the need to improve edge detection, especially in the presence of exudates. Additionally, cross-validation will be a more powerful verification technique than simply sharing the dataset during training and testing. Tan et al. [34] presented a CNN-based sample methodology that reduces computational complexity. Output of the last layer is entered as input from the new layer. New type of Softmax classifier is used consistently for the output of all filters. It is then applied to the test image. This method accomplished 95.60% specificity, 94.10% accuracy, and sensitivity 92.30% on DRISHTI-GS1 dataset. However, to train a CNN using few images seems incredible. This study proposed a method to learn CNN when dataset is small. By having CNN trained on less data means less parameters can be reliably trained; therefore, more advancement in this method is needed for better accuracy. Orlando et al. [35]. After normalizing the image, a seven-layer CNN is used to segment the optic disc. CNN has been trained in back propagation and stochastic gradient descent. Softmax is used for the linear units of the sixth and first layers and the activation of the leaky rectifiers of the third and fifth layers is used. This method achieved a sensitivity of 87%, a specificity of 99%, and an overlap error rate of 62%. Select the drive dataset to produce the results of the proposed method. However, the treatment of exudate and PPA is not discussed. Furthermore, the proposed method was tested on 20 images only. This small dataset is certainly not sufficient to guarantee the correction and accuracy of the overall results. Orlando et al. [36] used two different architectures VGG-S and OverFeat to develop a CNN model for automatic glaucoma detection. For confirmation from current CNN (OverFeat and VGG-S), pretrained architectures for images without fundus images are implemented. Preprocessing takes place first for these newly acquired fund images to improve quality; then, ONH segmentation is applied. The proposed method influences 76.30% and 71.80% as AUC values. However, the author did not perform extensive comparison with other state-of-the-art strategies as most of the methods assessment used their own private dataset.

Chai et al. [37] used the concept of ant colony for optic cup segmentation in order to determine glaucoma in an early stage. The reason to use ant colony because it is a probabilistic technique, which makes sure to find an optimal path that will minimize the need to find a path through graph. The approach obtained an average overlap error of 24.3%. Nevertheless, the approach to the ant colony suffers from an important limitation, namely, the uncertainty at the time of convergence. How this was handled was not discussed in the paper too. Zilly et al. [33] presented a CNN twofold structure. Segmented optics as input the disc was escalated to the CNN model and the entire CNN model was fed with the whole image. Likewise the concatenation was performed on the CNN model and a fully connected layer was used for classification. This method reaches 81.69% as a classification precision for the Beijing Tongren China Hospital dataset. Jiang et al. [38] improved their existing work by applying CNN to extract optic cup for diagnosis of glaucoma. To get around the challenge of using CNN (which requires a large dataset) discussed above, the authors incorporated a colearning approach. With a marginal error of 11.8 pixels, an average F value of 84.75% was obtained. In terms of the cup, this is a great achievement, yet further improvements in performance and accuracy are possible. Furthermore, the authors did not test their method on other publicly accessible datasets in order to compare it to other methods; therefore, they were unable to validate its robustness. Jiang et al. [39] use the DCNN multilabel model with GL-Net. This method achieved 90.50% and 97.10% of the F1 values for the optic disc and cup, respectively. Ahan et al. [40] used CNN for diagnosis of glaucoma. Two convolutional and hidden layers along with max pooling were applied. ReLU served as an activation function. The weights were assigned with Xavier initializer. The proposed model was tested on small dataset collected from Kim’s Eye Hospital. The total 1542 images were collected from which 756 were glaucomatous and 786 were nonglaucomatous. This method achieved 87.9 accuracy by using fundus photography. However, they have used a small dataset and therefore unable to verify the robustness of their proposed solution. Jiang et al. [41] diagnose glaucoma by segmenting the optic cup and disc using the CNN algorithm. The proposed end-to-end algorithm is called the ordinary RCNN. Porous folding plays an important role in improving performance. Two datasets named SECS and ORIGA evaluate this approach. In the dataset, AUCs of 90.10% and 85.40% were achieved. Christopher et al. [42] used a large ONH oblique fundus image database to evaluate several different CNN architectures based on the accuracy of glaucoma diagnosis. They achieved an AUC of 0.97 for GON eyes with moderate-to-severe loss of function. However, for the current study, the longitudinal section of the data is not taken into account, but the dataset is used to train the model, and each image is viewed separately. The authors of [43] concluded that the standard CNN deep learning model is not designed to handle and compute the longitudinal data. For the OC (Optical Cutting) and OD (Optical Disc) segmentation, an improved U-Net Convolutional Neural Network technology based on deep learning is proposed. With the help of deep learning, U-Net and CNN technologies have a very simple framework that takes much less time to implement and predict, which is much better than the most advanced technologies. The proposed method is implemented on the RIM-ONE and DRISHTI-GS datasets that provide RIM-ONE with Dice 0.82 and IOU 0.69 performance metrics, and DRISHTI-GS provides Dice 0.85 and IOU 0.75 performance metrics. The paper does not report accuracy of glaucoma detection. Raghavendra et al. [4] detect glaucoma through convolutional neural network. The proposed model consists of three steps with 15 layers CNN. Drishti dataset is used to evaluate the proposed model. This method obtained 89.4% accuracy. Despite the fact that deep CNN requires a huge number of datasets for training, it is feasible to gain improved accuracy by training the model with a big number of frames.

Raghavendra et al. [44] use CNN algorithm to diagnose glaucoma. The pixels of the input image are allocated hierarchically to estimate the background image. The private dataset is used to train the CNN algorithm from scratch. This model is most suitable for a large number of dataset images. The overall accuracy rate reached 98%. However, due to the large number of images required for training, this method cannot be used for small datasets. Mookiah et al. [33] use histogram and random transformation for fundus imaging to diagnose glaucoma. In addition to processing, properties such as frequency spectrum and wavelet transformation can also be extracted from the image. The SVM classifier later uses this information to classify the image as normal or glaucoma. The method was tested on a small private dataset with 60 images. It achieved an accuracy of 95%. For clinical implementation, the proposed method should be evaluated using a large dataset with characteristics from the domain. Julian et al. [45] use convolutional neural networks for OD and OC segmentation. The researchers introduced technologies based on community learning. Convolution filter framework are designed by using filter points. Use the output of the filter to train the final flexible logistics classifier. The output of the classifier is divided by the intersection of the graph and the convex envelope. Use OD and OC to calculate the segmented CDR. The method is evaluated using the Drishti dataset, which consists of 101 images. The three coefficients obtained by dividing the OD and OC are 97% and 87%, respectively. However, for OD classification, this approach works well. The topic of optimizing network parameters in order to decrease computing effort is a challenge. The review of glaucoma diagnosis using CNN is presented in Table 3 and Figure 6.

5. Conclusion

The development of convolution algorithms for neural networks has opened up the possibility of improving the diagnosis of glaucoma in the test environment and in clinical practice. Not only does the CNN algorithm reduce the tedious work of manually reviewing annotated images, it can also accurately identify glaucoma in OCT and fundus photos, which can improve its usefulness in high-pressure environments with limited resources. The algorithm can also identify other features on the OCT B-scan that are important in diagnosing glaucoma or its progression. However, the performance of the CNN algorithm is heavily influenced by the data and image quality used for training and testing. It is important to use a more representative dataset that includes multiethnic populations and related pathologies of the retina and optic nerve head. Additionally, CNN may perform better on test images obtained from the same camera used for the training set. Hence, more training and validation are needed to support the data obtained on different platforms. In addition, when applied to images taken in real-world conditions where the prevalence of glaucoma is much lower, it is particularly important to ensure that CNN is highly specific to glaucoma. Future research should apply these algorithms under real-world conditions to see if they perform well in clinical and screening scenarios when compared to human practitioners.

Previous studies evaluated the burden of vision loss in Pakistan in terms of prevalence of eye diseases alone. We used both prevalence and yield methods to investigate the vision loss burden in our study. This enabled us to compare the burden of vision loss in Pakistan with those in other countries and other diseases in a more detailed manner. There is no medication for glaucoma still, and vision loss is irretrievable; consequently, molecular diagnostics for predictive assessment and early interference is essential to decrease the influence of visual mutilation and eventually blindness. To accomplish this goal, the requirement is to exemplify all subtypes of glaucoma at the molecular level and recognize loci/genes contributing to this ophthalmic disorder in diverse populaces.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest regarding the publication of this paper.