Artificial Intelligence in Glaucoma: Comparison
Please note this is a comparison between Version 1 by Antonio Maria Fea and Version 2 by Conner Chen.

Glaucoma is a multifactorial neurodegenerative illness requiring early diagnosis and strict monitoring of the disease progression. Artificial intelligence algorithms can extract various optic disc features and automatically detect glaucoma from fundus photographs.

  • biomarkers
  • artificial intelligence
  • genetics

1. Artificial Intelligence in Glaucoma

The use of artificial intelligence is expanding rapidly. Machine learning (ML) and deep learning (DL) allowed a more sophisticated and self-programming way to use machines in automatic data analysis. More in detail, in machine learning, a system can automatically improve its performance and learn by itself with experience without being specifically programmed to do so. Specifically, using a convolutional neural network (CNN) architecture, the deep learning algorithm can take in an input image, assign importance (learnable weights and biases) to various aspects/objects in the image, and be able to differentiate one from the other [1][5]. Similar to neurons derived from the mammalian visual cortex, the neural network’s architecture consists of many hidden layers, each with its specific receptive field and connection to a further layer (Figure 1).
Figure 1.
Classical scheme of a convolutional neural network.
The deep learning network works as a two-step process. The first is the feature learning step, in which convolution, pooling, and activation functions make the ‘jump ahead’ between hidden layers. Secondly, the classification function converts the probability value to a label, providing a clinical output such as healthy or pathologic [2][3][6,7].
Although this architecture traditionally provided a high degree of computational power, in recent years, more advanced network architectures have been developed, allowing the system to analyze more complex data sources. AlexNet (2012) was introduced to improve the results of the ImageNet challenge. VGGNet (2014) was introduced to reduce the number of parameters in the CNN layers and improve the training time. ResNet (2015) architecture makes use of shortcut connections to solve the vanishing gradient problem (which is encountered when during the iteration of training, each of the neural network’s weights receives an update proportional to the partial derivative of the error function with respect to the current weight) [4][8]. The basic building block of ResNet is a residual block that is repeated throughout the network. There are multiple versions of ResNet architectures, each with a different number of layers. Inception (2014) increases the network space from which the best network is to be chosen via training. Each inception module can capture salient features at different levels [5][9].
Traditional metrics assessing the DL algorithm’s quality are sensitivity, specificity, precision, accuracy, positive predictive value, negative predictive value, and area under the receiver operating curve (AUC).
It is known that the early detection of glaucoma could eventually preserve vision in affected people. However, due to its clinical history of being symptomatic only in advanced stages and when most of the retinal ganglion cells (RGCs) are already compromised, it is crucial to introduce a tool to detect glaucoma in clinical practice in pre-symptomatic form automatically. Furthermore, it could be of clinical relevance also to find new ways to provide targeted treatment and forecast the clinical progression.

2. Fundus Photography

In clinical practice, ophthalmologists suspect glaucoma by analyzing optic nerve head (ONH) anatomy, cup-to-disc ratio (CDR), optic nerve head notching or vertical elongation, retinal nerve fiber layer (RNFL) thinning, presence of disc hemorrhages, nasal shifting of vessels, or the presence of parapapillary atrophy. However, the diagnostic process could be challenging considering the extreme variance of these parameters [6][10]. It has been shown that agreement among experts on detecting glaucoma from optic nerve anatomy is barely moderate [7][11]. Furthermore, with standard fundus photography, not only the variability of anatomy could be misleading, but also the parameters of acquisition such as exposition, focus, depth of focus, contrast, quality, magnification, and state of mydriasis. In this scenario, artificial intelligence algorithms can extract various optic disc features and automatically detect glaucoma from fundus photographs. For example, Ting et al. [3][7] collected 197,085 images and trained an artificial intelligence algorithm to automatically determine the cup-disc ratio (CDR) with an AUC of the receiver operating characteristic (ROC) curve of 0.942 and sensibility and specificity, respectively, of 0.964 and 0.872. Similarly, Li et al. [8][12] developed an algorithm based on 48,116 fundus images reporting high sensitivity (95.6%), specificity (92.0%), and AUC (0.986). Although the importance of automatically detecting the excavation of the optic nerve head, it is known that high inter-subject variability characterizes CDR; some large optic nerve heads have bigger cupping even without any sign of glaucoma. To reduce the rate of false positives, other researchers trained a deep learning algorithm to determine the presence of glaucoma based on fundus photographs and implemented it with the visual field severity [9][13]. Li and coworkers used a pre-trained CNN called ResNet101 and implemented it with raw clinical data in the last connected layer of the network; interestingly, there were no statistically significant changes in AUC, but they found an improvement in the overall sensitivity and specificity of the model, confirming the importance of multi-source data to improve the discriminative capacity of the glaucomatous optic disc [10][14]. More recently, Hemelings et al. utilized a pre-trained CNN structure relying on active and transfer learning to develop an algorithm with an AUC of 0.995. They also introduced the possibility for clinicians to use heatmaps and occlusion tests to understand better the predominant areas from which the algorithm based its predictions; it is an exciting way of trying to overcome some problems related to the well-known ‘black-box’ effect [11][15]. The majority of the publications that were analyzed suggested that an automated system for diagnosing glaucoma could be developed (Table 1). The severity of the disease and its high incidence rates support the studies that have been conducted. Deep learning and other recent computational methods have proven to be promising fundus imaging technologies. Some recent technologies, such as data augmentation and transfer learning, have been used as an alternative way to optimize and reduce network training, even though such techniques necessitate a large database and high computational costs.
Table 1.
Summary of studies on glaucoma detection using fundus photography.
Author Year N. of Images Structure SEN SPEC ACC AUC
Kolar et al. [12][16] 2008 30
OCT Fundus Thompson et al. [43][FD     93.80%  
47] 2019 1. Global BMO-MRW prediction Nayak et al. [13][17] 2009 61 Morphological 100% 80% 90%  
Bock et al. [14][18] 2010 575 Glaucoma Risk Index 73% 85% 80%  
Acharya et al. [15][19] 2011 60 SVM     91%  
Dua et al. [16][20] 2012 60 DWT  
 
 
0.914
ARCH = Architecture; SEN = Sensibility; SPEC = Specificity; ACC = Accuracy; AUC = Area under the curve.
ScholarVision Creations