Retinal imaging based glaucoma detection using modified pelican optimization based extreme learning machine

Muduli, Debendra; Kumari, Rani; Akhunzada, Adnan; Cengiz, Korhan; Sharma, Santosh Kumar; Kumar, Rakesh Ranjan; Sah, Dinesh Kumar

doi:10.1038/s41598-024-79710-7

Download PDF

Article
Open access
Published: 29 November 2024

Retinal imaging based glaucoma detection using modified pelican optimization based extreme learning machine

Debendra Muduli¹,
Rani Kumari^2,3,
Adnan Akhunzada⁴,
Korhan Cengiz⁵,
Santosh Kumar Sharma¹,
Rakesh Ranjan Kumar¹ &
…
Dinesh Kumar Sah^6,7

Scientific Reports volume 14, Article number: 29660 (2024) Cite this article

2825 Accesses
2 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Glaucoma is defined as progressive optic neuropathy that damages the structural appearance of the optic nerve head and is characterized by permanent blindness. For mass fundus image-based glaucoma classification, an improved automated computer-aided diagnosis (CAD) model performing binary classification (glaucoma or healthy), allowing ophthalmologists to detect glaucoma disease correctly in less computational time. We proposed learning technique called fast discrete curvelet transform with wrapping (FDCT-WRP) to create feature set. This method is entitled extracting curve-like features and creating a feature set. The combined feature reduction techniques named as principal component analysis and linear discriminant analysis, have been applied to generate prominent features and decrease the feature vector dimension. Lastly, a newly improved learning algorithm encompasses a modified pelican optimization algorithm (MOD-POA) and an extreme learning machine (ELM) for classification tasks. In this MOD-POA+ELM algorithm, the modified pelican optimization algorithm (MOD-POA) has been utilized to optimize the parameters of ELM’s hidden neurons. The effectiveness has been evaluated using two standard datasets called G1020 and ORIGA with the $10 \times 5$-fold stratified cross-validation technique to ensure reliable evaluation. Our employed scheme achieved the best results for both datasets obtaining accuracy of 93.25% (G1020 dataset) and 96.75% (ORIGA dataset), respectively. Furthermore, we have utilized seven Explainable AI methodologies: Vanilla Gradients (VG), Guided Backpropagation (GBP ), Integrated Gradients ( IG), Guided Integrated Gradients (GIG), SmoothGrad, Gradient-weighted Class Activation Mapping (GCAM), and Guided Grad-CAM (GGCAM) for interpretability examination, aiding in the advancement of dependable and credible automation of healthcare detection of glaucoma.

OCT-based diagnosis of glaucoma and glaucoma stages using explainable machine learning

Article Open access 28 January 2025

A hierarchical deep learning approach with transparency and interpretability based on small samples for glaucoma diagnosis

Article Open access 11 March 2021

Assessing the efficacy of 2D and 3D CNN algorithms in OCT-based glaucoma detection

Article Open access 23 May 2024

Introduction

Around the world, glaucoma is the second most prevalent disease, poses a significant threat to the optic nerve and can result in complete vision loss¹. It has been observed that more than 60.5 million persons were affected under primary open-angle glaucoma (POAG) and primary angle closure glaucoma (PACG) globally in 2010, and it also predicted that can affect more than 112 million people by 2040². It is a progressive disease which silently noticeable symptoms named “silent theft of sight”. Hence, early detection and screening of glaucoma is essential for timely treatment. Tonometry is a technique used to measure the fluid pressure in the eyes, known as intraocular pressure (IOP), as a part of the assessment for detecting glaucoma. One limitation is that all healthcare centres are not equipped to perform visual field tests. The Optic Nerve Hypoplasia (ONH) assessment has been the more prominent to find clinically. Researchers have also investigated adaptive optics, which can correct for aberrations in the eye and provide even higher-resolution images. Finally, there has been growing interest in developing new biomarkers for glaucoma. Biomarkers are measurable indicators of disease that can be detected in body fluids or tissues. Several authors have identified potential biomarkers for glaucoma, including changes in the levels of specific proteins and metabolites in the eye. Regular eye examinations are essential for early detection and treatment of glaucoma, as vision loss from the disease is irreversible. Recent advances in glaucoma detection have focused on developing new technologies and methods, such as artificial intelligence and machine learning algorithms, which can analyse retinal images and predict the risk of developing glaucoma³. The manual expansion of detecting fundus images is costly, complicated and computationally time-consuming for ophthalmologists. Therefore, if the optic nerve in the back of the eye, gets injured and is not treated correctly, it becomes extremely challenging to identify the problem Consequently, to assist the ophthalmologist in accurately identifying the fundus image, an enhanced CAD model that relies on automated fundus image detection is utilised for this purpose. It enhances productivity and increases the outcomes of detecting several types of fundus images^4,5. Different CAD models have been deployed on neural networks to detect glaucoma. Because of their ability to simplify and provide global approximations, neural networks are employed in numerous research studies to tackle classification and regression tasks. But, the basic limitation of neural network models like overfitting, slow convergence, and local minima. It is an omnipotence and prominent ML classifier named as ELM^6,7, that based on the correlation between the learning mechanisms within adaptive layered networks and the procedure of modeling data through intricate, high-dimensional surfaces⁸. A specific type of adaptive network is recognised that explicitly defines the interjection method, which obtained greeted speeds and enhanced generalisation outcomes. Namely,ELM conventional neural networks, training has not necessitate iterations. But instead, the input-to-hidden-layer weights are randomly generated, and the output weights are computed using an analytical approach. To mitigate such issues, the neural network is trained using a potent learning technique called extreme learning machine (ELM)⁷, radial basis function network (RBFN)⁸, offers enhanced speed and delivers superior performance outcomes compared to alternative methods. The input weights are non-linearly estimated by transforming the input layer to the hidden layer within the ELM framework.Therefore, in order to ascertain the weights connecting the hidden and output layers, excluding parameter tuning, an explanatory approach known as Moore-Penrose (MP) is employed. Due to faster learning speed, ELM has been widely employed across various research domains, including the classification of breast tumours^9,10,11,12, diabetes detection^13,14, etc. Manghnani et al.¹⁵ proposed a new CAD model based on an adaptive approach for glaucoma detection using bidimensional empirical mode decomposition (BDEMD) from retinal images. Then, Adaptive bi-dimensional intrinsic mode functions (BDIMFs) are obtained using very simple steps, from pre-processed and coloured decomposed images then normalized and classified by a support vector machine (SVM) with its different kernel functions. In¹⁶, the authors have presented a novel model on glaucoma prediction using bi-dimensional intrinsic model functions(BDIMFs)based feature fusion then normalized and SVM has been used for classification with 10-fold cross-validation(FCV). In¹⁷, the authors employed a new deep learning model with different potential models namely Inceptionv3, ResNet50, VGG19, Xception and custom Convolutional neural network(CCNN). VGG19 achieved better classification accuracy than other deep-learning models for glaucoma detection. Kikar et al.¹⁸ presented a new CAD model based on second-stage quasi-bivariate vibrational model decomposition (SS-QB-VDM) based sub-band images from fundus images for glaucoma detection. Then, in the second stage decomposed SBIs are fine with no mode mix problems and SVM is utilized for glaucoma classification. Kikar et al.¹⁹ have used a new CAD model for glaucoma detection using image channels(ICs)and discrete wavelet transform(DWT) for feature extraction. The robust features are selected and fed to the least square support vector machine (LS-SVM) for classification which achieves an accuracy of 84.95% than other traditional models. Agrawal et al.²⁰ presented a novel model and more accurate method for automated glaucoma detection using quasi-bivariate variational mode decomposition (QB-VMD) from digital fundus images. Based on better detection results they have used seventy features extracted from QB-VMD SBIs. Extracted features are normalised and selected using the ReliefF method. Selected features are then fed to singular value decomposition to reduce their dimensionality. Finally, the reduced features are classified using the least square support vector machine classifier for glaucoma detection. Kikar et al.²¹ used a CAD model that employed the concatenation approach which is the combination of all features obtained using DWT and EWT and their combination. Extracted features from each of DWT, EWT, DWTEWT and EWTDWT are concatenated. Concatenated features are normalised, ranked and fed to singular value decomposition to find robust features. Fourteen robust features are used by the support vector machine classifier to achieve better classification results. Kikar et al.²² used a novel fully variational and adaptive computer-based glaucoma detection using compact variational mode decomposition (CVMD) from fundus images. Then, Efficient sub-band images having narrow Fourier bandwidth, and clear and sharp boundaries are obtained using CVMD which gives a robust estimate of features. Texture features capture subtle variation and give more information therefore these features are extracted from efficient subband images and help to increase the glaucoma detection accuracy. Extracted features are normalized and classified using a support vector machine classifier. Explainable artificial intelligence (XAI) is crucial in the medical field as it aids in enhancing the understanding of deep learning models²³. Diagnostic tasks are prone to errors due to subjective judgments, and explainability can make these systems more transparent, leading to better evaluation and addressing issues related to accountability, fairness, and ethics²⁴ . As a result, XAI holds significant practical value in medical imaging. By using XAI, healthcare professionals can better grasp the reasoning behind medical conclusions and more effectively utilize deep learning technologies²⁵. The demand for explainable approaches in medical imaging is growing, and in the European Union, XAI in medical contexts is now mandated under the General Data Protection Regulation (GDPR)²⁶.

This study aims to develop an automated computer-aided detection (CAD) model designed explicitly for classifying fundus images. Our deployed scheme has been applied to ELM, an excellent learning approach used for glaucoma detection with a modified pelican optimization algorithm (MOD-POA) to address the constraints inherent in ELM, we fine-tuned the input weight and bias of the model to enhance optimization. The prime contributions are listed below:

To extract features, specifically a collection of curves, from glaucoma images, the fast discrete curvelet transform with wrapping ( FDCT-WRP) schemes has been employed for capturing 2D singularities features and integrated into the methodology.
To enhance traditional learning methods and achieve faster learning speed and superior performance, an improved version of a single hidden layer feed-forward network (SLFN) called extreme learning machine (ELM) is employed.
The hybridized approach combining the modified pelican optimization algorithm and ELM (MOD-POA+ELM) has been implemented to enhance the learning process by effectively disregarding unnecessary hidden nodes, avoiding local minima, and optimizing the speed of response for testing data.
We present a glaucoma classification task that utilizes re-parameterization. This model offers simplicity and efficiency, effectively capturing both cup-to-disc ratio (CDR) information, while also addressing classification and explainability tasks based on seven XAI methods.

Related works

During the past decade, numerous proposed CAD models have introduced to classify glaucoma using fundus images. Most CAD models have three distinct components: feature extraction, selection, classification. The classification of fundus images heavily relies on detection of glaucoma plays crucial roles of extraction of features and dimensional reduction, significantly impacting the classification process. The primary objective of ONH assessments is to perform binary classification distinguishing between individuals with glaucoma and healthy. It is essential to accurately segment the optic disc and cup to detect the cup-to-disc ratio (CDR) precisely. Based on given the time-consuming nature of manually segmenting the optic disc and cup, the cup-to-disc ratio (CDR) is often assessed by comparing the vertical diameter of the cup to that of the disc. From majority of CAD models utilize various features, such as spatial ___domain, frequency ___domain, textual ___domain, statistical ___domain, and multi-resolution behavior, to classify glaucoma fundus images^27,28. There are three main categories of techniques for optic disc segmentation: template-based methods, deformable model-based methods, and pixel classification-based methods^29,30.

The utilization of multi-resolution techniques such as wavelet and curvelet offers a proficient and compact way to represent images and wavelet characteristics have been employed in various CAD models to address classification challenges³¹. The method proposed by Agboola³² employs wavelet coefficients to achieve multi-resolution glaucoma analysis, resulting in enhanced outcomes. They explored a combination of statistical characteristics along with a binary tree as a classification tool in their approach. Akhras et al.³³ introduced an innovative framework, the team has unveiled an advanced approach leveraging machine learning-based classification to potentially revolutionize the early and automated diagnosis of glaucoma. In this model, support vector machines and finely tuned artificial neural networks serve as the key components, functioning as a robust classifier. Usman et al.³⁴ have introduced a CAD system designed to incorporate multiple techniques such as Fourier ___domain OCT systems (FD-OCT), spectral ___domain OCT (SD-OCT), and swept-source OCT (SS-OCT). They have utilized several methods involving a laser that sweeps across different frequencies and a high-capacity balanced photo-detector to capture images. Khan et al.³⁵ have proposed a new methodology to classify automated fundus images by deploying a non-gaussian bivariate distribution function based on a feature selection algorithm and SVM as a classifier with several kernel functionalities. Acharya et al.³⁶ have presented an innovative automated method for detecting glaucoma is presented in this study. The approach incorporates adaptive histogram equalization to transform RGB images into grayscale. Convolution is then applied using diverse filter banks, including Leung-Malik (LM), Schmid (S), and higher-response (MR8, MR4) filters.. They utilized six features extracted from the LM filter bank and applied the K-nearest neighbours (KNN) as a classifier. Dey et al.³⁷ presented a CAD model, which utilizes machine learning techniques based on statistical feature extraction techniques, particularly texture features are derived from fundus images through the analysis of the Grey-Level Run Length Matrix (GLRLM) and Grey-Level Co-occurrence Matrix (GLCM). The classification task is executed through the application of a Support Vector Machine (SVM) classifier. In³⁸, introduced innovative computer-aided design (CAD) methods leveraging the accelerated robust feature (SURF) and histogram of oriented gradients (HOG) characteristics. Such scheme incorporates an advanced hybrid optimization technique in conjunction with SVM for performing classification tasks.

In³⁹ the authors used an automated diagnostic system employing a model incorporating histogram of oriented gradients (HOG) in conjunction with a feed-forward neural network (FFNN) In²⁰, have implemented an innovative approach that combines quasi-bivariate variational mode decomposition (QB-VMD) with the ReliefF algorithm for efficient feature selection. As a result, we chose to use the highly efficient least squares support vector machine (LS-SVM) for the classification task. In⁴⁰, the authors have employed an alternative CAD model, we applied the flexible analytic wavelet transform (FAWT) to extract a range of entropies and fractal dimension (FD) characteristics. To reduce feature dimensions, we utilized linear discriminant analysis (LDA) and implemented a least-squares support vector machine (LS-SVM) as a classifier. In⁴¹, the authors employed novel CAD model employs DWT and HOG features, with subsequent classification performed using an ELM classifier. Balasubramanian et al.⁴², introduced a ML technique which leverages correlation features selected through a bio-inspired algorithm. Additionally, the classification task is performed using a KELM (Kernel Extreme Learning Machine) with optimization based on salp-swarm methodology. Morales et al.⁴³ havesuggested a novel Computer-Aided Design (CAD) model for optic disc extraction employing mathematical morphology and principal component analysis (PCA). This approach enhances the grayscale image quality compared to the original RGB. For OD segmentation, they have focused on different mathematical morphology like centroid calculation, stochastic watershed, and region discrimination; post-prepossessing has been used as circulation approximation. Huang et al.⁴⁴ have presented a CAD model that builds upon ELM and can be expanded to include the radial basis function (RBF) enables the random generation of centers and impact widths for RBF kernels, with the subsequent iterative calculation of output weights through analytical methods.

In²¹, introduced a novel method that employs extraction of features methods like discrete wavelet transforms (DWTs) and empirical wavelet transforms (EWTs) have been utilized to decompose images. They combined all the features obtained through DWT and EWT. They subsequently normalized the concatenated set of features, ranked which fed to singular values decomposition and LS-SVM algorithm used for classiffication. Maheshwari et al.⁴⁵ have introduced a novel method for diagnosing glaucoma, utilizing empirical wavelet transform (EWT), and the t-value feature selection algorithm is employed to rank the features. They applied the least squares support vector machine (LS-SVM) classifier to perform classification using radial basis function, market wavelet, and Mexican-hat wavelet kernels. In⁴⁶, presented a CAD method on linear discriminant analysis (LDA) and artificial neural network (ANN) techniques have been applied to improve the differentiation between glaucomatous and normal eyes within the Taiwanese Chinese population, taking into account the measurement of retinal nerve fiber layer thickness⁴⁷. They have used ANN, LDF, and NFI methods to demonstrate equal diagnosis power in glaucoma detection. Huang et al.⁴⁸ use computational intelligence approches, i.e., neural network, support vector machine (SVM) as a classifier, and ELM works for generalized single-hidden layer feed-forward networks (SLFNs). In⁴⁹, used an innovative learning algorithm known as ELM, accompanied by a hybrid learning approach for the selection of input weights. Additionally, the Moore-Penrose (MP) generalized inverse is employed to analytically determine the output weights in our proposed methodology. Dua et al.⁵⁰ have used The demonstrated discriminatory capability of employing wavelet filters such as Daubechies (db3), eyelets (sym3), and biorthogonal (bio3.3, bio3.5, and bio3.7) for feature extraction, in conjunction with a Support Vector Machine (SVM) classifier, has been showcased.

From the literature, it has come to notice that a majority of CAD models extract features from two-dimensional wavelets, such as DWT, SWT, EWT, and similar methods. The fundus images have subjected to various levels of segregation and subsequently transformed into one-dimensional representations for experimental purposes. Hence, the observation indicates that DWT is inadequate for extracting the curve sample features in glaucoma images. So, SLFN, SVM, and ELM are commonly utilized in various CAD models for classification purposes; this results in more parameters being tuned, leading to a longer computational time.

In this article, we have introduced a model which effectively captures two-dimensional singularities; a fast discrete curvelet transform with wrapping (FDCT-WRP) method has been employed for extraction of features that capture 2D singularities,method of dimensionality reduction, “PCA+LDA”,used for choose pertinent features. Due to fundus digital image detection, a modified extreme learning machine (ELM) called MOD-POA+ELM utilized on POA (pelican optimization algorithm) to enhance the learning process of the traditional ELM algorithm⁵¹. It’s merits of ELM are reducing the number of hidden nodes, avoiding local minima, list of remote nodes. Our implemented model has evaluated using two widely used datasets, G1020⁵² and ORIGA⁵³. Digital database for screening of fundus images in G1020 and ORIGA, the performance analysis with traditional models to assess the efficacy of the implemented model, and validation has been performed.

Additionally, we introduce seven existing explainable artificial intelligence (XAI) methods integrated with our classification approach namely Vanilla Gradients (VG)⁵⁴, Guided Backpropagation (GBP)⁵⁵, Integrated Gradients(IG)⁵⁴, Guided Integrated Gradients( GIG)⁵⁶, SmoothGrad⁵⁷, (Gradient-weighted Class Activation Mapping (GCAM)⁵⁸, Guided Grad-CAM ( GGCAM)²⁴. Our goal is to demonstrate a glaucoma identification method that outperforms previous studies, while incorporating explainability features to enhance trust in the system. The proposed model has been implemented as a web application, allowing practitioners to use it as a support tool.

Methods

The deployed scheme comprises four key components: pre-processing, extraction of features, feature reduction, and classification. In the preprocessing stage, relevant features are obtained by applying the fast discrete curvelet transform with wrapping (FDCT-WRP) for the extraction of the region of interest (ROI) from fundus images. Then, removing relevant features is accomplished sequentially by employing PCA, and LDA. The relevant features extracted from the preprocessing stage are ultimately fed into the optimized modified pelican optimization algorithm and extreme learning machine (MOD-POA+ELM) for classification (Fig. 1).

In our implemented scheme, the glaucoma data sets have been split into a training set and a testing set using a specific 60:40 ratio and testing better classification results. In our proposed work, we presented two online repositories, G1020⁵² and ORIGA images⁵³. Usually, fundus images contain several unwanted components, namely noise, artifacts, test information, etc., during acquisition. These abnormalities continue to spread to the extracted features if they are obtained without eliminating such artifacts. Therefore, standard techniques for image preprocessing, such as enhancement and noise reduction methods, are utilized in all images. Today, the enormous amount of fundus images obtained primarily stored locally, due to exploitation of the valuable embedded fundus images, could be more efficient. Our model has assessed the cropping procedure to isolate the relevant region of interest (ROI). To streamline processing time, manually identified regions of interest (ROIs) are employed⁵⁰ extracted using cropping techniques for both datasets, namely G1020 and ORIGA. The ophthalmologists delineated the values of cup and disc ratio in numerous images. Our proposed approach is centered around cropped images with a specified size of 128 $\times$ 128 as shown in Fig. 2. The Comprehensive overview of the proposed model illustrate the overall workflow and depicts the central theme shown in Fig. 1.

Feature extraction utilizing FDCT-WRP

The widespread use of wavelet transform is attributed to its multi-resolution characteristics, allowing for extracting time-frequency localized information from images. Nevertheless, conventional wavelet methods need help in detecting two-dimensional singularities, such as lines and curves, in contrast to the capabilities of ridgelet and curvelet transforms. The Ridgelet transform can extract diverse features such as lines and orientations, but it needs more efficient handling curve singularities within an image^59,60. Addressing this limitation, the initial generation of the curvelet transform has introduced by Candes and Donoho, utilizing a multi-scale ridgelet approach⁶¹. This variant offers enhanced attributes, including increased directional sensitivity, multi-resolution adaptability, anisotropy, and improved localization. An advanced iteration of the curvelet, known as the second generation curvelet transform, has been deployed on⁶² to address the limitations of the initial generation of curvelet transforms, which encompassed issues such as ambiguous identification of ridgelets and excessive computational burden⁶³. In the following, we elucidate the essential mathematical derivation of discrete curvelet transforms. In signal $S_i$, the curvelet transform $C_U$($p_c$, $q_c$, $r_c$) has explained as the inner product of $S_i$, $\tau _{{p_c},{q_c},{r_c}}$ that is

$$\begin{aligned} C_U(p_c, q_c, r_c)=<S_i, \tau _{{p_c},{q_c},{r_c}}> \end{aligned}$$

(1)

Where, $\tau _{{p_c},{q_c},{r_c}}$ shows the primary function of the curvelet, $p_c$, $q_c$, and $r_c$ shows the list of the parameters such as scale, position, and orientation as needed. In the process of curvelet transformation, the image is divided into multiple windows at different scales and orientations. The discrete form of a curvelet transform is obtained based on the input image $\int [a^1,b^1]$ with $\ 0 \leqslant x^1, y^1 < m$ is expressed as

$$\begin{aligned} {C_U ^{{\hat{D}}} \left( p_c,q_c,r_c\right) = \sum \limits _{0 \le a^1, b^1 < n} S\left[ a^1,b^1\right] \overline{\tau ^{{\hat{D}}}_{p_c, q_c, r_c}[a^1,b^1]}} \end{aligned}$$

(2)

Hence, $\tau ^{{\hat{D}}}_{ p_c, q_c, r_c}$ The provided image exhibits an automated digital curvelet waveform. The progression of the $2^{nd}$ generation curvelet transform involves two distinct processes: wrapping and the unequally spaced first Fourier transform (USFFT). Based on the first-generation curvelet, these methods are characterized by simplicity, being the first and having the least redundancy. Hence, the FDCT - WRP method is more straightforward and prominent than USFFT in two procedures. Taking into account that, in our research, for feature extraction. We have utilized wrapping plans to construct by initial discrete curvelet transform known as FDCT - WRP. Consider the following list of steps to be considered for the implementation of FDCT - WRP.

Perform a 2D Fast Fourier Transform (FFT) calculation and obtain its coefficients ($V[c^1,d^1]$) from every image.
Generate a discrete window is created for each scale and angle, providing localization within the given context within the Fourier ___domain (${\hat{X}}_{p_c,q_c}[c^1,d^1]$) and evaluate the product (${\hat{X}}_{p_c,q_c}[c^1,d^1]$,$V[c^1,d^1]$)
By utilizing the re-indexing method, the data undergoes processing as it is wrapped around the origin (${\hat{V}}_{p_c,q_c}[c^1,d^1]$)
To define the discrete curvelet coefficients $C_U{\hat{D}}_{p_c,q_c,r_c}$, by conducting the reverse 2D FFT on ${\hat{V}}_{p_c,q_c}$.

The fast curvelet transform incorporates the coefficient with a wrapping procedure at each scale and orientation, similar to $p_c, q_c$, to create the feature vectors. To determine the list of rankings, the size of the image has taken into consideration $m^r, m^c$ is computed as:

$$\begin{aligned} {\hat{s}} = \lceil {\log _{2}{(min(m^r, m^c)}-3}\rceil \end{aligned}$$

(3)

Here, each image dimension is 128 $\times$ 128, it’s value ${{\hat{s}}}$ value is 4, which means every image is sub-divided the curvelet transforms into four levels. Exclude the first and last scales and consider the remaining scales (2,3) of several substandard information. At angles $\alpha$ and $\alpha +\pi$, the curvelet creates similar coefficients at every scale. Hence, at every scale, consequently, the coefficients in the feature vectors are of significant magnitude; further reduction is required to choose the essential features, necessitating additional feature reduction.

Dimensionality reduction serves as a technique in machine learning and data analysis, aimed at diminishing the multitude of input features or variables within a dataset. The fundamental objective is to streamline the dataset, retaining essential information while minimizing complexity.

Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) are techniques utilized for feature dimensionality reduction in machine learning. PCA strives to discover a reduced-dimensional representation of the data while retaining the maximum possible variability from the original dataset. It identifies the primary directions of variation within the data and maps the data onto a reduced subspace defined by these identified directions. This specific subspace, known as the primary subspace, is formed by the principal components, which are the eigenvectors derived from the covariance matrix of the dataset. Conversely, LDA is a supervised method designed to discover a reduced-dimensional representation of the data, with the objective of optimizing the distinction between various classes. t operates by mapping the data onto a subspace where the variance between classes is maximized and the variance within each class is minimized. The resulting subspace is chosen to maximize Fisher’s linear discriminant criterion. This approach is practical in various applications, such as face recognition and image classification. Combining the two techniques and employing this method makes it feasible to decrease the dimensionality of the data while simultaneously optimizing the discrimination between different classes, leading to better performance on classification tasks.

The efficacy of Linear Discriminant Analysis (LDA) may be enhanced in scenarios involving with high-dimensional features and constrained sample sizes, the scatter matrix serves as a representation of ($S_m$) is seldom remarkable⁶⁴. Such a problem is called a small sample size (SSS). To form $S_m$ denoted by matrix of non-singular, lowest ${\hat{D}} + C$ (so, ${\hat{D}}$ defines the size of the feature matrix and enumerates the classes as C ) list of technically inapplicable features required⁶⁵. Our deployed scheme incorporates a fusion reduction technique known as $PCA-LDA$ In the course of our implemented efforts to tackle this issue. At first, ${\hat{DE}}$ specified as diminution of feature vector data PCA on MA-hyper-parameteres afer that LD- specified as Linear Discriminant Analysis (LDA) feature parameter and, then final parameters is $LD< MA <{\hat{DE}}$. We arrange the estimated eigenvalues of the elements in descending order to identify the most critical characteristics. So, NCSOV, method is provided for calculating the characteristics of every feature normalization sum of cumulative variance. Then, $i^{th}$ parameters NCSOV have been evaluated by:

$$\begin{aligned} NCSOV(i)=\frac{\sum _{y=1}^{i}{\beta {(y)}}}{\sum _{y=1}^{{\hat{DE}}}{\beta {(u)}}}: here,1\leqslant i\leqslant {\hat{DE}} \end{aligned}$$

(4)

Now, $\beta {(y)}$ specified as eigenvalue of $y^{th}$ feature, then, parameters of feature matrix as ${\hat{DE}}$. Lastly, our simulation have considered the threshold value manually and selected features as relevant when NCSOV value enhance the threshold value. The initial LD number of eigenvectors meeting the specified condition is identified as the resultant reduction feature for the classification task.

MOD-POA+ELM-based classification

This segment is structured into three parts to enhance comprehension: the traditional extreme learning machine (ELM), the pelican optimization algorithm (POA), and the integrated novel classification approach known as MOD-POA+ELM. In current decades, single-hidden layer feed-forward neural networks (SLFNs) have found extensive application in approximating continuous functions and identifying separate regions. Learning algorithms based on gradients, like The Levenberg-Marquardt (LM) and back-propagation (BP) algorithms have been widely utilized for the training of these Single-Layer Feed-forward Networks (SLFNs). However, several challenges associated with this learning method are observed, including slow training speed, susceptibility to getting stuck in local minima, overfitting, and extensive iterations due to inadequate learning steps^6,66. A recently developed learning technique, called the extreme learning machine (ELM), effectively addresses these inherent in gradient-based learning approaches. ELM surpasses these limitations and demonstrates the capability to excel in identifying high-performance patterns and resolving regression functions⁶⁷. In ELM, The values of parameters such as input weights and hidden biases for the hidden node are assigned randomly. In contrast, the output weights of the ELM are determined through a straightforward inverse operation that involves the output matrix of the hidden layer. The mathematical derivation of the ELM approach is elaborated below.

We used $\mathbb {{\hat{N}}}$ as set of distinct training sample $(r_i,s_i)$, where, $r_i=\left[ r_{i1},x_{i2},\ldots ,r_{iL}\right] ^T \in R^L$ and $s_i=[s_{i1},s{i2},\ldots ,$ $h_{iC}]^T \in R^C$. The activation functions $\delta (.)$, number of hidden nodes and SLFNs consist of having $s_{hidden}$ and are listed as:

$$\begin{aligned} \sum \limits _{i=1}^{t_s}wve^o_i \delta (r_j)=\sum \limits _{i=1}^{t_s}wve^o_i \delta (wve^s_i \cdot r_j + {\hat{b}}_i)=o_j, \ \ j=1,2,\ldots ,\mathbb {{\hat{N}}} \end{aligned}$$

(5)

Here, the weight vector is $wve^s_i=\left[ wt^s_{i1},wve^s_{i2},\ldots ,wt^s_{il}\right] ^T$ which specifies a connection based on hidden neuron ($i^{th}$) and the input neurons. The interpretation likely revolves around the weight vector of the concealed neuron $wt^o_i=\left[ wve^o_{i1},wve^o_{i2},\ldots ,wve^o_{iC}\right] ^T$. Finally, the bias of the hidden neuron $i^{th}$ denoted as ${\hat{b}}_i$. The projected significance of SLFNs in $\mathbb {{\hat{R}}}$ is error-free.

$$\begin{aligned} \sum \limits _{i=1}^{t_q}wve^o_i \delta (wve^q_i \cdot p_j + {\hat{b}}_i)=q_j, \ \ j=1,2,\ldots ,\mathbb {{\hat{N}}} \end{aligned}$$

(6)

The explanation for the vector format represented by Equation 6 is provided below::

$$\begin{aligned} {\hat{\textbf{HI}}}wve^o={\hat{\textbf{T}}} \end{aligned}$$

(7)

Here, ${\hat{\textbf{HI}}}(wve^h_1,wve^h_2,\ldots ,wve^h_{ve_q},{\hat{b}}_1,{\hat{b}}_2,\ldots ,{\hat{b}}_{v_h},r_1,r_2,\ldots ,r_\mathbb {{\hat{N}}})$

The vector resulting from the hidden layer can be acquired by ${\hat{\textbf{HI}}}$. After, its resultant weight $wve^o$The assessment involved examining the outcome of the lowest norm least squares (NLS) to determine as:

$$\begin{aligned} \hat{wve^o}={\hat{\textbf{HI}}}^\dagger {\hat{\textbf{T}}} \end{aligned}$$

(8)

For (Eq. 8), ${\hat{\textbf{HI}}}^\dagger$ viewed as Moore–Penrose (MP) matrix Inverse using simplified form is ${\hat{\textbf{HI}}}$, Streamline the extreme learning machine through simplification. The latest solution boasts the lowest norm value compared to all other Nonlinear Least Squares (NLS) solutions. Consequently, ELM exhibits superior speed compared to the majority of traditional learning algorithms, requiring fewer iterations for hyperparameter tuning.

Proposed modifed-pelican optimization algorithm (MOD-POA)

The developed MOD-POA has been used to choose optimal features; and the optimization process has enhance the classification accuracy of the deployed glaucoma fundus image detection. Pelican optimization algorithm consists of better competitive results and an effective balance between exploration and exploitation comparing algorithms for competitiveness, ensuring optimal resolutions for optimization challenges⁶⁸. Therefore, challenges arise when assessing the POA’s performance in real-life situations. So it is required to develop an enhanced MOD-POA method. It helps to reduce the convergence rate, and also it attains a greater accuracy rate. It solves constraint optimization problems and can deploy in worldwide applications. It deployed to a large number of datasets. In the improved method of MOD-POA, the prey position is updated by Eq. 9.

$$\begin{aligned} loc=\frac{BE + WO}{2} \end{aligned}$$

(9)

Here, the loc denotes the prey’s ___location, and the terms BE and WO represent the parameter optimisation’s best and worst index values. Initialization: It is considered that there are U be the number of pelicans in V dimensional space, the ___location of an $i^{th}$ pelican in V dimensional space is expressed as [$Ri= ri_1, ri_2, ri_3, \dots ri_V$]. The ___location $r_i$ of U the pelicans is represented in Eq. 10

$$\begin{aligned} r_{i,j}= & {loc}_{bj} + ra \cdot (u_{bj} - l_{bj}), \quad i=1,2,3,\dots , U, \quad j=1,2,3,\dots , V \end{aligned}$$

(10)

$$\begin{aligned} {\textbf{R}}= & \left[ {\begin{array}{c} Ri_1 \\ Ri_2 \\ \vdots \\ Ri_i \\ \vdots \\ Ri_u \\ \end{array}}\right] =\left[ {\begin{matrix} ri_11 & ri_i2 & \ldots & ri_1u & \ldots & ri_1u \\ ri_12 & ri_22 & \ldots & ri_2u & \ldots & ri_2u\\ \ldots & \ldots & \ldots & \ldots & \ldots & \ldots \\ ri_i1 & ri_i2 & \ldots & ri_iu & \ldots & ri_iu\\ \ldots & \ldots & \ldots & \ldots & \ldots & \ldots \\ ri_u1 & ri_u2 & \ldots & ri_uv & \vdots & ri_vu\\ \end{matrix}}\right] \end{aligned}$$

(11)

Where, $R^i$ is specified as the population matrix of an $i^{th}$ pelican, in POA, every population group is considered a pelican, which is an aspirant solution. So, based on each aspirant solution, the objective function of the deployed is analyzed. The value for every objective method is computed using an accurate function vector, shown in Eq. 12.

$$\begin{aligned} {\textbf{O}} = \left[ {\begin{array}{c} O_1 \\ \vdots \\ O_a \\ \vdots \\ O_u \\ \end{array}}\right] _{ b \times 1} = \left[ {\begin{array}{c} O(Ri)_1 \\ \vdots \\ O(Ri)_a \\ \vdots \\ O(Ri)_v \\ \end{array}}\right] _{ v \times 1} \end{aligned}$$

(12)

Here, O is specified as the objective function vector and $T_i$ is defined by the value of the objective function in terms of $i_th$ aspirant solution. Hence, POA provokes the features and strategies of pelicans while trapping and hunting the targeted prey for upgrading the aspirant solutions. This process of hunting and the method utilized in it is assumed in two ways:

1.
Moving towards the direction of the prey (investigation phase)
2.
Flying on the surface of the water (utilization phase)

Moving towards the prey

It determines the position of the pelican’s target prey and advances in the direction of greater altitude. The irregular distribution of the prey is employed to explore and maximize the pelican’s investigative capability. The adjustment to the pelican’s position in each iteration is provided by the Eq. 13.

$$\begin{aligned} Ri_{iz}^{i+1}={\left\{ \begin{array}{ll} Ri_{iz}^{s} + ra.\left( l_n-\lambda .Ri_{iz}^s\right) ,T(Ri_t)\le (T(Ri_i))\\ Ri_{iz}^{i} + ra.\left( Ri_{iz}^s- l_n\right) ,T(Ri_t)\le (T(Ri_i)) \end{array}\right. } \end{aligned}$$

(13)

Where t is specified as the current iteration number, $Ri_{iz}^{s}$ shows the position of $i^{th}$ pelican in the $n^{th}$ dimension; $l_n$ specified as the prey position; $T(Ri_t)$ is the main objective function value; $T(Ri_i)$ shows the fitness function. In the POA framework, a new position for the pelican is obtained when there is an enhancement in the objective function. This kind of advancement is called effective improvement, and the algorithm’s movement towards suboptimal regions has prevented. This procedure is achieved using the following Eq. 14.

$$\begin{aligned} Ri_i={\left\{ \begin{array}{ll} Ri_i^s, O_i^s < O_i;\\ Ri_i, \text {else,} \end{array}\right. } \end{aligned}$$

(14)

Here, $Ri_i^s$ is denoted as the new status of pelican’s, $O_i^s$ Is the value of the objective function determined by phase 1.

Winging on the water surface

The pelican arrives at the water’s surface. It expands its wings towards the water’s surface to catch the fish and gather them into the throat pouch. Thus, this hunting process of the pelican is calculated in Eq. 15.

$$\begin{aligned} Ri_{iv}^{p+1}=R_{iz}^s+ \lambda (\frac{V- v}{V}).(2.wx-1).Ri_{iz}^s \end{aligned}$$

(15)

Hence, the term v is specified as the recent list of iterations; P is set as the maximum iteration number; w is defined as the surrounding population to explore locally available members to merge an enhanced solution; wx represents the random number between (0,1); at this section, the updating has also used to agree or neglect the position of new pelican efficiently, which is evaluated in Eq. 16.

$$\begin{aligned} Ri_i={\left\{ \begin{array}{ll} Ri_i^p, O_i^s < O_i;\\ Ri_i, \text {else,} \end{array}\right. } \end{aligned}$$

(16)

Here, $Ri^s_i$ is specified as the new status of the $i^{th}$ pelican, and $O^s_i$ sets the objective function value based on phase 2. The pseudocode of the deployed MOD-POA is explained in Algorithm 1.

Results

The proposed CAD model is run over “PARAM Shavak” supercomputer features an high perperformance computing system incorporates an Intel(R) Xeon(R) Gold 5220R CPU @ 2.20GHz, equipped with either one or two many-core GPU accelerator cards like the NVIDIA P5000 and K40. Additionally, the system includes a minimum of two multi-core CPUs, each possessing at least 12 cores, we have 64 GB of RAM, 8 TB of storage, and three Tera-Flops (TFs) of computing power at their peak outfitted with a development environment for parallel programming and computing power exceeding 2TFs.

In the course of the experiment, we utilized two conventional datasets, namely G1020⁵² and ORIGA⁵³ (The link to access the dataset is available in the data availability section). We have resized the images to 128 $\times$ 128. To avoid the problem of over-fitting, we have employed a cross-validation procedure known as FCV. This approach includes dividing the dataset into five separate subsets, with careful attention to avoid any overlap between them. The model is trained in four distinct subgroups, followed by testing in the remaining one. This process is iterated five times, ensuring that each subset is utilized as the evaluation set precisely once. Employing such strategy enhances our ability to gauge the model’s performance while mitigating the risk of overfitting to the training data. This approach splits the entire dataset into training and testing sets used in both datasets.

The used model has been validated on two conventional data sets with images of fundus glaucoma, namely G1020⁵² and ORIGA⁵³. The G1020 dataset, which consists of 1020 publicly available retinal fundus images, offers a valuable resource for glaucoma classification, with 724 images representing healthy cases and 296 depicting instances of glaucoma. Furthermore, the Open Retinal Fundus Images for Glaucoma Research and Analysis (ORIGA) dataset stands out as a widely used repository, featuring 650 fundus images. Within the ORIGA dataset, 482 images represent healthy patients, while 168 images are from patients diagnosed with glaucoma.

Explainability results

We have implemented the suggested computational framework for Glaucoma classification, and explainability. To enhance the credibility, in order to improve usability and trust, we integrated glaucoma classification and explainability visualizations. The heatmaps produced using list of employed methods emphasize the significant areas on the input image that influenced the predictions. The area for cupping in the middle of the optic disc has closely associated with occurrences of glaucoma²³. In general, an elevated cup-to-disk ratio (CDR) raises the suspicion and probability of the patient having glaucoma. There are non visible cupping issues and no signs of glaucoma present in the healthy eye. So there are no areas affected by glaucoma to point out and see. The main focus of glaucoma eye is on the optic nerve where the cupping problem is prominent. Usually, the red spots were primarily found at the optic nerve head (ONH), where they have the most significant influence on the categorization. The significance is demonstrated through the highlighted colors in the sequence, starting from the most crucial to the least crucial illustrated in Fig. 3, we contrasted the two interpretability displays using Vanilla Gradients (VG), Guided Backpropagation(GBP), Integrated Gradients (IG), Guided Integrated Gradients (GIG), SmoothGrad, GCAM Gradient-weighted Class Activation Mapping (Grad-CAM), Guided Grad-CAM (GGCAM) methods. In general, combination of overall explainable artificial intelligence (XAI) methods improves by offering superior visual interpretations of MOD-POA+ELM model detection including enhanced object localization, clarification of multiple object instances in one image, and improved explanation-based knowledge transfer compared to Grad-CAM²⁴.

The Vanilla gradient (VG) is the simplest way to visualize specific regions in an image that have the most significant influence on the neural network’s classification result⁶⁹, Another method to compute the gradient of a specific output in relation to the input is to employ guided techniques gradient descent is used in reverse to adjust the weights of a neural network through Guided Backpropagation(GBP)⁵⁵. Then, introduced integrated gradients (IG) as a solution for the common problem of saturation came across in methods that rely on gradients⁵⁴. Kapishnikov et al.⁵⁶ introduced guided integrated gradients (GIG) as a method for adoption based on the concept of a path, input image, reference point, and the complex model to be interpreted. Smilkov et al.⁵⁷ proposed a modification to tackle a common issue in gradient-based optimization methods called smoothGrad. In⁵⁸, the authors presented the class activation mapping (CAM) visualization methods for a variety of uses scope of ELM. The GACM employs gradients to offer visual explanations without requiring retraining or improving the design of the model. In order to help advance the development of transparent healthcare automation systems, we integrated XAI methods into our suggested recognition system. In XAI heatmap visualization, significant areas are typically emphasized signaling greater importance for the classification outcome²³. The strength of the highlighted areas changes based on factors like the model’s prediction certainty and the intricacy of the input image. The key areas of the input are strongly highlighted in red in the sample visualizations of each class in our dataset, showing the model’s certainty. On the other hand, the unimportant regions are emphasized with cool hues, as seen in the peripheral areas of CDR ratio. Implementing this XAI visualization system will benefit HealthCare staff by aiding in both the automated identification of glaucoma categories and in comprehending and confirming machine-generated decisions²⁵. Healthcare workers can decrease the rate of decision errors by examining the XAI visualization to comprehend how the intelligent model assigns a particular instance to a specific class²⁶. This implies that individuals can have greater confidence in a computer’s assessment of glaucoma detection. Therefore, the glaucoma recognition system powered by XAI will not just automate the identification process, but also instill confidence and dependability in AI system decision-making during diagnosis, resulting in a cost-efficient, effective, and speedy diagnosis system that promotes early medical intervention in severe cases.

Discussion

The experimental results of all these schemes have evaluated using two well-established datasets, namely G1020 and ORIGA. We have observed that our proposed MOD-POA+ELM obtains better results with the least hidden nodes over two standard datasets compared with several algorithms namely, KNN, SVM, BPNN, ELM, MOD-ELM, POA-ELM exhibited lower condition numbers and norm values, resulting in improved classification performance. Here, we simulated the novelty of a model deployed with twenty-eighth features. Here, the proposed model is compared with state-of-art models of Various classifiers; Likewise, within the ORIGA dataset, accuracy is observed in algorithms such as KNN, SVM, ELM, MFO-ELM, and POA-ELM as 86.26%, 87.50%, 88.50%, 90.25%, 91.25%, and 92.00% accordingly. Similarly to the ORIGA dataset, the list of algorithms such as KNN, SVM, ELM, MFO-ELM, and POA-ELM heaving accuracy 87.69%, 88.85%, 89.62%, 90.38%, 91.15%, and 92.66% accordingly. The comparison Fig. 4a,b shows that at least 32 features in PCA obtain 93.14%, 96.34% in both G1020 and ORIGA datasets. Then we have implemented combined dimensionality reduction techniques called PCA-LDA has been obtained the best result, that is 93.25% and 96.75% in G1020 and ORIGA accordingly with 28 features described in Table 1. Then, to validate the performance of the proposed model, we employed a 10$\times$5 fold stratified cross-validation (SCV) approach. In each fold, the model is manually tuned with hyper-parameters to learn high-level features. ELM is regarded as the foundational classifier in the classification approach, where the modified POA optimization technique has optimized the weight and biases.We’ve additionally examined our model using an alternative meta-heuristic optimization approach known as MFO^70,71. In this paper, different conventional classifiers have been utilized, like KNN, SVM, and BPNN. For each classifier mentioned, a population size of 20 and a maximum iteration limit of 100 have been employed in the study. In the experimental results of $10\times 5$ cross-validation with ten iterations shown in Table 2. Convergence of the results of the training sample of one run visualized in Fig. 5a,b.

Table 1 Comparative analyses (%) of the deployed model based on PCA versus PCA + LDA method.

Full size table

Table 2 The effectiveness of the suggested CAD model using the Retinal dataset result(%), 10$\times$ 5-fold, $FO_N$- Fold Number, R-Run, Acc-Average accuracy.

Full size table

From the experimental evaluation, we observed that the results obtained from our deployed work demonstrate superior classification performance compared to other existing models while utilizing a reduced number of features. The experimental observations indicate that the proposed MFO-ELM, POA-ELM, and MOD-POA+ELM models achieve lower condition and norm values, resulting in improved accuracy compared to the traditional ELM approach. In Table 3 observation, we have presented a table comparing the performance of different classifiers with our deployed work, named FCDT-WRP-PCA-LDA-MOD-POA+ELM, based on various evaluation metrics like accuracy, sensitivity, and specificity. The results demonstrate that our proposed model compared with two datasets namely ACRIMA and RIM-One performs superior classification in both datasets. The overall explainable methods have been employed to create heat maps for visual explanations. Therefore, a machine learning-based computational model can serve as an effective tool to assist in the identification of glaucoma eye conditions.

Table 3 Comparative Analysis (%) deployed CAD scheme Glaucoma datasets with specifications.

Full size table

Performance comparison of existing CAD models

In this section, the efficiency of the model has been evaluated by comparing it with various advanced classifiers in the field. The evaluation has based on two Fig. 6a,b, allowing for a comprehensive comparison. The overall experiments have been conducted on four publicly available datasets ACRIMA, RIM-One, G1020, and ORIGA obtained the classification results namely 91.45%, 92.43%, 93.25% and 96.75% . The experimental results of the proposed model has been compared and tabulated in Table 4. In our experimental analysis, we observed that our employed scheme compared with explainable methods which achieves superior classification results with both datasets.

Table 4 Analysis the classification results with existing models.

Full size table

Conclusion

In this research, a computer-aided diagnosis (CAD) model has been developed, in which the FDCT-WRP technique has employed to extract curve-like structures from a glaucoma fundus image. This approach effectively captured 2D similarities, such as curves in the images. The dimensionality reduction techniques employed, namely PCA+LDA, have been utilized to identify and select the most relevant features. Hence, POA-ELM methods have followed the merits of enhancing the condition value, which is crucial to achieving improved generalization results and faster learning speed. Hence, the deployed scheme has several limitations. The deployed work has tested and validated using a dataset of glaucoma images, but it has not integrated with multiple image datasets. To focus on two problems, the proposed model is being learned. In addition, we introduce the xAI and its application in medical image classification withe the help of existing work. In future, The task of classifying various classes will be undertaken. Finally, exploring an automated optimization technique with less number of parameters for the proposed MOD-POA algorithm would be beneficial. This would alleviate the need for extensive parameter tuning that is currently required.

Data availability

The datasets used and/or analyzed during the current study are available publicly and can be access with the link provide below: G1020: https://www.kaggle.com/datasets/arnavjain1/glaucoma-datasets?select=G1020. ORIGA: https://www.kaggle.com/datasets/arnavjain1/glaucoma-datasets?select=ORIGA. In addition, for dataset specification and its detailed analysis the following can be refer for G1020⁵² and ORIGA⁵³.

References

Tham, Y.-C. et al. Global prevalence of glaucoma and projections of glaucoma burden through 2040: A systematic review and meta-analysis. Ophthalmology 121, 2081–2090 (2014).
Article PubMed Google Scholar
Tham, Y.-C. et al. Inter-relationship between ocular perfusion pressure, blood pressure, intraocular pressure profiles and primary open-angle glaucoma: The Singapore epidemiology of eye diseases study. Br. J. Ophthalmol. 102, 1402–1406 (2018).
Article PubMed Google Scholar
Saha, S., Vignarajan, J. & Frost, S. A fast and fully automated system for glaucoma detection using color fundus photographs. Sci. Rep. 13, 18408 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Kansal, V., Armstrong, J. J., Pintwala, R. & Hutnik, C. Optical coherence tomography for glaucoma diagnosis: An evidence based meta-analysis. PLoS One 13, e0190621 (2018).
Article PubMed PubMed Central Google Scholar
Shyamalee, T., Meedeniya, D., Lim, G. & Karunarathne, M. Automated tool support for glaucoma identification with explainability using fundus images. IEEE Access (2024).
Huang, G.-B., Zhu, Q.-Y. & Siew, C.-K. Extreme learning machine: Theory and applications. Neurocomputing 70, 489–501 (2006).
Article Google Scholar
Huang, G.-B., Zhou, H., Ding, X. & Zhang, R. Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst. Man Cybern. Part B 42, 513–529 (2011).
Article Google Scholar
Broomhead, D. S. & Lowe, D. Radial basis functions, multi-variable functional interpolation and adaptive networks (Tech. Rep, Royal Signals and Radar Establishment Malvern (United Kingdom), 1988).
Muduli, D., Dash, R. & Majhi, B. Automated diagnosis of breast cancer using multi-modal datasets: A deep convolution neural network based approach. Biomed. Signal Process. Control 71, 102825 (2022).
Article Google Scholar
Muduli, D., Dash, R. & Majhi, B. Fast discrete curvelet transform and modified PSO based improved evolutionary extreme learning machine for breast cancer detection. Biomed. Signal Process. Control 70, 102919 (2021).
Article Google Scholar
Muduli, D., Dash, R. & Majhi, B. Enhancement of deep learning in image classification performance using vgg16 with swish activation function for breast cancer detection. In Computer Vision and Image Processing: 5th International Conference, CVIP 2020, Prayagraj, India, December 4-6, 2020, 191–199 (Springer, 2021).
Muduli, D., Kumar, R. R., Pradhan, J. & Kumar, A. An empirical evaluation of extreme learning machine uncertainty quantification for automated breast cancer detection. Neural Comput. Appl. 1–16 (2023).
Sharma, S. K. et al. A diabetes monitoring system and health-medical service composition model in cloud environment. IEEE Access 11, 32804–32819 (2023).
Article Google Scholar
Sharma, S. K., Priyadarshi, A., Mohapatra, S. K., Pradhan, J. & Sarangi, P. K. Comparative analysis of different classifiers using machine learning algorithm for diabetes mellitus. In Meta Heuristic Techniques in Software Engineering and Its Applications: METASOFT 2022, 32–42 (Springer, 2022).
Manghnani, P. & Moghe, A. Glaucoma detection using bi-dimensional empirical mode decomposition from retinal fundus images. Int. J. Intell. Eng. Syst. 14 (2021).
Manghnani, P. & Moghe, A. A. Bdimfs based features fusion and classification for glaucoma prediction. In 2024 IEEE International Students’ Conference on Electrical, Electronics and Computer Science (SCEECS), 1–6 (IEEE, 2024).
Dixit, S., Kirar, B. S. & Agrawal, D. K. Performance analysis of deep learning models for accurate glaucoma detection. In 2024 IEEE International Students’ Conference on Electrical, Electronics and Computer Science (SCEECS), 1–6 (IEEE, 2024).
Kirar, B. S., Reddy, G. R. S. & Agrawal, D. K. Glaucoma detection using SS-QB-VMD-based fine sub-band images from fundus images. IETE J. Res. 69, 4909–4920 (2023).
Article Google Scholar
Kirar, B. S., Agrawal, D. K. & Kirar, S. Glaucoma detection using image channels and discrete wavelet transform. IETE J. Res. 68, 4421–4428 (2022).
Article Google Scholar
Agrawal, D. K., Kirar, B. S. & Pachori, R. B. Automated glaucoma detection using quasi-bivariate variational mode decomposition from fundus images. IET Image Proc. 13, 2401–2408 (2019).
Article Google Scholar
Kirar, B. S. & Agrawal, D. K. Computer aided diagnosis of glaucoma using discrete and empirical wavelet transform from fundus images. IET Image Proc. 13, 73–82 (2019).
Article Google Scholar
Kirar, B. S. & Agrawal, D. K. Current research on glaucoma detection using compact variational mode decomposition from fundus images. Int. J. Intell. Eng. Syst. 12, 1–10 (2019).
Google Scholar
Gunning, D. et al. Xai-explainable artificial intelligence. Sci. Robot. 4, eaay7120 (2019).
Article PubMed Google Scholar
Tonekaboni, S., Joshi, S., McCradden, M. D. & Goldenberg, A. What clinicians want: contextualizing explainable machine learning for clinical end use. In Machine learning for healthcare conference, 359–380 (PMLR, 2019).
Messina, P. et al. A survey on deep learning and explainability for automatic report generation from medical images. ACM Comput. Surv. 54, 1–40 (2022).
Article Google Scholar
Temme, M. Algorithms and transparency in view of the new general data protection regulation. Eur. Data Prot. L. Rev. 3, 473 (2017).
Article Google Scholar
Appice, A., Guccione, P. & Malerba, D. Transductive hyperspectral image classification: Toward integrating spectral and relational features via an iterative ensemble system. Mach. Learn. 103, 343–375 (2016).
Article MathSciNet Google Scholar
Lavrac, N., Keravnou, E. & Zupan, B. Intelligent data analysis in medicine. Encyclopedia Comput. Sci. Technol. 42, 113–157 (2000).
Google Scholar
Yin, F. et al. Model-based optic nerve head segmentation on retinal fundus images. In 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2626–2629 (IEEE, 2011).
Lowell, J. et al. Optic nerve head segmentation. IEEE Trans. Med. Imaging 23, 256–264 (2004).
Article PubMed Google Scholar
Singh, L. K., Khanna, M., Thawkar, S. & Singh, R. Nature-inspired computing and machine learning based classification approach for glaucoma in retinal fundus images. Multimedia Tools Appl. 1–49 (2023).
Agboola, H. A. & Zaccheus, J. E. Wavelet image scattering based glaucoma detection. BMC Biomed. Eng. 5, 1 (2023).
Article CAS PubMed PubMed Central Google Scholar
Al-Akhras, M. et al. Using soft computing techniques to diagnose glaucoma disease. J. Infect. Public Health 14, 109–116 (2021).
Article PubMed Google Scholar
Usman, M., Fraz, M. M. & Barman, S. A. Computer vision techniques applied for diagnostic analysis of retinal oct images: A review. Arch. Comput. Methods Eng. 24, 449–465 (2017).
Article MathSciNet Google Scholar
Khan, S. I. et al. Automated glaucoma detection from fundus images using wavelet-based denoising and machine learning. Concurr. Eng. 30, 103–115 (2022).
Article Google Scholar
Acharya, U. R., Bhat, S., Koh, J. E., Bhandary, S. V. & Adeli, H. A novel algorithm to detect glaucoma risk using texton and local configuration pattern features extracted from fundus images. Comput. Biol. Med. 88, 72–83 (2017).
Article PubMed Google Scholar
Dey, A. & Dey, K. N. Automated glaucoma detection from fundus images of eye using statistical feature extraction methods and support vector machine classification. In Industry Interactive Innovations in Science, Engineering and Technology: Proceedings of the International Conference, I3SET 2016, 511–521 (Springer, 2018).
Latif, J. et al. Enhanced nature inspired-support vector machine for glaucoma detection. Comput. Mater. Continua 76 (2023).
Ananya, S., Bharamagoudra, M. R., Bharath, K., Pujari, R. R. & Hanamanal, V. A. Glaucoma detection using hog and feed-forward neural network. In 2023 IEEE International Conference on Integrated Circuits and Communication Systems (ICICACS), 1–5 (IEEE, 2023).
Parashar, D. & Agrawal, D. K. Automated classification of glaucoma stages using flexible analytic wavelet transform from retinal fundus images. IEEE Sens. J. 20, 12885–12894 (2020).
Article ADS Google Scholar
Shyla, N. J. & Emmanuel, W. S. Automated classification of glaucoma using dwt and hog features with extreme learning machine. In 2021 third international conference on intelligent communication technologies and virtual mobile networks (ICICV), 725–730 (IEEE, 2021).
Balasubramanian, K. & Ananthamoorthy, N. P. Correlation-based feature selection using bio-inspired algorithms and optimized KELM classifier for glaucoma diagnosis. Appl. Soft Comput. 128, 109432 (2022).
Article Google Scholar
Morales, S., Naranjo, V., Angulo, J. & Alcañiz, M. Automatic detection of optic disc based on PCA and mathematical morphology. IEEE Trans. Med. Imaging 32, 786–796 (2013).
Article PubMed Google Scholar
Huang, G.-B. & Siew, C.-K. Extreme learning machine: Rbf network case. In ICARCV 2004 8th Control, Automation, Robotics and Vision Conference, 2004., 2, 1029–1036 (IEEE, 2004).
Maheshwari, S., Pachori, R. B. & Acharya, U. R. Automated diagnosis of glaucoma using empirical wavelet transform and correntropy features extracted from fundus images. IEEE J. Biomed. Health Inform. 21, 803–813 (2016).
Article PubMed Google Scholar
Huang, M.-L., Chen, H.-Y., Huang, W.-C. & Tsai, Y.-Y. Linear discriminant analysis and artificial neural network for glaucoma diagnosis using scanning laser polarimetry-variable cornea compensation measurements in taiwan chinese population. Graefes Arch. Clin. Exp. Ophthalmol. 248, 435–441 (2010).
Article PubMed Google Scholar
Muduli, D., Sharma, S. K., Kumar, D., Singh, A. & Srivastav, S. K. Maithi-net: A customized convolution approach for fake news detection using maithili language. In 2023 International Conference on Computer, Electronics & Electrical Engineering & their Applications (IC2E3), 1–6 (IEEE, 2023).
Huang, G.-B., Wang, D. H. & Lan, Y. Extreme learning machines: A survey. Int. J. Mach. Learn. Cybern. 2, 107–122 (2011).
Article Google Scholar
Zhu, Q.-Y., Qin, A. K., Suganthan, P. N. & Huang, G.-B. Evolutionary extreme learning machine. Pattern Recogn. 38, 1759–1763 (2005).
Article ADS Google Scholar
Dua, S., Acharya, U. R., Chowriappa, P. & Sree, S. V. Wavelet-based energy features for glaucomatous image classification. IEEE Trans. Inf Technol. Biomed. 16, 80–87 (2011).
Article PubMed Google Scholar
Sharma, S. K. et al. An evolutionary supply chain management service model based on deep learning features for automated glaucoma detection using fundus images. Eng. Appl. Artif. Intell. 128, 107449 (2024).
Article Google Scholar
Bajwa, M. N. et al. G1020: A benchmark retinal fundus image dataset for computer-aided glaucoma detection. In 2020 International Joint Conference on Neural Networks (IJCNN), 1–7 (IEEE, 2020).
Zhang, Z. et al. Origa-light: An online retinal fundus image database for glaucoma analysis and research. In 2010 Annual international conference of the IEEE engineering in medicine and biology, 3065–3068 (IEEE, 2010).
Sundararajan, M., Taly, A. & Yan, Q. Axiomatic attribution for deep networks. In International conference on machine learning, 3319–3328 (PMLR, 2017).
Springenberg, J. T., Dosovitskiy, A., Brox, T. & Riedmiller, M. Striving for simplicity: The all convolutional net. arXiv preprint arXiv:1412.6806 (2014).
Kapishnikov, A. et al. Guided integrated gradients: An adaptive path method for removing noise. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 5050–5058 (2021).
Smilkov, D., Thorat, N., Kim, B., Viégas, F. & Wattenberg, M. Smoothgrad: removing noise by adding noise. arXiv preprint arXiv:1706.03825 (2017).
Selvaraju, R. R. et al. Grad-cam: Visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE international conference on computer vision, 618–626 (2017).
Do, M. N. & Vetterli, M. The finite ridgelet transform for image representation. IEEE Trans. Image Process. 12, 16–28 (2003).
Article ADS MathSciNet PubMed Google Scholar
Candès, E. J. & Donoho, D. L. Ridgelets: A key to higher-dimensional intermittency?. Philosophical Trans. Royal Soc. London Ser. A: Math. Phys. Eng. Sci. 357, 2495–2509 (1999).
Article ADS MathSciNet Google Scholar
Candes, E. J. & Donoho, D. L. Curvelets: A surprisingly effective nonadaptive representation for objects with edges (Stanford Univ Ca Dept of Statistics, Tech. Rep., 2000).
Candes, E., Demanet, L., Donoho, D. & Ying, L. Fast discrete curvelet transforms. Multiscale Model. Simulat. 5, 861–899 (2006).
Article MathSciNet Google Scholar
Nayak, D. R., Dash, R., Majhi, B. & Prasad, V. Automated pathological brain detection system: A fast discrete curvelet transform and probabilistic neural network based approach. Expert Syst. Appl. 88, 152–164 (2017).
Article Google Scholar
Lu, G.-F. & Zheng, W. Complexity-reduced implementations of complete and null-space-based linear discriminant analysis. Neural Netw. 46, 165–171 (2013).
Article CAS PubMed Google Scholar
Martinez, A. M. & Kak, A. C. Pca versus lda. IEEE Trans. Pattern Anal. Mach. Intell. 23, 228–233 (2001).
Article Google Scholar
Huang, G.-B., Zhu, Q.-Y. & Siew, C.-K. Extreme learning machine: a new learning scheme of feedforward neural networks. In 2004 IEEE international joint conference on neural networks (IEEE Cat. No. 04CH37541), vol. 2, 985–990 (IEEE, 2004).
Xavier, F. J. Odmnet: Automated glaucoma detection and classification model using heuristically-aided optimized densenet and mobilenet transfer learning. Cybern. Syst. 1–33 (2023).
Trojovskỳ, P. & Dehghani, M. Pelican optimization algorithm: A novel nature-inspired algorithm for engineering applications. Sensors 22, 855 (2022).
Article ADS PubMed PubMed Central Google Scholar
Simonyan, K. Deep inside convolutional networks: Visualising image classification models and saliency maps. arXiv preprint arXiv:1312.6034 (2013).
Mirjalili, S. Moth-flame optimization algorithm: A novel nature-inspired heuristic paradigm. Knowl.-Based Syst. 89, 228–249 (2015).
Article Google Scholar
Muduli, D., Dash, R. & Majhi, B. Automated breast cancer detection in digital mammograms: A moth flame optimization based elm approach. Biomed. Signal Process. Control 59, 101912 (2020).
Article Google Scholar
Xu, X. et al. Automatic glaucoma detection based on transfer induced attention network. Biomed. Eng. Online 20, 1–19 (2021).
Article Google Scholar
Chaudhary, P. K. & Pachori, R. B. Automatic diagnosis of glaucoma using two-dimensional fourier-bessel series expansion based empirical wavelet transform. Biomed. Signal Process. Control 64, 102237 (2021).
Article Google Scholar
Zhao, X. et al. Glaucoma screening pipeline based on clinical measurements and hidden features. IET Image Proc. 13, 2213–2223 (2019).
Article Google Scholar
Deperlioglu, O. et al. Explainable framework for glaucoma diagnosis by image processing and convolutional neural network synergy: Analysis with doctor evaluation. Futur. Gener. Comput. Syst. 129, 152–169 (2022).
Article Google Scholar
Sonti, K. & Dhuli, R. A new convolution neural network model KR-NET for retinal fundus glaucoma classification. Optik 283, 170861 (2023).
Article Google Scholar
Shoukat, A. et al. Automatic diagnosis of glaucoma from retinal images using deep learning approach. Diagnostics 13, 1738 (2023).
Article PubMed PubMed Central Google Scholar

Download references

Funding

Open access funding provided by Mälardalen University.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, C.V. Raman Global University, Bhubaneswar, 751012, India
Debendra Muduli, Santosh Kumar Sharma & Rakesh Ranjan Kumar
Department of Computer Science, Birla Institute of Technology, Ranchi, Jharkhand, 847226, India
Rani Kumari
Department of Information Technology, Vi3, Image Analysis, Uppsala University, Uppsala, Sweden
Rani Kumari
College of Computing and IT, Department of Data and Cybersecurity, University of Doha for Science and Technology, Doha, Qatar
Adnan Akhunzada
Department of Electrical-Electronics Engineering, Istinye University, 34010, Istanbul, Turkey
Korhan Cengiz
Department of Computer Science and Engineering, Indian Institute of Technology, Dhanbad, 826001, India
Dinesh Kumar Sah
Division of Networked and Embedded Systems, Mälardalen University, 721 23, Västerås, Sweden
Dinesh Kumar Sah

Authors

Debendra Muduli
View author publications
Search author on:PubMed Google Scholar
Rani Kumari
View author publications
Search author on:PubMed Google Scholar
Adnan Akhunzada
View author publications
Search author on:PubMed Google Scholar
Korhan Cengiz
View author publications
Search author on:PubMed Google Scholar
Santosh Kumar Sharma
View author publications
Search author on:PubMed Google Scholar
Rakesh Ranjan Kumar
View author publications
Search author on:PubMed Google Scholar
Dinesh Kumar Sah
View author publications
Search author on:PubMed Google Scholar

Contributions

D.M. conceived the experiment(s), R.K., D.K.S. and S.K.S. conducted the experiment(s), D.K.S. and K.C. analyzed the results. R.R.K. reviewed the manuscript. K.C. and A.A. provided additional insights and feedback on the manuscript.

Corresponding author

Correspondence to Dinesh Kumar Sah.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Muduli, D., Kumari, R., Akhunzada, A. et al. Retinal imaging based glaucoma detection using modified pelican optimization based extreme learning machine. Sci Rep 14, 29660 (2024). https://doi.org/10.1038/s41598-024-79710-7

Download citation

Received: 09 May 2024
Accepted: 12 November 2024
Published: 29 November 2024
DOI: https://doi.org/10.1038/s41598-024-79710-7