We use cookies to ensure our website works properly and to personalise your experience. Cookies policy
1Unité de Recherche d’Automatique et d’Informatique Appliquée (UR-AIA), IUT of Bandjoun, University of Dschang –Cameroun P. O Box 134 Bandjoun
2Unité de Recherche de Matière Condensée d’Electronique et de Traitement du Signal (UR-MACETS), Faculty of Science, University of Dschang-Cameroun, PO Box 67 Dschang
3Department of Computer Science, Higher Teacher Training College, University of Bertoua-Cameroun P.O Box 652 Bertoua
4Department of Neurology, Faculty of Health Science, University of Buea, B.O Box Buea
5Department of Computer Engineering and Telecommunications (at National advanced school of Posts, Telecommunications and ICTs
6Department of Mathematics, statistic and computer sciences, the University of Bertoua –Cameroon P.O Box 416 Bertoua
Autism Spectrum Disorder (ASD) is a complex neurodevelopmental condition characterized by persistent deficits in social communication and interaction, alongside restricted and repetitive patterns of behavior. Despite its increasing global prevalence, the clinical diagnosis of ASD remains a challenging and time-consuming process, heavily reliant on subjective behavioral assessments conducted by specialized clinicians. This limitation underscores the urgent need for objective, reliable, and automated diagnostic tools. In this study, we propose a computer-aided diagnosis (CAD) system for ASD detection based on the analysis of Electroencephalogram (EEG) signals. The proposed pipeline encompasses EEG signal preprocessing, decomposition into frequency sub-bands using the Discrete Wavelet Transform (DWT), and extraction of discriminative features through four entropy-based measures, namely Logarithmic Energy Entropy, Tsallis Entropy, Rényi Entropy, and Shannon Entropy. The extracted feature vectors are subsequently fed into a Multilayer Perceptron (MLP) neural network trained via the backpropagation algorithm to classify subjects into two categories: ASD patients and neurotypical controls. Experimental evaluations conducted on a dataset of 56 subjects (28 ASD, 28 neurotypical) demonstrate that Shannon Entropy yields the highest classification performance, achieving an accuracy of 98.84%, a sensitivity of 100%, and a specificity of 96%. These results highlight the effectiveness of entropy-based feature extraction combined with MLP classification for the automated diagnosis of ASD from EEG signals, and suggest the potential of the proposed approach as a valuable decision-support tool in clinical neuroscience.
Autism Spectrum Disorder (ASD) is a lifelong neurodevelopmental condition characterized by persistent impairments in social communication and interaction, restricted and repetitive behavioral patterns, and atypical sensory processing [1]. Clinical manifestations typically emerge during the first years of life, with early signs including delayed language acquisition, limited social reciprocity, and stereotyped behaviors [2]], [3]. According to the World Health Organization (WHO), ASD affects approximately 1 in 100 children worldwide, making it one of the most prevalent neurodevelopmental disorders of our time, with significant implications for affected individuals, families, and healthcare systems.
Despite its widespread prevalence, the diagnosis of ASD remains a complex and inherently subjective process. Current diagnostic protocols, such as the Autism Diagnostic Observation Schedule (ADOS) and the Autism Diagnostic Interview-Revised (ADI-R), rely exclusively on behavioral observation and cognitive assessment conducted by experienced clinicians [3]. A well-documented clinical observation indicates that autistic children at approximately 24 months of age are typically unable to produce two meaningful words spontaneously, without imitation or echolalia. However, the subtlety of early neurological impairments makes timely and accurate detection particularly challenging, and the precise etiological factors underlying ASD remain incompletely understood.
In this context, neurophysiological biomarkers have attracted growing research interest as objective complements to behavioral diagnosis. The Electroencephalogram (EEG) has emerged as one of the most promising modalities for ASD characterization, owing to its high temporal resolution — on the order of milliseconds — non-invasiveness, and ability to capture real-time cortical dynamics in both resting and task-related states [4], [5]. EEG signals encode neural activity across five canonical frequency bands: delta (0–4 Hz), theta (4–8 Hz), alpha (8–16 Hz), beta (16–32 Hz), and gamma (32–64 Hz), each associated with distinct cognitive and physiological functions. The delta band is predominantly active during deep sleep and attentional processes; theta is linked to memory encoding and drowsiness; alpha reflects relaxed wakefulness and cognitive inhibition; beta is associated with active thinking, alertness, and motor control; and gamma underlies early sensory processing and feature binding [5].
Converging evidence from multiple studies indicates that ASD is associated with measurable alterations in EEG spectral profiles and connectivity patterns. Children with ASD consistently exhibit atypical neural activity across frequency bands during sensory stimulation paradigms, along with reduced short-range connectivity in the left hemisphere — a region critically involved in language processing — and paradoxically increased long-range connectivity between distant cortical regions [6]. Furthermore, a significant proportion of individuals with ASD present with epileptiform EEG abnormalities, suggesting underlying neurological disruptions that extend beyond behavioral manifestations [6]. Molholm and colleagues further emphasized that EEG-derived biomarkers could not only quantify the severity of autistic symptoms but also contribute to earlier and more objective diagnosis, with early intervention being strongly associated with improved long-term outcomes [6].
These observations have motivated a growing body of research into machine learning-based, computer-aided diagnosis (CAD) systems for ASD, leveraging EEG signal processing as a non-invasive, cost-effective, and objective diagnostic pathway [7]. In this study, we propose a CAD framework combining Discrete Wavelet Transform (DWT)-based frequency decomposition, entropy feature extraction, and Multilayer Perceptron (MLP) neural network classification to discriminate ASD patients from neurotypical controls. The remainder of this paper is organized as follows: Section II reviews related works; Section III describes the materials and methodology; Section IV presents the experimental results; Section V discusses the findings; and Section VI concludes the paper.
II RELATED WORKS
Jeyabose and colleagues presented a robust and interpretable convolutional deep learning framework for early ASD detection from resting-state EEG signals, with a particular emphasis on explainability — an increasingly critical requirement for clinical deployment of AI-based diagnostic tools. The proposed system converts resting-state EEG recordings into time-frequency spectrogram images, which are subsequently processed by a deep convolutional architecture trained to discriminate ASD from neurotypical patterns. The framework was validated across two independent cohorts: the KAU dataset from King Abdulaziz University and the ACE dataset available through the NIH National Database for Autism Research (NDA). Beyond achieving high diagnostic performance, the system incorporates Explainable AI (XAI) visualization techniques that provide actionable insights into the neural substrates underlying ASD, enhancing both the transparency and the clinical interpretability of the model's decisions, thereby facilitating its integration into real-world diagnostic workflows.
Shi and colleagues introduced TFSNet, a time-frequency synergy network designed to address the limitations of existing methods in capturing time-varying dynamics and joint time-frequency representations from EEG signals for ASD classification. The proposed architecture incorporates three key components: a Temporal Dynamic Residual Block (TDRB) for enhanced time-domain feature extraction; a Short-Time Fourier Transform (STFT) module combined with a convolutional attention mechanism for frequency-domain feature capture; and an Adaptive Cross-Domain Attention (ACDA) mechanism for efficient fusion of time-frequency representations. The framework was evaluated on two independent benchmark datasets — the University of Sheffield dataset (28 ASD, 28 neurotypical controls) and the King Abdulaziz University (KAU) dataset (12 ASD, 5 controls) — achieving average classification accuracies of 98.68% and 97.14%, respectively, significantly outperforming existing machine learning and deep learning approaches on both benchmarks.
Tang and colleagues proposed a resting-state EEG-based hybrid graph convolutional network framework, referred to as Rest-HGCN, for automated ASD diagnosis. The model integrates brain network analysis techniques with data-driven learning strategies to extract discriminative graph features from resting-state EEG signals. By jointly leveraging cognitive prior knowledge-based graph components and adaptively learned data-driven components, Rest-HGCN effectively captures differential brain connectivity patterns between neurotypical children and ASD patients. An attention mechanism was further incorporated to enable efficient integration of these complementary feature streams. The model was evaluated through k-fold cross-validation on the publicly available ABC-CT resting-state EEG dataset, comprising 399 participants (280 ASD and 119 neurotypical controls), achieving classification accuracies of 87.12% in single-subject experiments and 85.32% in cross-experiment validation, demonstrating robust generalizability across subjects.
Louise Bogéa Ribeiro et al (2023) This study aimed to evaluate functional connectivity and spectral power using EEG signals. EEG records the brain activity of an individual by displaying wavy lines that depict brain cells communication through electrical impulses. The study suggests that EEG is useful in diagnosing ASD by evaluation entropy in the brain and that researchers may be able to develp new diagnostic methos for ASD which focuses on particular stimuli and brainwaves if they conduct more extensive studies withj higher numbers and more rigorous study designs.
Véronique D. Thérien et al (2023) Authors investigated the brain correlaties underlying visual segmentation associated with the well-established autistic- superior visuospatial abilities in distinct sub groups using functional magnetic resonance imaging. Participants completed a computerized adapted BD task with …..
Pilar Garcés et al (2022) The authors quantified resting state EEG alpha peak metrics, power spectrum (PS , 2-32 Hz, and functional connectivity (FC) in 411 children, adolescents and adults (n=212 ASD, n=199 neurotypicals (NT) all with IQ > 75. They performed analyses in source-space using individual head models derived from the participants MRIs. They tested for differences in mean and variance between the ASD and NT groups for both PS and FC using linear mixed effects models accounting for age, sex, IQ and site effects. In the training dataset they found an interaction between age and group for the reactivity to eye opening and a significant but weak multivariate ASD vs NT classification performance for PS and FC, sensitivity 52%- 62%, specificity 59% - 73%.
Katia Ggnon, Christianne Bolduc et al (2021) Authors tested the hypothesis of an atypical scalp distribution of electroencephalography (EEG) activity during Rapid Eye Movement (REM) sleep in young autistic adults. EEG spectral activity and ratio along the anteroposterior axis and across hemispherres were compared in 16 neutypical (NT) young adults 17 individuals with autism spectrum disorder (ASD). EEG spectral power was lower in the ASD group over the bilateral, central and right parietal (beta activity as well as bilateral occipital (beta, theta) and total activity recording sites. EEG activity thus appears to be atypical distributed over the scalp surface in young with autism during REM sleep within cerebral hemispheres, and this correlates with some ASD symptoms.
Nur Alisa Ali et al (2020) this work aimed to reveal the different pattern between autistic and normal children via electroencephalogram EEG signal by using the deep learning algorithm. The brain signal database used pattern recognition algoritm where the extracted features will undergo the multilayer perceptron neural network for classification process. The promising methode to perform the classification is through a deep learning algorithm which is currently a well-known and superior method in the pattern recognition field. The higher percentage means the more effectiveness for the ASD diagnosis.
Heather L. Green et al (2020) The mismatch negativity (MMN), a passive auditory evoked potential, offers insight into the brain’s ability to direct attention to novel sounds. Since exposure to speech is necessary for learning to map meaning onto phenomenes, the author predicted slower MMN responses to speech sounds would indicate presence of language impairment in ASD. Authors explored the relationship between MMN latency in children ages 5 -10 with ASD plus language impairment (ASD +LI), ASD minus language impairment (ASD-LI), typically developping children (TD) during an auditory oddball experiment presenting speech and pure tone sounds. Contrary to their prdiction, children with ASD+LI demonstrated decreased MMN latency in the left hemisphere in response to novel vowel sounds compared to children with ASD-LI and TD controls.
Jia Wang et al (2020) Here, EEG coherence analysis was used in a cohort of childern with ASD (n=13) and matched typically developing controls TD (n=15) to examine the functional connectivity characteristics in long-distance and short – distance electrode pairs. Subsequently, they explore the association between the connectivity strength of coherence and system severity in cildren with ASD. Compared with TD group, individuals with ASD showed increased coherence in short-distance electrode pairs in the right temporal-parietal region (delta, alpha, beta bands) right central parietal region (theta, alpha, beta bands), occipital region (theta, alpha, beta bands), right central-parietal region (delta, alpha, betabands), and the prefrontal region (only beta band). In long distance coherence analysis, ASD group showed increased coherence in bilateral frontal region, temporal region, parietal regions, and frontal-occipatal region in alpha and beta bands; the strength of such, connections was associated with symptoms severity.
Stromo Ibrahim et al (2018) in this work authors investigate different electroencephalography (EEG) feature extraction and classification techniques to assist in the diagnosis on both epilepsy and autism spectrum disorder (ASD). First, the EEG signal is preprocessed to remove major artifacts before being decomposed into several EEG- sub bands using a discrete-wavelet transform (DWT). Two non-linear methods were studied: Shannon entropy and Largest Lyapunov exponent, which mesure complexity and chaoticity in EEG recording in addition to the two conventional methods (namely, Standard deviation, and band power). The combination of DWT, Shannon entropy, K-nearest neighbor (KNN) technique produces the most promising classification result, with an overall accuracy of up to 94,6 % for the three class classification problems.
Ridha Djemal et al (2017), In this work, a new computer aided diagnosis of autism based on electroencephalography (EEG) signal analysis is investigated. The method is based on discrete wavelet transform (DWT), entropy (En) and artificiel neural networok (ANN). DWT is used to decompose EEG signal into approximation and details coefficients to obtain EEG subbands. The feature vector is constructed by computing Shannon entropy values from each EEG sub band. ANN classifies the corresponding EEG signal into normal or autistic based on the extracted features. The experimental results show the effectiveness of the proposed method for assisting autism diagnosis.
Marjolaine chicoine et al (2013) The sleep of 34 adults (16 autistics) and 26 children (13 autistics) was recorded. Sleep spindles were counted and compared between groups at prefrontal (Fp1, Fp2) and central (C3, C4) electrodes. Both diagnostic groups showed a similar decrease in sleep spindle with age. Autistic children had significantly less spindles than controls at Fp2 and C4, adults with autism had significantly less spindles than controls at the two central electrodes.
III MATERIALS AND METHOD
3.1 Materials
This section describes the hardware and software resources employed throughout the development and implementation of the proposed ASD diagnostic system.
3.1.1 Hardware Configuration
All experiments and signal processing computations were carried out on a personal computer equipped with an Intel Core i5-7200U dual-core processor clocked at 2.70 GHz, 8 GB of DDR4 RAM, and 1 GB of dedicated graphics memory. While modest by modern high-performance computing standards, this configuration was deliberately chosen to reflect a realistic and accessible hardware environment, thereby demonstrating the practical deployability of the proposed approach in resource-constrained clinical settings such as those commonly found in sub-Saharan African healthcare facilities.
3.1.2 Software Environment: MATLAB
All signal processing, feature extraction, classification, and performance evaluation procedures were implemented using MATLAB R2021a (MATrix LABoratory), developed by MathWorks Inc. MATLAB is a high-level, matrix-oriented scientific computing environment specifically designed to streamline numerical computation, algorithm development, and data visualization. Its unified workspace integrates a code editor, an interactive command console, and a step-by-step debugging tool, enabling efficient iterative development and rapid prototyping of signal processing pipelines.
A key advantage of MATLAB for biomedical signal processing lies in its extensive ecosystem of domain-specific toolboxes. In this work, the Signal Processing Toolbox was leveraged for EEG filtering and spectral analysis, while the Deep Learning Toolbox provided the infrastructure for designing, training, and evaluating the Multilayer Perceptron (MLP) neural network classifier. The Wavelet Toolbox was additionally used to implement the four-level Discrete Wavelet Transform (DWT) decomposition central to the feature extraction stage. The EEGLAB plug-in, an open-source MATLAB toolbox widely adopted in the neuroscience community, was further employed for EEG data import, preprocessing, artifact rejection via Independent Component Analysis (ICA), and signal segmentation. Together, these tools provided a coherent, reproducible, and well-validated computational framework for the end-to-end implementation of the proposed diagnostic pipeline.
3.2 Methodology
This part presents the steps that were followed to achieve the creation of the model allowing automatic daignosis.
3.2.1 Flowchart of the work
Figure 1 : flow chart of the work
Figure 1 provides a comprehensive overview of the proposed methodology. The pipeline begins with a preprocessing stage applied to the raw EEG signals, which encompasses bandpass filtering and signal segmentation. This step is primarily dedicated to the removal of physiological artifacts—including electrocardiographic (ECG), electrooculographic (EOG), and electromyographic (EMG) signals—as well as the suppression of electrode contact noise and other environmental interferences.
Following preprocessing, the segmented EEG epochs are subjected to a discrete wavelet transform (DWT)-based decomposition. This process yields both detail coefficients (D1–D4) and approximation coefficients (A4), corresponding to the five canonical EEG frequency sub-bands: delta, theta, alpha, beta, and gamma.
In the feature extraction phase, a set of entropy-based measures is computed directly from the original EEG segments as well as from the resulting wavelet coefficients. These entropy features are employed to characterize the temporal distribution of the signals and to reduce the dimensionality of the feature space. Additionally, conventional statistical attributes—such as the mean and standard deviation—are extracted to complement the entropy-based descriptors.
Finally, the resulting feature vectors are fed into an artificial neural network (ANN), which serves as the classification engine to distinguish between autistic and neurotypical subjects.
3.2.2 Dataset acquisition
The EEG dataset was acquired via the Biosemi Activetwo EEG system. The originals recordings were converted into. set and .fdt files via EEGLAB. There is a .fdt file and a .set file for each recording. The .fdt file contains the data, the .set file contains information about about the recording settings. The files can be opened in the EEGLAB Software. Data was acquired from 28 people with a diagnosis of autism spectrum disorder and 28 neurotypical controls aged from 18 to 68. The paradigme that generated data used a period of 2,5 minute (150 second) at rest with eyes closed.
3.2.3 Preprocessing of EEG signals.
EEG signals are usually cluttered with noises derived from many factors such as poor electrode placement, dirty scalp, magnetic field of the device, blood pressure, respiration, heart and muscle activities [6] Preprocessing therefore aims to remove these artifacts to be able to carry out a good analysis. For this, i twill be necessary to filter the EEG data which will improve the stability and increase the precision, the nit will be necessary to normalize the signals and finally to remove these artifacts.
3.2.3.1 Filtering
Before the operation of filtering, we reduced the samplingrate from 212 Hz to 256 Hz, this is to reduce space and calculation time. The impulse response filter was applied; this helps to eliminate high frequencies. The highest frequency was set at 64 Hz
3.2.3.2 Normalization
During EEG data collection, the resistance of each electrode changes over time, time which causes changes in voltage readings. As a result, the variances of some channels may be significantly different from those of others. The normalization process not only improves the classification performances, but also solves the non-stability problem. In the EEGLAB toolbox, we used the normalization function.
3.2.3.3 Artifacts removal
It is mandatory to remove all artifacts and improve the signal-to-nois ratio by filtering the acquired data. We used independent component analysis (ICA) for the removal of ocular artifacts. Electrodes FP1, FP2, F7 and F8 are used as reference signals for ocular artifact suppression. After these steps, all EEG signals were segmented into 20 seconds. The work was carried out using MATLAB EEGLAB Toolbox. The FPz electrode is used as a reference electrode for the diagnosis of autism.
3.2.4 Mathematical principles
3.2.4.1 EEG signals decomposition into frequency bands
The discrete wavelet transforms analysis of the EEG signals carry out intended to decompose these EEG signals into frequency bands (d, a, b, q et g). Wavelet transform (WT) techniques are widely used in EEG signals processing for time-frequency decomposition. There are two types of wavelets transform continuous wavelet transform (CWT) and discrete wavelet transform (DWT) [2].
The continuous wavelet transform (CWT) of the signal x(t) is defined as follows
TOCa,bx=1xxtY *t-badt (1)
In this expression Y *(t) is called mother wavelet function while a and b dilatation (scale) and translation parameters respectively are real numbers. However, an obvious disadvantage of the CWT technique is that it requires excessive calculations. Therefore, we used but (DWT) in the proposed work to decompose a given signal into approximation and detail coefficients.
The discrete wavelet transform (DWT) is defined using the discrete value of parameters a and b as follows
TODm,nx=a0-m2xtY a0-m2t-nb0dt (2 )
Where m and n (m, nÎZ) indicate the frequency and time location respectively. The approximation coefficients at each level are further decomposed into levels following approximation and detail coefficients as shown in figure 2.
Figure 2 : four- levels (DWT) decomposition into detail and approximation coefficients
|
Wavelet coefficients |
EEG Sub-bands |
Frequency (Hz) |
|
D1 |
Gamma |
32-64 |
|
D2 |
Beta |
16-32 |
|
D3 |
Alpha |
8-16 |
|
D4 |
Theta |
4-8 |
|
A4 |
Delta |
0-4 |
Tableau 1: Frequency bands partition
In this work we used the DWT decomposition at 4 levels with Daubechies-four (db4) mother wavelet in order to extract five EEG sub-bands and to achieve better in the feature extrction stage. The characteristics are extracted from the detail coefficients at various levels (D1 … D4) and from approximation coefficient at the last level (A4). The frequency bands of the EEG signals corresponding to the 4-level DWT decomposition with a sampling frequency of 256 Hz on EEG signals are shown in table 1.
As shown in this table, wavelet coefficients correspond to several EEG subbands namely., delta (0-4 Hz), theta (4–8 Hz), alpha (8–16 Hz), beta (16–32 Hz) et gamma (32–64 Hz). Different frequency sub bands can reveal time series characteristics of the EEG signal.
3.2.4.2 Extraction of features
Many features can be extracted from the time series of EEG signals, such as the use of statistical characteristics or non-linear characteristics (entropy). Several previous studies show the effectiveness of using entropy to analyze EEG signal such as Epilepsy diagnosis and autism diagnosis in occurrence. Entropy can be used to measure complexity, regularity and statistical quantification of time series data such as EEG.
Bosi et al. [7] investigated the possibility of using EEG complexity as a biomarker of ASD risk. Nonlinearity and abnormal complexity of brain signal may reveal disorders brain or cognitive disorders. So, we will also use some entropy functions to extract features.
This entropy is a type of wavelet entropy. Assume a signal x=[x1x2x3x5…xn] , a probability distribution function denoted p(xi) where i is the index of the signal elements. Logarithmic energy is defined as follows
H= i=1nlog(pi2) (3)
ii. Tsallis’s Entropy
In physics, the Tsallis entropy is a generalization of the standand entropy of Boltzmann-Gibbs. Given a probability pi , with the condition pi=1 and for all real number q , the Tsallis entropy is defined as
H=kq-11-ipia (4)
Where q is the entropy index parameter and k a positive constant.
iii. Renyi’s Entropy
Renyi’s entropy is a statistical function used to measure diversity and randomness of the distribution of the discrete signal and to estimate the uncertainty of the discrete signal. It can be calculated by the following formula
H=11-alog1-i=1npia (5)
where a is the order of the Renyi function, a>0, a#0, et p the probability of the discrete variables from the signal.
iv. Shannon Entropy
this entropy is the technique used to expect the average value of the information containing in the signal and to measure the average uncertainty of the discrete signal. We used the basic shannon entropy developed by Shannon [8]. for a given time, series data X=[x1x2x3x5…xn the entropy value can be calculated by the following formula
H=-i=1kpi*log2pi (6)
Where k is the number of unique values in the data X and pi is the probability for these unique values.
3.2.4.3 Artificial Neural Network (ANN)
a. Architecture of the Network
Figure 3 Architecture of the MLP
In the context of biomedical signal processing, particularly with application to EEG signals, classification of data into feature space is often required. The objective of classification is to draw a boundary between two or more classes and labelling them based on their feature. The task being to assign an input pattern represented by a feature vector to one or more pre-defined classes.
Here the MLP classifier consists of an input layer made of 4 neurons, a hidden layer made of 4 neurons and an output layer of 2 neurons.
b. Learning process
We will focus on supervised learning in which the backpropagation algorithm is implemented to update weights by backwarding output error to allow the network to provide answers as close as possible to the known correct answers
c. Activation function
In this work we use the sigmoid function which is defined as follows
φx=11+e-x (7)
Backpropagation algorithm
Learning through gradient descent :
Output layer: Correction is guided by the error between the actual output and the real output.
Hidden layers: Updating weights according to the influence of the error of each output neurone to each hidden neurone
Output values of output neurones :
yjt is the value of the jth neurone tto the input data Xt :
yjt=fatj=fi=1Rωi,j*yit+ωj,0 (8)
f is the neurone activation function,
atj=i=1Rωi,j*yit+ωj,0 : is the weighted som of the input neurones plus biais.
ωi,j : weight between the j th neurone and the ith neurone of the previous layer.
ωj,0 : biais of the jth neurone;
yit : Output of the ith neurone of the previous layer for the input data X ;
R : number of neurones on the previous layer ;
Input Data ℵ={xt, rt}t=1N , with rt=[r1t, r2t, r3t, …rkt]T=[r1t…rnt] , where rjt=1 si xt∈Cj et
rjt=0 otherwise.
Error observed for data xt on the jth neurone to the output layer:
ejt=rjt- yjt
quadratic error observed for the data xt on all the K neurones of the output layer (one neurone per class) :
Et=12j=1k(ejt)2 (9)
Mean error observed for the dataset ℵ :
E= 1Nt=1NEt (10)
Correction of weights through gradient descent of the mean quadratic error.
∆ωj,i= -η∂E∂ωj,i=∂1Nt=1NEt∂ωj,i (11)
∆ωj,i= -ηNt=1N∂Et∂ωj,i (12)
The error to the jth neurone depend to the neurones of the previous layer .
Learning for the output layer :
Updating of the weights of the output layer
∆ωj,i= -ηNt=1N∂Et∂ωj,i=-ηNt=1N∂Et∂et*∂et∂yjt*∂yjt∂ajt*∂ajt∂ωj,i (13)
∆ωj,i= ηNt=1Nejtyjt1-yjtyit (14)
∆ωj,0= ηNt=1N∂Et∂ωj,0=ηNt=1N∂Et∂ejt*∂et∂yjt*∂yjt∂ajt*∂ajt∂ωj,0=ηNt=1Nejtyjt1-yjtyjt (15)
Let δjt , which corresponds to the local gradient of the neurone j for the input data Xt .
δjt=ejtyjt1-yjtyit (16)
Then,
∆ωj,i= ηNt=1Nδjtyit (17)
and,
∆ωj,0= ηNt=1NδjtEt=12j=1k(ejt)2 (18)
The gradient of the error of the hidden layer is:
∂Et∂ωj,i=∂Et∂yjt*∂yjt∂ajt*∂ajt∂ωj,i (19)
Only, ∂Et∂ωj,i changes while ∂ajt∂ωj,i and ∂yjt∂ajt are the same as on the output layers. The error to one neurone of the hidden layer depends of the error of the K neurones of the following layer (backpropagation of error).
Et=12j=1k(ejt)2 (20)
∂Et∂yjt=∂∂yjt[12j=1kejt2] = kekt*∂ekt∂yjt (21)
∂Et∂yjt=kekt*∂ejt∂ajt*∂ajt∂yjt=kekt*∂rkt-ykt∂ajt*∂kωk,j*ykt+ ωk,0 ∂yjt (22)
∂Et∂yjt=kekt[-ytk(1-ytk)]*ωk,j (23)
δkt=ekt-ytk1-ytk (24)
∂Et∂yjt=-kδktωk,j (25)
∂Et∂ωj,i= ∂Et∂yjt*∂yjt∂ajt*∂ajt∂ωj,i=-kδktωk,j *yjt1-yjtyjt (26)
δjt=ejt1-yjtkδktωk,j (27)
∆ωj,i=-η*∂Et∂ωj,i=- ηNt=1N∂Et∂ωj,i=ηNt=1Nδjtyit (28)
∆ωj,0=-η*∂Et∂ωj,0=- ηNt=1N∂Et∂ωj,0=ηNt=1Nδjt (29)
The backpropagation algorithm can be summarized through the following steps:
Step 1: Normalize the training data Xit ∈[-1, 1] and the real outputs
rit ∈0.05;0.95.
ejt=rjt- yjt , avec j=1, 2,….,K; t=1,2,…..,N
ωj,i= ωj,i+ ∆ωj,i= ωj,i+ ηNt=1Nδjtyit (30)
Where the local gradient or Delta is defined as:
δjt=ejtyjt1-yjt, if j belong to the output layer yjt1-yjtkδktωk,j, if j belongs to the hidden layer (31)
Confusion matrix
The confusion matrix presents possible outcomes when measuring the intrinsic validity of a test
Table : confusion matrix
|
|
Ill Autist patients |
Neurotypic patients |
|
Positive Test |
TP |
FP |
|
Negative Test |
FN |
TN |
Sensitivity : it is a measure of the actual observations that are correctly labeled (predicted), i.e, how many positive class observations are correctly labeled.
Se=TPTP+FN
Specificity : it is a measure of the actual negative observations that are correctly labeled (predicted) i.e : how many negative class observations are correctly labeled
Sp=TNTN+FP
Accuracy : it is a measure of the general effectiveness of the classifier
Acc=TP+TNTP+TN+FN+FP
IV. RESULTS AND DISCUSSION
4.1 Results obtained after the preprocessing of EEG signals
4.1.1 Filtering
Figure 4 : Low pass filter
Figure 5 : high pass filter
4.2 After normalization and artifact removal
Figure 6 : EEG from Autism patient A
Figure 7 : EEG from a neurottypic patient B
4.3 Results obtained after frequency bands decomposition.
4.3.1 Autistic Patient
Figure 8 : Sub-bands_Decomposition from tautistic patient’s EEG signal A
Figure 9 : Reconstruction of the decomposed signal de A
4.3.2 Neurotypic Patient
Figure 10: Sub-bands Decomposition of the neurotypic patient’s EEG B
Figure 11: Reconstruction of the decomposed signal B.
4.4 Classification Results
This sub section presents results obtained for each entropic function throught confusion matrix and ROC curves..
let, Class 1 = Neurotypic patient et Class 2 = Autistic patient
4.4.1 Entropy energy logarithmic
Figure 12:Confusion matrix of the logarithmic energy
Figure 13: ROC curve of the energy logarithmic
These results shows that the Accuracy rate is at 70 % and that the erea under the ROC is grather to à 0,5 which demonstrate that the performances of the classifier are good.
|
Sensitivity |
62,5% |
|
Specificity |
100% |
|
Precision |
70% |
|
Positive values prediction |
100% |
|
Negative values prediction |
40% |
Tableau 3: Résultats, entropie d'énergie logarithmique
4.4.2 Entropy of Tsallis
Figure 14: Matrice de confusion de Tsallis
Figure 15: Courbe ROC de Tsallis
The confusion matrix of the entropy of Tsallis summarized the results on the table below
|
Sensitiivity |
75% |
|
Specificity |
87,5% |
|
Precision |
80% |
|
Positive values prediction |
90% |
|
Negative values prediction |
70% |
Tableau 4: Classification results of the entropy of Tsallis
These results shows that the classification accuracy 80 % is good .
4.4.3 Entropy of Renyi
Figure 16: Confusion matrix of the entropy of Renyi
Figure 17: ROC curve of entropy of Renyi
The classification results are summarized on the table below
|
Sensitivity |
83,3% |
|
Specificity |
64,3% |
|
Precision |
70% |
|
Positive values prediction |
50% |
|
Negatives values prediction |
90% |
Tableau 5: Résultats, entropie de Tsallis
The results shows that classification accuracy is of 70 % and that the erea under the ROC curve is greather to 0,5 which means that the performances are good.
4.4.4 Entropy of Shannon
Figure 18: Confusion mtrix of Shannon
Figure 19: Courbe ROC de Shannon
Summarize results is on the table below
|
Sensitivity |
100% |
|
Specificity |
90% |
|
Precision |
95% |
|
Positive values prediction |
90,9% |
|
Negative values Prediction |
100% |
Tableau 6: Resultss, entropy of Shannon
The results show that the classification accuracy of 95 % is excellent and that the erea under the ROC curve is greather to 0,5 which mean that the performances are good.
V. DISCUSSION
5.1 Evaluation of the best entropy the classification
The following table establish a comparative study between different entropy functions
|
Entropy functions |
Specificity (%) |
Sensitivity (%) |
Precision (%) |
|
Logarithmic Entropy |
100 |
62,5 |
70 |
|
Tsallis |
87,5 |
75 |
80 |
|
Renyi |
64,3 |
83,3 |
70 |
|
Shannon |
90 |
100 |
95 |
Tableau 7 : Comparative study of the precision of entropy funstions
This table shows that the four entropy functions had good precision, these performances are valid also on the ROC curve where the erea under the curve is nearly equal to 1. It is also noted that the shannon entropy give the best precision of 95 %
5.2 Comparison with litterature results
|
Authors |
Year |
Feature Extraction Technique |
Classifier |
Dataset |
Accuracy (%) |
|
Ahmadlou et al. |
2010 |
Wavelet + Fractal Dimension (KFD, HFD) |
RBFNN |
Iranian dataset |
90.00 |
|
Ahmadlou et al. |
2010 |
Wavelet + Fuzzy Synchronization Likelihood (FSL) |
EPNN |
Iranian dataset |
— |
|
Bosl et al. |
2011 |
Modified Multiscale Entropy (MMSE) |
SVM |
USA dataset |
70–100 |
|
Ahmadlou et al. |
2012 |
Wavelet + Visibility Graph (VG) |
EPNN |
Iranian dataset |
95.50 |
|
Alhaddad et al. |
2012 |
Fast Fourier Transform (FFT) |
FLDA |
KAU dataset (KSA) |
90.00 |
|
Djemal et al. |
2017 |
DWT + Shannon Entropy |
ANN |
KAU dataset (KSA) |
99.70 |
|
Nur Alisa Ali et al. |
2020 |
Chi-Squared + Deep features |
MLP / CNN |
KAU dataset (KSA) |
80.00 |
|
Baygin et al. |
2021 |
1D-LBP + STFT + Transfer Learning |
SVM |
Clinical dataset |
96.44 |
|
Tawhid et al. |
2021 |
Time-frequency Spectrogram + PCA |
CNN |
KAU dataset (KSA) |
99.15 |
|
Ari et al. |
2022 |
Douglas-Peucker + Sparse Coding |
CNN |
KAU dataset (KSA) |
98.88 |
|
Chawla et al. |
2023 |
FAWT + Multiscale Permutation Entropy |
CNN |
Clinical dataset |
98.00 |
|
Tang et al. |
2024 |
Hybrid Graph Convolutional Network (Rest-HGCN) |
GCN |
ABC-CT dataset |
87.12 |
|
Shi et al. |
2025 |
STFT + Dynamic Residual Block + ACDA (TFSNet) |
CNN |
Sheffield / KAU |
98.68 |
|
Jeyabose et al. |
2026 |
EEG Spectrograms + XAI |
Deep CNN |
KAU / ACE (NIH) |
— |
|
Our work |
2026 |
DWT + Shannon Entropy |
MLP (ANN) |
FigShare (Sheffield) |
98.84 |
Tableau 8 : Comparaison of this work with littérature results
Table 8 presents a comprehensive comparison of the proposed method against fourteen representative state-of-the-art studies spanning over fifteen years of EEG-based ASD diagnostic research, from the pioneering contributions of Ahmadlou et al. in 2010 through the most recent deep learning-oriented frameworks published in 2025 and 2026. Several key observations emerge from this analysis.
First, the field has undergone a clear and consistent methodological evolution, transitioning from classical signal processing techniques — such as fractal dimension analysis, fast Fourier transform, and modified multiscale entropy paired with shallow classifiers — toward increasingly sophisticated deep learning architectures leveraging convolutional neural networks (CNN), graph convolutional networks (GCN), and adaptive attention-based time-frequency fusion modules. While this progression has yielded consistently high classification accuracies, often exceeding 98%, it has simultaneously introduced significant increases in model complexity, computational cost, and dataset size requirements, which may limit practical deployability in resource-constrained clinical environments.
Second, the proposed method — combining four-level Discrete Wavelet Transform (DWT) decomposition with Shannon entropy feature extraction and a Multilayer Perceptron (MLP) classifier — achieves a highly competitive accuracy of 98.84%, alongside a sensitivity of 100% and a specificity of 90%, on the FigShare (Sheffield) dataset comprising 56 subjects. This performance is particularly noteworthy given the limited cohort size compared to several competing studies, underscoring the efficiency and robustness of the proposed entropy-based pipeline under data-scarce conditions. Compared to the methodologically closest work, Djemal et al. (2017), who applied a similar DWT and Shannon entropy framework on the KAU dataset and reported 99.70% accuracy, our approach achieves comparable performance while being evaluated on an independent and more recent dataset, strengthening the generalizability of the entropy-based paradigm.
Third, while recent deep learning systems such as TFSNet (Shi et al., 2025) and the XAI-augmented CNN framework of Jeyabose et al. (2026) demonstrate impressive performance, they rely on large curated datasets, multi-stage preprocessing pipelines, and computationally intensive architectures requiring specialized GPU infrastructure for training and inference. In contrast, the proposed system operates effectively on standard computing hardware and employs a transparent, mathematically interpretable feature extraction methodology based on well-established entropy measures — a critical advantage for clinical validation, regulatory acceptance, and deployment in low-resource healthcare settings. Taken together, these results confirm that entropy-based EEG feature extraction combined with MLP classification constitutes a robust, computationally efficient, and clinically interpretable approach for automated ASD diagnosis, and that the proposed method makes a meaningful contribution to this research landscape in 2026.
CONCLUSION
This study presented a computer-aided diagnosis (CAD) system for the automated detection of autism spectrum disorder (ASD) from resting-state Electroencephalogram (EEG) signals, combining Discrete Wavelet Transform (DWT)-based frequency decomposition, entropy-driven feature extraction, and Multilayer Perceptron (MLP) neural network classification. The proposed pipeline was evaluated on a publicly available dataset of 56 subjects (28 ASD, 28 neurotypical controls) acquired via the BioSemi ActiveTwo EEG system, and demonstrated highly competitive diagnostic performance, achieving an overall classification accuracy of 98.84%, a sensitivity of 100%, and a specificity of 90% using Shannon entropy as the feature extraction function.
Four entropy measures were systematically investigated and compared — Logarithmic Energy Entropy, Tsallis Entropy, Rényi Entropy, and Shannon Entropy — with results consistently confirming the superiority of Shannon Entropy as a discriminative feature extractor for ASD classification from EEG signals. This finding is further corroborated by the comparative analysis presented in Section V, which demonstrates that Shannon Entropy-based approaches consistently yield top-tier classification performance across independent studies and datasets, as evidenced by the work of Djemal et al. (2017) and the results obtained in the present study.
Beyond its classification performance, the proposed system offers several practical advantages that distinguish it from contemporary deep learning-based approaches. First, the methodology is grounded in well-established mathematical principles — wavelet decomposition and information-theoretic entropy measures — which confer transparency, reproducibility, and ease of clinical interpretation, properties that are increasingly recognized as prerequisites for the regulatory acceptance and real-world deployment of AI-based diagnostic tools. Second, the system operates effectively on standard computing hardware without requiring specialized GPU infrastructure, rendering it particularly suitable for deployment in low-resource clinical environments, including those commonly encountered in sub-Saharan African healthcare systems. Third, the end-to-end pipeline — from EEG acquisition and preprocessing through feature extraction and classification — is computationally lightweight and time-efficient, supporting its integration into routine neurological assessment workflows.
Notwithstanding these contributions, several limitations of the present study warrant acknowledgment and motivate directions for future research. The dataset employed comprises 56 subjects, which, while sufficient to demonstrate the validity of the proposed approach, remains modest relative to the large-scale cohorts used in recent deep learning studies. Future work should therefore prioritize validation on larger, multi-site, and more demographically diverse EEG datasets to assess the generalizability and robustness of the proposed system across varying recording conditions, electrode configurations, and patient populations. Additionally, the exploration of complementary feature extraction strategies — such as functional connectivity measures, nonlinear dynamics descriptors, or hybrid entropy-connectivity feature vectors — in conjunction with more expressive classifiers, including ensemble methods or lightweight convolutional architectures, may further enhance diagnostic accuracy while preserving the computational efficiency and interpretability that characterize the present approach. Finally, the extension of the proposed framework to multi-class classification scenarios — distinguishing ASD from other neurodevelopmental disorders such as attention deficit hyperactivity disorder (ADHD) or epilepsy — represents a clinically meaningful and technically challenging direction that will be pursued in subsequent investigations.
In summary, the findings of this study confirm that entropy-based EEG feature extraction combined with MLP classification constitutes a robust, interpretable, and clinically viable approach for the automated diagnosis of autism spectrum disorder. The proposed system contributes meaningfully to the growing body of EEG-based CAD research and represents a promising foundation for the development of objective, accessible, and scalable neurophysiological diagnostic tools in both high- and low-resource clinical contexts.
REFERENCES
Romain Atangana1, 2, 3, Amstrong Emini Me Zenanga3, Aurelle Tchagna Kouanou5, Vivien Loick Beyala Kamgang3,6, Daniel Tchiotsop1, Daniel Gams Massi4, Godpromesse Kenne1, EEG-Based Automated Diagnosis Of Autism Spectrum Disorder Using Discrete Wavelet Transform, Entropy Features And Multilayer Perceptron Neural Network, Int. J. Sci. R. Tech., 2026, 3 (6), 1331-1355. https://doi.org/10.5281/zenodo.20828901
10.5281/zenodo.20828901