Electroencephalography in Schizophrenia

Please note this is a comparison between Version 1 by João Paulo Teixeira and Version 2 by Catherine Yang.

Electroencephalography (EEG) provides a non-invasive tool for studying the temporal and spatial distribution of the brain’s electrical activity. Schizophrenia is a complex and heterogeneous disease whose deficits reflect many overlapping pathological mechanisms distributed across multiple brain regions. Patients with schizophrenia have sensory processing deficits and high-level attention-dependent cognitive deficits. These deficits can be assessed by extracting features from the EEG activity time-locked to stimuli, known as event-related potentials (ERPs). EEG oscillations are considered biomarkers or features of complex states in both healthy individuals and patients with schizophrenia. The oscillatory activity of the EEG in schizophrenia patients indicates abnormal temporal integration and interregional connectivity of brain networks during neurocognitive function. EEG signal analysis can be performed in the time, frequency, and time–frequency domains.

schizophrenia
speech
EEG
ERP

1. EEG Features

This section describes the main features that can be used to diagnose schizophrenia from an EEG signal. The spatial positions of the 64 EEG electrodes are shown in Figure 13. Additional details about the EEG approach can be found in ^[1][90].

Figure 13. Layout of the sixty-four electrodes on a 2D representation of the scalp.

To explain the underlying abnormalities in patients diagnosed with schizophrenia, a multi-set canonical correlation analysis (MCCA) was performed in ^[2][85] to combine functional magnetic resonance imaging (fMRI), EEG, and structural magnetic resonance imaging (sMRI) parameters. In the work of Shim et al. ^[3][80], three sets of parameters were used: sensor-level parameters (124 parameters), source-level parameters (314 parameters), and a combination of both.

Other authors focused on more specific aspects. Bougou et al. ^[4][87] studied connectivity in the delta and theta bands (0.5–8.5 Hz) by applying a fifth-order Butterworth band-pass filter. The authors calculated the following connectivity measures: cross-correlation (COR), magnitude-squared coherence (COH), imaginary part of the coherence (iCOH), phase-locking value (PLV), phase lag index (PLI), ρ index (RHO), transfer entropy (TE), mutual information (MI), Granger causality (GC), partial directed coherence (PDC), and directed transfer function (DTF).

Vittala et al. ^[5][73] used transcranial magnetic stimulation (TMS) combined with EEG to alter and measure the neurophysiological parameters of cortical function, including oscillatory activity, cortical inhibition, connectivity, and synchronization.

Using nonlinear features, including complexity (Cx), Higuchi fractal dimension (HFD), and Lyapunov exponents (Lya), the authors of ^[6][91] raised the accuracy of a schizophrenia classifier to as much as 100%. By decomposing the EEG with a six-level wavelet transform (thus creating seven sub-bands), it is also possible to diagnose subjects with schizophrenia with high accuracy ^[7][82].
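To illustrate why a six-level decomposition yields seven sub-bands, the following minimal sketch applies a discrete wavelet transform six times, keeping one detail band per level plus the final approximation band. The text does not state which wavelet family was used in ^[7][82]; the Haar wavelet, the function names, and the synthetic test signal are assumptions made purely for illustration.

```python
import math

def haar_step(signal):
    """One level of the orthonormal Haar transform: (approximation, detail)."""
    s = 1.0 / math.sqrt(2.0)
    pairs = range(0, len(signal) - 1, 2)
    approx = [(signal[i] + signal[i + 1]) * s for i in pairs]
    detail = [(signal[i] - signal[i + 1]) * s for i in pairs]
    return approx, detail

def wavelet_subbands(signal, levels=6):
    """Return `levels` detail bands followed by one approximation band."""
    if len(signal) % (2 ** levels) != 0:
        raise ValueError("signal length must be a multiple of 2**levels")
    bands = []
    approx = list(signal)
    for _ in range(levels):
        approx, detail = haar_step(approx)
        bands.append(detail)   # D1 (finest scale) ... D6 (coarsest scale)
    bands.append(approx)       # A6: the remaining low-frequency content
    return bands

def band_energy(band):
    """Sum of squared coefficients, a common per-band feature."""
    return sum(v * v for v in band)

# Synthetic 128-sample signal (illustrative only, not EEG data)
signal = [
    math.sin(2 * math.pi * 5 * t / 128) + 0.5 * math.sin(2 * math.pi * 20 * t / 128)
    for t in range(128)
]
bands = wavelet_subbands(signal, levels=6)
energies = [band_energy(b) for b in bands]
```

Because the Haar transform used here is orthonormal, the seven band energies sum to the energy of the original signal, which makes per-band energy a convenient candidate feature.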

Based on the phase space dynamics (PSD) of EEG signals, the PSD shape of the Cz channel (Figure 13) in schizophrenia is reported to be more regular than in healthy people and can be applied as a biomarker. Schizophrenia can also be identified via graphical analysis. The PSD maps EEG signals to a higher-dimensional space, and the features to be used are extracted from up to 19 channels. Generally, the PSD of EEG signals is a suitable technique for discriminating between healthy and schizophrenic groups. Furthermore, the Cz channel is better than other channels at detecting schizophrenia using the PSD of EEG signals ^[8][88].

According to Akbari et al. ^[8][88], the best accuracy (94.8%) is obtained with graphical features extracted from 12 channels, namely, the summation of distances between Heron’s circles (SDHC), the summation of the shortest distance from each point relative to the 45-degree line (SH45), and the summation of the area of the triangles formed by successive points and the coordinate center (TACR). These features were derived from the phase space dynamics (PSD) of EEG signals. First, the PSD of two EEG signals was plotted in Cartesian space, and then graphical features were extracted to evaluate the chaotic behavior of the PSD in healthy and schizophrenic subjects. The PSD of EEG signals can be represented as successive triangles. By averaging the coordinates of the corners of each triangle, the centroid coordinate of the triangle is obtained, and it is the same as the centroid of the corresponding Heron’s circle. The SDHC quantifies the variability of the PSD and can be used to evaluate its complexity. The SH45 measures the width of the PSD shape from the bisector of the first and third trigonometric regions, and the TACR measures the variation rate of the PSD shape of EEG signals ^[8][88].

Baygin et al. ^[9][8] proposed a model for the automatic detection of schizophrenia from EEG based on the Collatz conjecture (a mathematical conjecture that has been used in information security applications). This model generates its own features, is highly accurate, and requires little time to run, achieving 99.47% correct classification. The model comprises three stages. The first stage generates new features with the Collatz conjecture, named the Collatz pattern. Combining the Collatz pattern with the maximum absolute pooling decomposer creates multilevel features (low-level and high-level features). The second stage applies iterative neighborhood components analysis to select the clinically significant features. In the last stage, the selected features are fed to a K-nearest neighbors (KNN) classifier for the automated detection of schizophrenia ^[9][8].

The reported accuracies for identifying schizophrenia from EEG parameters range from 82.36% ^[4][87] to 100% ^[6][91]. Using EEG parameters, the authors of ^[9][8] applied a combination of techniques and KNN classifiers, achieving classification accuracies of 99.47% and 93.58% in two datasets using 19 and 10 channels, respectively. Using machine learning tools such as a support vector machine with a leave-one-out cross-validation training procedure, the authors of ^[6][91] correctly classified 88.24% of the cases. A random forest classifier with the directed transfer function obtained a correct classification of 82.36% in the work of ^[4][87]. The authors of ^[8][88] used KNN and a generalized regression neural network (GRNN) and achieved 94.8% accuracy. The maximum accuracy was obtained with a probabilistic neural network (PNN), reaching 100%.
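The accuracies above depend on both the classifier and the validation procedure. As a minimal sketch of how such a figure can be produced, the code below pairs a K-nearest neighbors classifier with leave-one-out cross-validation. This pairing is chosen only for illustration (the cited studies combine these elements differently), and the feature vectors, labels, and function names are hypothetical.

```python
import math
from collections import Counter

def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training points closest to `query` (Euclidean)."""
    order = sorted(range(len(train_x)), key=lambda i: math.dist(train_x[i], query))
    votes = Counter(train_y[i] for i in order[:k])
    return votes.most_common(1)[0][0]

def loocv_accuracy(features, labels, k=3):
    """Leave-one-out cross-validation: each subject is held out exactly once."""
    correct = 0
    for i in range(len(features)):
        train_x = features[:i] + features[i + 1:]
        train_y = labels[:i] + labels[i + 1:]
        if knn_predict(train_x, train_y, features[i], k) == labels[i]:
            correct += 1
    return correct / len(features)

# Hypothetical two-feature vectors for eight subjects (made-up numbers)
features = [[0.10, 0.20], [0.20, 0.10], [0.15, 0.25], [0.05, 0.10],
            [0.90, 0.80], [0.80, 0.95], [0.85, 0.90], [0.95, 0.85]]
labels = ["HC", "HC", "HC", "HC", "SZ", "SZ", "SZ", "SZ"]
accuracy = loocv_accuracy(features, labels, k=3)
```

With small samples, leave-one-out validation uses nearly all subjects for training in each fold, which is one reason it appears in studies with few participants.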

2. Description of EEG Features

This section describes the previously mentioned EEG features in more detail.

The cross-correlation (COR) measures the similarity of two series as a function of the displacement of one relative to the other. The magnitude-squared coherence (COH) between two signals is based on the cross-spectral density function, which is derived from the FFT of the cross-correlation, normalized by the individual auto-spectral density functions of the two signals. The imaginary part of the coherence (iCOH) is obtained by keeping only the imaginary part of the complex-valued coherence ^[4][87].
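As a minimal sketch of the COR measure, the code below computes a normalized cross-correlation over a range of lags and picks the lag at which two signals are most similar. The normalization, the lag convention, and the synthetic signals are assumptions for illustration, not the implementation used in ^[4][87].

```python
import math

def cross_correlation(x, y, max_lag):
    """Normalized cross-correlation of x and y for lags -max_lag..max_lag.

    A positive lag means y is delayed relative to x.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    dx = [v - mx for v in x]
    dy = [v - my for v in y]
    norm = math.sqrt(sum(v * v for v in dx) * sum(v * v for v in dy))
    result = {}
    for lag in range(-max_lag, max_lag + 1):
        total = 0.0
        for t in range(n):
            if 0 <= t + lag < n:
                total += dx[t] * dy[t + lag]
        result[lag] = total / norm
    return result

# Synthetic example: y is a copy of x delayed by 3 samples
x = [math.sin(2 * math.pi * t / 25) * math.exp(-((t - 40) / 15) ** 2) for t in range(100)]
y = [0.0] * 3 + x[:-3]
cor = cross_correlation(x, y, max_lag=10)
best_lag = max(cor, key=cor.get)   # lag with the highest similarity
```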

The phase-locking value (PLV) characterizes the phase synchronization between two narrow-band signals. The phase lag index (PLI) is a measure of phase synchronization that is zero in the case of linear mixing and nonzero when there is a consistent nonzero phase difference between the two signals. The ρ index (RHO) quantifies the deviation of the cyclic relative phase distribution from the uniform distribution, approximating the probability density by the relative frequencies obtained with histograms of relative phases ^[4][87].
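The difference between PLV and PLI can be made concrete with a short sketch. It assumes the instantaneous phases of two narrow-band signals have already been extracted (for example with a Hilbert transform, which is not shown) and uses commonly cited definitions of the two measures: the length of the mean phase-difference vector for PLV, and the absolute mean sign of the sine of the phase difference for PLI. The function names and phase series are illustrative.

```python
import cmath
import math

def plv(phase_a, phase_b):
    """Length of the mean unit vector of the phase differences (0..1)."""
    diffs = [a - b for a, b in zip(phase_a, phase_b)]
    mean_vec = sum(cmath.exp(1j * d) for d in diffs) / len(diffs)
    return abs(mean_vec)

def pli(phase_a, phase_b):
    """Absolute mean sign of sin(phase difference) (0..1)."""
    def sign(v):
        return (v > 0) - (v < 0)
    diffs = [a - b for a, b in zip(phase_a, phase_b)]
    return abs(sum(sign(math.sin(d)) for d in diffs) / len(diffs))

# Illustrative phase series (radians)
base = [0.1 * t for t in range(200)]
lagged = [p - math.pi / 4 for p in base]   # constant nonzero phase lag

plv_lagged, pli_lagged = plv(base, lagged), pli(base, lagged)
plv_zero, pli_zero = plv(base, base), pli(base, base)   # zero-lag coupling
```

In this sketch, zero-lag coupling (as produced by linear mixing of a common source) gives a PLV of 1 but a PLI of 0, whereas a constant nonzero lag drives both measures to 1.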

The transfer entropy (TE) measures the time-asymmetric transfer of information between two processes. Mutual information (MI) quantifies the amount of information that can be obtained about a random variable by observing another. Under Granger causality (GC), for two simultaneously measured signals, the second signal is said to be causal to the first if the first signal can be predicted better by incorporating past information from the second signal than by using information from the first signal alone. The directed transfer function (DTF) is similar to Granger causality but uses the elements of a different transfer matrix ^[4][87].
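Mutual information is often estimated from histograms of the two signals. The sketch below is one such estimate for sequences that have already been discretized, together with a simple equal-width quantizer. The binning scheme, the function names, and the toy sequences are assumptions for illustration rather than the estimator used in ^[4][87].

```python
import math
from collections import Counter

def mutual_information(xs, ys):
    """Mutual information (in bits) between two equally long symbol sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), count in pxy.items():
        p_joint = count / n
        mi += p_joint * math.log2(p_joint / ((px[x] / n) * (py[y] / n)))
    return mi

def quantize(signal, bins=4):
    """Map a real-valued signal to integer labels of equal-width bins."""
    lo, hi = min(signal), max(signal)
    width = (hi - lo) / bins or 1.0   # avoid a zero width for constant signals
    return [min(int((v - lo) / width), bins - 1) for v in signal]

# Toy sequences: identical signals share all their information,
# independent signals share none
a = [0, 1] * 50
mi_identical = mutual_information(a, a)
mi_independent = mutual_information([0, 0, 1, 1] * 25, [0, 1, 0, 1] * 25)
```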

To compute the complexity (Cx), the numerical signal is transformed into a symbolic sequence, which is then decomposed into distinct words; the length of the resulting encoding is L(n) ^[6][91]. This feature is defined by Equation (1):

$$C_x = \frac{L(n)}{n} \tag{1}$$

The Higuchi fractal dimension (HFD) measures the self-similarity and irregularity of a time series. This feature is estimated from the slope of the linear fit over the log–log plot of the curve length against the scale of the time series. Its values range between 1 and 2. The Lyapunov exponents (Lya) describe the average rate at which the initial distance between two neighboring points in the phase space grows ^[6][91]. This feature can be calculated using Equation (2):

$$\frac{\|\delta X_i(t)\|}{\|\delta X_i(0)\|} = 2^{\lambda_i t} \quad (t \to \infty), \qquad \lambda_i = \lim_{t \to \infty} \frac{1}{t} \log_2 \frac{\|\delta X_i(t)\|}{\|\delta X_i(0)\|} \tag{2}$$

where $\|\delta X_i(0)\|$ is the distance between the two points at time 0 and $\|\delta X_i(t)\|$ is the distance between them at time t.
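The two nonlinear features described above can be sketched in a few lines. The first function follows the usual Higuchi procedure (curve lengths at several scales k, then the slope of a log–log fit); the second evaluates the finite-time form of Equation (2) from two measured distances. The parameter `k_max`, the function names, and the test signals are assumptions for illustration, not the settings used in ^[6][91].

```python
import math
import random

def higuchi_fd(x, k_max=8):
    """Higuchi fractal dimension: slope of log L(k) versus log(1/k).

    Assumes a non-constant signal (a constant one has zero curve length).
    """
    n = len(x)
    log_inv_k, log_len = [], []
    for k in range(1, k_max + 1):
        lengths = []
        for m in range(k):                      # start offsets 0..k-1
            steps = (n - 1 - m) // k            # number of increments of size k
            if steps < 1:
                continue
            dist = sum(abs(x[m + i * k] - x[m + (i - 1) * k])
                       for i in range(1, steps + 1))
            lengths.append(dist * (n - 1) / (steps * k) / k)  # normalized length
        log_inv_k.append(math.log(1.0 / k))
        log_len.append(math.log(sum(lengths) / len(lengths)))
    # Least-squares slope of log_len against log_inv_k
    mean_u = sum(log_inv_k) / len(log_inv_k)
    mean_v = sum(log_len) / len(log_len)
    num = sum((u - mean_u) * (v - mean_v) for u, v in zip(log_inv_k, log_len))
    den = sum((u - mean_u) ** 2 for u in log_inv_k)
    return num / den

def lyapunov_from_distances(d0, dt, t):
    """Finite-time estimate from Equation (2): (1/t) * log2(d(t) / d(0))."""
    return math.log2(dt / d0) / t

# A straight line is perfectly regular; white noise is maximally irregular
hfd_line = higuchi_fd([float(i) for i in range(500)])
random.seed(0)
hfd_noise = higuchi_fd([random.gauss(0.0, 1.0) for _ in range(2000)])

# Toy system x -> 2x: the distance between two trajectories doubles each step
a, b = 0.1, 0.1 + 1e-6
d0 = abs(b - a)
for _ in range(10):
    a, b = 2 * a, 2 * b
lam = lyapunov_from_distances(d0, abs(b - a), 10)
```

The straight line and the white noise illustrate the two ends of the 1 to 2 range of the HFD, and the doubling map gives an exponent of one bit per step because the base-2 logarithm is used.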

The phase space dynamics (PSD) of EEG signals can be represented as successive triangles with corners $(a_i, a_{i+1})$, $(a_{i+1}, a_{i+2})$, and $(a_{i+2}, a_{i+3})$. Averaging the coordinates of the corners of each triangle gives the centroid coordinate of the triangle, which is the same as that of the corresponding Heron’s circle. The summation of distances between Heron’s circles (SDHC) ^[8][88] is computed as a graphical feature and is defined by Equation (3):

$$\mathrm{SDHC} = \sum_{i=1}^{m-4} \sqrt{\left(\frac{a_{i+1} + a_{i+2} + a_{i+3}}{3} - \frac{a_i + a_{i+1} + a_{i+2}}{3}\right)^2 + \left(\frac{a_{i+2} + a_{i+3} + a_{i+4}}{3} - \frac{a_{i+1} + a_{i+2} + a_{i+3}}{3}\right)^2} \tag{3}$$

The summation of the shortest distance from each point relative to the 45-degree line (SH45) quantifies the data scatter rate around the 45-degree line. The SH45 measures the width of the PSD shape from the bisector of the first and third trigonometric regions, that is, the line y = x ^[8][88]. It can be described by Equation (4):

$$\mathrm{SH45} = \sum_{i=1}^{m-1} \frac{|a_{i+1} - a_i|}{\sqrt{2}} \tag{4}$$

The summation of the area of the triangles making successive points and the coordinate center (TACR) measures the variation rate of the PSD shape of the EEG signal ^[8][88]. The TACR is defined by Equation (5):

$$\mathrm{TACR} = \sum_{i=1}^{m-2} \left| \det \begin{bmatrix} 0 & a_i & a_{i+1} \\ 0 & a_{i+1} & a_{i+2} \\ 1 & 1 & 1 \end{bmatrix} \right| \tag{5}$$
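The three graphical features can be sketched directly from their definitions, treating each pair of successive samples $(a_i, a_{i+1})$ as one point of the phase space plot. The function names and the test series are illustrative. The TACR function uses the absolute determinant as written in Equation (5), which expands to $|a_i a_{i+2} - a_{i+1}^2|$; the geometric area of each triangle is half of that value.

```python
import math

def sdhc(a):
    """Equation (3): summed distance between centroids of successive triangles."""
    total = 0.0
    for i in range(len(a) - 4):
        cx1 = (a[i] + a[i + 1] + a[i + 2]) / 3
        cy1 = (a[i + 1] + a[i + 2] + a[i + 3]) / 3
        cx2 = cy1                                   # x-centroid of the next triangle
        cy2 = (a[i + 2] + a[i + 3] + a[i + 4]) / 3
        total += math.hypot(cx2 - cx1, cy2 - cy1)
    return total

def sh45(a):
    """Equation (4): summed distance of each point (a_i, a_{i+1}) from y = x."""
    return sum(abs(a[i + 1] - a[i]) / math.sqrt(2) for i in range(len(a) - 1))

def tacr(a):
    """Equation (5): summed |det| for the origin and two successive points."""
    return sum(abs(a[i] * a[i + 2] - a[i + 1] ** 2) for i in range(len(a) - 2))

# Illustrative series (not EEG data): a ramp and a constant signal
ramp = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
flat = [2.0] * 6
features_ramp = (sdhc(ramp), sh45(ramp), tacr(ramp))
features_flat = (sdhc(flat), sh45(flat), tacr(flat))
```

For the constant signal, every point of the phase space plot falls on the same spot of the line y = x, so all three features are zero.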

3. ERP Biomarkers in Schizophrenia

EEG activity that is time-locked to stimuli is referred to as event-related potentials (ERPs). ERPs are commonly used to capture neural activity related to sensory processes and consist of the averaged neural activity upon the repeated presentation of the same stimulus ^[10][92].

While the brain’s response to stimuli is being recorded, participants may produce spontaneous and involuntary neural activity at any moment. The recorded signal is highly sensitive to the subject’s attention, the presence of motor acts, and inner thoughts, which introduce random segments of activity that might even overshadow the targeted response. In contrast, the neural activity evoked by a particular stimulus will always be present at the moment of its presentation. Consequently, averaging the activity across trials with the same stimulus filters out the spontaneous and variable activity, whereas the signal phase-locked to stimulus onset becomes evident ^[11][83].
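The effect of trial averaging can be reproduced with a small simulation. In the sketch below, every trial contains the same evoked waveform plus independent background activity, modeled here as Gaussian noise. The waveform shape, the noise level, and the number of trials are made-up values chosen only to make the effect visible.

```python
import math
import random

def average_trials(trials):
    """Point-by-point average across trials (the ERP estimate)."""
    n_trials = len(trials)
    return [sum(trial[t] for trial in trials) / n_trials
            for t in range(len(trials[0]))]

def rms_error(signal, reference):
    """Root-mean-square difference between a signal and a reference."""
    return math.sqrt(sum((s - r) ** 2 for s, r in zip(signal, reference))
                     / len(reference))

random.seed(1)
n_samples, n_trials = 200, 100

# Hypothetical evoked response: one bump, identical in every trial
evoked = [math.exp(-((t - 60) / 10) ** 2) for t in range(n_samples)]

# Each trial = evoked response + independent background activity
trials = [[e + random.gauss(0.0, 2.0) for e in evoked] for _ in range(n_trials)]

erp = average_trials(trials)
single_trial_error = rms_error(trials[0], evoked)
averaged_error = rms_error(erp, evoked)
```

Because the background activity is independent across trials, its contribution to the average shrinks roughly with the square root of the number of trials, while the time-locked response is preserved.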

ERPs are widely used in EEG analysis since they are relatively simple to compute. Furthermore, classifiers are rarely used to discriminate between groups; more straightforward statistical methods, such as Analysis of Variance (ANOVA), are often sufficient and constitute the majority of the methods used for this type of EEG analysis. Consequently, the findings presented here on the use of ERPs to diagnose schizophrenia are based on the statistically significant differences in ERP latency and amplitude that the cited studies found between healthy subjects and patients diagnosed with schizophrenia.
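As a minimal sketch of such a group comparison, the function below computes the F statistic of a one-way ANOVA, that is, the ratio of between-group to within-group variance. The amplitude values are made-up numbers, and converting F into a p-value (which requires the F distribution) is left out.

```python
def one_way_anova_f(groups):
    """F statistic of a one-way ANOVA for a list of groups of measurements."""
    all_values = [v for g in groups for v in g]
    grand_mean = sum(all_values) / len(all_values)
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((v - m) ** 2 for g, m in zip(groups, means) for v in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)

# Hypothetical ERP peak amplitudes in microvolts (made-up numbers)
controls = [9.0, 10.0, 11.0]
patients = [6.0, 7.0, 8.0]
f_stat = one_way_anova_f([controls, patients])
```

A larger F indicates that the difference between the group means is large relative to the variability within each group.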

In many cases, an auditory task (hearing a beep) is used to measure cognitive decline in schizophrenia, which manifests as altered ERP waveforms and reduced activity in specific cortical regions ^[12][77]. Figure 24 represents the main ERP biomarkers.

Figure 24. The idealized auditory ERP response recorded at the head’s vertex. The y-axis is inverted, so positive peaks point downwards and negative peaks point upwards.

Mismatch negativity (MMN) is a component of the ERP, or of the event-related magnetic field (ERMF), that occurs in response to unexpected and rare stimuli in the surrounding environment. It is considered an important parameter for neuropsychiatric disorders and for schizophrenia in particular ^[13][93].

The N1 component consists of a negative deflection at about 100 ms ^[14][76]. It is evident when an unexpected stimulus is presented ^[15][79]. A reduced N1 amplitude during vowel vocalization compared to passive listening, and during directed inner speech compared to a silent condition, is seen in controls but not in patients ^[16][17][94,95].

Of the various ERPs (Figure 24), the components P50 (or Pa), N1, MMN, and P3 have received the most attention, as they are reliably impaired in schizophrenia and are, therefore, considered the most promising biomarker data ^[1][90].

The P50 (or Pa) is the earliest auditory ERP component and the smallest in amplitude, generally reaching a positive peak between 40 and 75 ms ^[18][96]. It is used to measure sensory gating with a conditioning-testing paradigm that involves the repeated presentation of a pair of auditory stimuli, S1 (conditioning) and S2 (test). The increased amplitude ratio (S2/S1) in patients is well established in the literature ^[19][97] and is related to their inability to filter the incoming flow of information and protect the brain from information overload ^[20][98]. Although its association with neuropsychological processes is still ambiguous ^[21][99], the P50 ratio and the S2 amplitude have been linked to performance and attention ^[22][100]. In addition, P50 suppression impairment seems to be present in the risk phase, the prodromal phase, and the first episode ^[23][101].
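The S2/S1 ratio can be sketched as follows: find the largest positive peak in the 40–75 ms window of each averaged response and divide the S2 peak by the S1 peak. The waveforms, the sampling grid, and the function names below are synthetic and purely illustrative, and no diagnostic threshold is implied.

```python
import math

def peak_in_window(times_ms, amplitudes, start_ms, end_ms):
    """Largest amplitude between start_ms and end_ms (inclusive)."""
    window = [a for t, a in zip(times_ms, amplitudes) if start_ms <= t <= end_ms]
    if not window:
        raise ValueError("no samples inside the requested window")
    return max(window)

def p50_ratio(times_ms, erp_s1, erp_s2, window=(40, 75)):
    """S2/S1 amplitude ratio of the P50 peak."""
    s1 = peak_in_window(times_ms, erp_s1, *window)
    s2 = peak_in_window(times_ms, erp_s2, *window)
    return s2 / s1

# Synthetic averaged responses sampled every 5 ms (made-up amplitudes)
times_ms = list(range(0, 200, 5))
bump = [math.exp(-((t - 55) / 8) ** 2) for t in times_ms]
erp_s1 = [4.0 * b for b in bump]             # response to the first stimulus
erp_s2_suppressed = [1.0 * b for b in bump]  # strongly reduced S2 response
erp_s2_weak = [3.2 * b for b in bump]        # S2 response barely reduced

ratio_suppressed = p50_ratio(times_ms, erp_s1, erp_s2_suppressed)
ratio_weak = p50_ratio(times_ms, erp_s1, erp_s2_weak)
```

A ratio close to 1 means the response to the second stimulus is barely suppressed, which corresponds to the increased ratio described above.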

The P3 component reflects information processing associated with attention and memory mechanisms ^[1][90]. For auditory stimuli, it consists of a positive peak deflection at 250–400 ms in adulthood ^[15][79]. Still, its latency and amplitude vary significantly depending on biological factors (e.g., genetics, intelligence, age, and smoking status, among others) ^[1][90]. A P3 component is triggered during oddball tasks in which multiple stimuli are presented and one of them occurs infrequently. A fair amount of research has reported a P3 amplitude deficit in schizophrenia, especially when the component is evoked by auditory stimuli ^[24][78]. This is considered the most consistent and robust finding in schizophrenia ^[24][25][26][78,102,103].

Mismatch Negativity (MMN), typically generated 100 to 250 ms after stimulus onset, can be used as an objective index of sound discrimination accuracy and auditory sensory memory ^[1][90]. It is generated by the brain’s automatic response to any change in auditory stimulation that exceeds a specific threshold, roughly corresponding to the behavioral discrimination threshold ^[1][90]. Impoverished MMN production, reflected in attenuated amplitudes, is also a consistent finding in schizophrenia ^[27][104]. Interest is growing in studying MMN impairment with more complex paradigms (e.g., multiple sensory dimensions, complex sounds, and changes in stimulation patterns). These complex paradigms activate more complex brain regions as opposed to simpler deviations (e.g., pitch, duration, and intensity) that activate lower levels of the auditory system ^[28][105].

Although both ERPs share common mechanisms, MMN and P3 most likely portray different dysfunctions in schizophrenia.

Recent studies ^[28][105] indicate that MMN deficits generated during auditory tasks account for 18.7% of the variance in P3 deficits when both are examined. This suggests that the high-level attention-dependent cognitive deficits central to schizophrenia do not originate from potentially preceding impairments at lower sensory, perceptual, or cognitive processing levels ^[24][78].

Some studies used temporal, demographic, and time–frequency features of EEG. Zhang et al. ^[10][92] employed the temporal features N1, N1TD, P2, and P2TD (TD is time duration) and an EEG baseline, as well as demographic features (education and age) and time–frequency features (power spectrum). These features were taken from an EEG-ERP with 9 electrodes: Fz, FCz, Cz, FC3, FC4, C3, C4, CP3, and CP4 (see Figure 13).

Kim et al. ^[29][84] used microstate and conventional EEG features extracted from five regions of interest (ROI): left anterior (Fp1, F7, and F3), right anterior (Fp2, F4, and F8), left posterior (T7, C3, P7, P3, and O1), right posterior (C4, T8, P4, P8, and O2), and central (Fz, Cz, and Pz) (see Figure 13). However, ERPs allow healthy subjects and patients with schizophrenia to be discriminated based on the P3, MMN, or N1 biomarkers and on resting-state signal complexity. Statistical measures and oscillatory power have also been used successfully ^[30][106].