Cortical Classification with Rhythm Entropy for Error Processing in Cocktail Party Environment Based on Scalp EEG Recording

Tian, Yin; Xu, Wei; Yang, Li

doi:10.1038/s41598-018-24535-4

Download PDF

Article
Open access
Published: 17 April 2018

Cortical Classification with Rhythm Entropy for Error Processing in Cocktail Party Environment Based on Scalp EEG Recording

Yin Tian¹^na1,
Wei Xu¹^na1 &
Li Yang¹

Scientific Reports volume 8, Article number: 6070 (2018) Cite this article

1445 Accesses
10 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Using single-trial cortical signals calculated by weighted minimum norm solution estimation (WMNE), the present study explored a feature extraction method based on rhythm entropy to classify the scalp electroencephalography (EEG) signals of error response from that of correct response during performing auditory-track tasks in cocktail party environment. The classification rate achieved 89.7% with single-trial (≈700 ms) when using support vector machine(SVM) with the leave-one-out-cross-validation (LOOCV). And high discriminative regions mainly distributed at the medial frontal cortex (MFC), the left supplementary motor area (lSMA) and the right supplementary motor area (rSMA). The mean entropy value for error trials was significantly lower than that for correct trials in the discriminative cortices. By time-varying network analysis, different information flows changed among these discriminative regions with time, i.e. error processing showed a left-bias information flow, and correct processing presented a right-bias information flow. These findings revealed that the rhythm information based on single cortical signals could be well used to describe characteristics of error-related EEG signals and further provided a novel application about auditory attention for brain computer interfaces (BCIs).

Mobile BCI dataset of scalp- and ear-EEGs with ERP and SSVEP paradigms while standing, walking, and running

Article Open access 20 December 2021

Classification of mental workload using brain connectivity and machine learning on electroencephalogram data

Article Open access 21 April 2024

Target of selective auditory attention can be robustly followed with MEG

Article Open access 06 July 2023

Introduction

In everyday life, the flood of sensory information were regulated by attention system into a manageable stream, and attention orienting played a primary role in complex visual environment by finding relevant information and filtering out irrelevant information to bias the target selection and processing¹. Typically, two mechanisms were thought to be included in the process: endogenous orienting (goal-driven, top-down), directed the attention to the information related locations in space, and exogenous orienting (stimulus-driven, bottom-up), reflexively triggered by prominent and behaviorally relevant stimuli². Classic research on attention orienting was involved by the analysis of the cocktail party phenomenon coined by Cherry in 1953³.

The cocktail party effect was the phenomenon that people can focus their auditory attention on a stimulus while filtering out other stimuli, similar with a partygoer being able to concentrate on a single conversation in a noisy room, namely, the process reflected the influence of top-down attention. It might also describe a similar phenomenon that occurs when one can immediately detect words of importance originating from unattended stimuli, for instance hearing one’s name in another conversation, which referred to the bottom-up controlled attention^4,5,6.

A lot of researches was conducted to investigate dynamic changes in cortical activity during tracking the dynamic speech stimulus^4,5, and the findings suggested that attentional orienting modulated the neural responses to one of speakers’ voices. If a listener successfully tracked one speaker in a multi-speakers’ environment, the neural responses showed highly correlated with the attended speaker^4,7,8,9. And the neural generator of this effect was localized in the left hemisphere¹⁰. However, when people were absent-minded, they often failed to keep track of the goals that needed to be noticed; that is, the wrong execution was mostly due to the lack of attention to the target stimulus.

Over the last decade, a lot of researches focused on the theories of attention orienting¹¹. In contrast, little is known about the connection of attention with BCIs. Several recent studies have investigated the impact of attention on BCIs. A typical example was to utilize attentional modulation of steady-state visual evoked potential (SSVEP) to implement an online BCI system, and the results showed the SSVEP amplitude could be enhanced by attention, thus improving the speed and accuracy of the BCI system¹². Another work researched SSVEP under strong attention and poor attention of flash conditions, the results indicated that the SSVEP was modulated by attention and the effect of modulations was related to the frequency of flash stimulation¹³. Further, the simultaneously presented tactile and visual stimuli were used to investigate the influence of attention shift on SSVEP. The significant attention switching was observed within both types of stimuli and between different stimuli. Similarly, the unattended experiments were also investigated in the BCI studies. For instance, the error-related negative (ERN) potential reflected an error-monitoring process of the brain and could be detected in scalp EEG recordings^14,15. The ERN arose after an erroneous response and the maximum peak was localized at the medial frontal regions^14,16. Recently, researchers found that the ERN potential was used in BCI for adjusting command outputs of BCI systems when subjects observed incorrect outputs from BCI systems, thus facilitating the development of BCI systems with improved accuracy¹⁷.

Although BCI studies regarding attention have been conducted, some shortcomings still existed. Firstly, methods, which were used to select feature from the EEG signals, were not related to the cognitive functions. For example, information entropy described the generating rate of new information of nonlinear dynamical systems and it has been shown as an effective measure to select EEG signals features¹⁸. However, information entropy ignored the association between EEG activities and subjects’ cognitive states. Consequently, a similar order state of EEG signal sequence could be found among different cognitive states, which hampered the applications in clinical areas¹⁹. Secondly, the BCI performance was limited by the measurement manners of EEG. Compared with the intracranial EEG, which was directly recorded from the cortex surface, the scalp EEG could be easily affected by the effect of volume conduction and reference electrodes^20,21,22, causing an imprecise measurement of physiological significance. Although the intracranial EEG described more precise temporal and spatial information than that of the scalp EEG, it was invasive and only feasible for a very limited number of subjects^21,23. Thirdly, the regional EEG parameters could not adequately reflect the cognitive process. Multiple brain regions were involved in the cognitive processing and reflected by the EEG activities²⁴. Recently, network analysis methods have attracted wide-spread attention in neuroscience and it proved to be an efficient way to measure the connections between regions in the cognitive functions^25,26.

In the present study, the cocktail party experiment paradigm, which was closer to the real environment, was used to research the error-related attention. In order to overcome the drawbacks of previous studies on the scalp EEG, the weight minimum norm (WMN) method was used to estimate cortical activities with single-trial. Then, we adopted a feature extraction method, rhythm entropy, to classify the error-related auditory processing from correct auditory processing based on single cortical signals. Rhythm entropy (RhEn) was developed by combining the information entropy with the power of EEG rhythm, which was an important feature for cognitive research based on spontaneous EEG²⁷. Finally, adaptive directed transfer function (ADTF), one of the most frequently used methods for assessing the dynamic causality relationship among various brain regions^28,29, was applied to calculate the time-varying connectivity patterns in the different conditions.

Result

Reaction Time (RT)

As shown in Fig. 1, mean RTs for correct response were shorter than those for error response (paired t-test: t = −7.2, p < 0.05, d = −0.66).

Brain regions with high discriminative power

For visual representation, the cortical spatial distribution was reconstructed according to the R² value on each dipole. Three brain regions, i.e. medial frontal cortex (MFC), left supplementary motor area (lSMA) and right supplementary motor area (rSMA), exhibited greater correlation than others (Fig. 2A and Table 1), got high discriminative power for identifying error response trials from correct response trials during tracking the cued speaker.

Table 1 Brain regions with high discriminative power.

Full size table

Classification accuracy

The SVM with LOOCV achieved an average accuracy of 89.7%(SD ± 3.6%). A good generalization performance of SVM classifier were observed (Table 2 and Fig. 2B), i.e. SP: 89.6% ± 4.9%, SE: 89.8% ± 5.2%, AUC: 94.0% ± 5.2%.

Table 2 Classification results of SVM classifier.

Full size table

Relationship between RT and rhythm entropy (RhEn)

RhEn of the MFC induced by error processing (i.e. error trials) was significant positively correlated with RT. RhEn of the lSMA and rSMA induced by error processing were non-significantly correlated with RT, respectively (Fig. 3A). No correlations were observed during correct response between RT and RhEn of each discriminative brain region (Fig. 3B). More detailed information was also shown in Table 3.

Table 3 Correlation Analysis between RT and entropy and Pair t-test between error response and correct response in discriminative regions.

Full size table

Information flow between the discriminative regions

To investigate time characteristics of two response types, i.e. correct and error, time-varying networks were conducted. Here, significant discriminative regions, i.e. the medial frontal cortex (MFC), left supplementary motor area (lSMA) and right supplementary motor area (rSMA), were utilized to serve as network nodes. For time series of networks was estimated by cortical signals within three discriminative regions, which calculated by the averaged scalp ERPs (Fig. 4). Results of the time-varying network analysis were illustrated in Fig. 5A and B.

The error time-varying networks showed the left-bias information flows between the lSMA and the MFC (i.e. lSMA → MFC and MFC → lSMA, Fig. 5A). Then, right information flow between rSMA and MFC appeared, while the correct time-varying networks showed the right-bias information flow between the rSMA and the MFC (Fig. 5B).

Relationship between RT and information flow

Information flows between the network nodes changed with time varying. During error processing, the weaker DTF values of information flow between the medial frontal cortex and the left supplementary motor area, i.e. MFC → lSMA and lSMA → MFC, of time-varying networks at 430 ms were related to longer error RT (Fig. 5C), respectively. The pure flow between the MFC and the lSMA showed an information flow from MFC to lSMA, positively being correlated with error RTs. During correct processing, the stronger DTF values from the MFC to the right SMA, i.e. MFC → rSMA and rSMA → MFC, of time-varying networks at 200 ms were related to longer correct RT (Fig. 5D), respectively. The pure flow between the MFC and rSMA showed an information flow from MFC to rSMA, negatively being correlated with correct RTs.

Disscussion

Based on the scalp EEG recording, the present study utilized the RhEn of cortical signal with single-trial, calculated by weighted minimum norm solution estimation (WMNE), to identify two response types (correct vs. error) differences at the cortical level during performing the auditory-tracked tasks in multi-speakers’ environment. We found that: 1) An averaged accuracy achieved 89.7% and discriminative cortex differences mainly distributed in the medial frontal cortex (MFC), the left SMA, and the right SMA; 2) The mean RhEn for error trials was significantly lower than that for correct trials in the discriminative cortices. In addition, the larger RhEn for error trials was related to longer RT. 3) Time-varying networks analysis based on discriminative regions and averaged cortical source waveforms further revealed that error-related networks represented the left-bias information flow and the correct-related networks represented the right bias information flow.

RhEn of discriminative cortices and RT

Our classified method successfully extracted reliable differences between correct and error response with the mean classification rate of 89.7%(with SD: ±3.6%), which was superior to previous related works^5,7,30,31 that this method was non-invasive cortical dynamical signal with single trials of short-time duration (~700 ms EEG data).

Previous study found that the medial frontal cortex played a crucial role in producing error-related potential^4,32,33,34. Activation of the MFC reflected error-related processing³⁵. As shown in Fig. 2A, we also found that the discriminative cortical regions between correct processing and error processing being obtained via R² values, i.e. the relationship between rhythm entropies and class tags, and SVM mainly focused on the medial frontal cortex, revealing that the MFC was an important brain area in monitoring function between the correct and error processing in multi-speaker environment.

Entropy was a powerful tool to quantify complexity in nonlinear dynamics of neural activities. The irregularity and unpredictability of brain activity induced by attentional selection were regarded as neural complexity related to brain functions and information processing between neurons³⁶. Previous studies have found that the lower entropy meant worse behavior performance^19,37. We also found that RT for error trials significantly longer than that for correct trials (Fig. 1), indicating that increased uncertainty induced by unsuccessfully focusing attention to the cued speaker. And the mean RhEn significantly lower for error trials in MFC than that for correct trials (Fig. 3C and Table 3), suggesting that if a listener successful focused attention on a cued speaker with correct trials, the integration of segregated neuronal groups and incoming stimuli with ongoing performance induced high complexity of cortical dynamics. While if a listener did not attend a cued speaker (i.e. error trials), the decoupling and isolation of the underlying system from external factors may lead to the lower RhEn values, consistent with the previous theory confirmed by representational mathematical models^36,38.

In addition, the larger RhEn values following with longer RT in error trials (Fig. 3A and Table 3) represented the increased irregularity and unpredictability, while there existed a trend that smaller RhEn values were related to the longer RT in correct trials (Fig. 3B and Table 3) indicated that the decreased coupling between internal system and external factors may result in the low complexity. Therefore, the lower RhEn values at the MFC in error trials during auditory processing suggested that the decoupling and segregation between the MFC and the dorsolateral prefrontal cortex, i.e. the left and the right SMA were involved in abnormal attentional control and consequently in wrong cognitive performance as well.

Previous studied found that the posterior central gyrus might be involved in the generation of processes that activated the error related potential³⁹ and left dorsal lateral frontal cortex was selectively active during error trials⁴⁰. Our findings suggested that the contribution of the left SMA and MFC to high classification rate may be related to wrong responses. And the right SMA may mainly be relevant to correct response types due to previous findings that activation was observed in the right SMA in correct trials^35,41.The right-bias information flow may provide an evidence to support the idea during correct response (Fig. 5B).

Information flow via time-varying networks

As described above, converging evidence revealed that left SMA was closely correlated with error related potential^39,40. Our results of time-varying networks displayed information flow between the MFC and the left SMA in error trials (Fig. 5A). Previous findings suggested that the error-processing system consisted of a monitoring system for detecting errors and an optimized behavior compensation system⁴². When perceiving an error existing because of failure to attend the cued speaker, the left SMA firstly sent information to the MFC, and at the same time, a feedback was received from the MFC. In the process of error handing, the MFC acted as a filter to match stimuli (i.e. error or correct feedback) and reactions⁴³, and then information was sent to the right SMA. In correct trials, a right-bias information flow between the MFC and the right SMA was existed (Fig. 5B). During the period from 320 ms to 540 ms, similar time-varying network connectivity patterns were observed in both correct and error trials, implying the underlying coordination of the MFC and bilateral SMA to control cognitive performance.

Moreover, during the error time-varying network at 430 ms (Fig. 5C), the smaller ADTF values of both MFC → lSMA and lSMA → MFC were related to the longer RT (Fig. 5C, left and middle panels). The pure flow showed MFC → lSMA was positively correlated with RT (Fig. 5C, right panel). During the correct time-varying network at 200 ms (Fig. 5D), the bigger ADTF values of both MFC → rSMA and rSMA → MFC were related to the longer RT (Fig. 5D, left and middle panels). The pure flow showed MFC → rSMA was negatively correlated with RT (Fig. 5D, right panel). These findings suggested that the coupling with MFC and lSMA was weaken leading to increased RT in error trials. And the coupling with MFC and rSMA was strengthened resulting in shorter RT in correct trials.

BCI Application

In the present study, cortical activities could provide more precise spatial physiology information compared to scalp EEG²¹; Compared to traditional entropy methods such as approximate entropy, RhEn has an ability to incorporate individual’s cognitive state²⁷. As shown in Table 2 and Fig. 2B, a short time duration (~700 ms) was enough to distinguish individual’s error states using the present method, which allowed the possibility of near real-time EEG processing. Using time-varying network analysis, time-varying network at 430 ms was related to error processing in wrong response and time-varying network at 200 ms was related to correct processing in right response (Fig. 5), which both were earlier than individual’s RT. These findings suggested a possible role for improving BCIs performance in the future. It was noted that for test samples, the time cost of one trial calculation was about 3 s under the MATLAB platform. Here, the most time-consuming part was the minimum norm solution estimation which taken about 0.9 s for each calculation. In the future BCI application, the efficiency of operation could be improved by using faster programming language and optimized algorithms.

Conclusion

The present study was the first to utilize a feature extraction method, rhythm entropy coming from single cortical EEG signals with short-time duration(~700 ms) which were converted via WMNE from scalp EEG recordings during auditory processing in multi-speakers’ environment, to investigate error processing. Three brain areas, i.e. MFC, left SMA and right SMA, got high classification rate reflecting cortical discriminative source distribution between error processing and correct processing. Time-varying networks further revealed that information flows changed between these brain areas with time, i.e. time-varying network at 430 ms was related to error processing in wrong trials and at 200 ms was related to correct processing in right trials. Taken together, these findings suggested that the reduced cognitive performance on auditory error response was associated with impaired cortical information processing, as indicated by the lower complexity of the EEG.

Material and Methods

Participants

Twenty subjects (mean ± standard deviation (SD) age, 22 ± 3.5 years; all males; right-handed) took part in the experiment. None of them reported any history of hearing impairment or neurological problems. Informed consent was signed prior to the study, and subjects also received a monetary compensation after experiments. All experiments were approved by the ethical committee of Chongqing university of Posts and Telecommunications. All experimental methods were conducted in accordance with the ethical guidelines determined by the National Ministry of Health, Labour and Welfare and the Declaration of Helsinki (BMJ 1991; 302:1194).

Stimuli and Design

The experiment design was similar with the previous study⁴. A sentence contained in the form [ready “call sign” go to “color” “number” point now]. For example, ready “skylark” go to “blue” “four” point now. Here, 60 unique sentences were combined by two call signs (sparrow or skylark), three colors (red, blue or green) and three numbers (two, five or seven). All sentences were read by using Chinese.

Before the experiment, subjects firstly listened to each of speakers alone and were able to report the color and number with at least 100% accuracy. In the experiment, a fixation cross (0.5° × 0.5°) at the center of the monitor were displayed throughout the entire block of trials. Each trial began with the fixation cross flashing for 50 ms. After a 700 ms delay, a cue was presented for 50 ms. The cue was defined as a call sign that tracked by listeners. After a short (100–300 ms) SOA, two different sentences spoken by a male and a female speaker were simultaneous presented about 2 s: one to the left ear, and the other to the right ear. Subjects were required to attend to one sentence, which the call sign was cued, and responded to the point where the call sign bird would go. The point (i.e. color-number combination) was fixed and shown visually on a monitor during each trial block.

Scalp EEG recording and preprocessing

EEG was recorded using a 64-channel NeuroScan system (Quik-Cap, band pass: 0.05–100 Hz, sampling rate: 1000 Hz, impedances <5kΩ) at the scalp. The EEG analysis procedure was shown in Fig. 6. To monitor ocular movements and eye blinks, EOG signals were simultaneously recorded from four surface electrodes, one pair placed over the higher and lower eyelid and the other pair placed 1 cm lateral to the outer corner of the left and right orbit. Cz was used as the reference during recording online. Then, the EEG recordings were divided into epochs (200 ms pre- to 4500 ms post-stimulus onset). Trials with blinks and eye movement were rejected offline and an artifact criterion of ±75 μV was used at all the other scalp sites to reject trials with excessive electromyography (EMGs) or other noise transients. EEG recordings were filtered with a band-pass of 0.1–30 Hz. The data were re-referenced by reference electrode standardization technique⁴⁴ (REST, www.neuro.uestc.edu.cn/rest) (Fig. 6A). EEG epoching was then extracted for the time period beginning auditory stimulus offset and lasting until 700 ms after auditory stimulus offset, and performed the next analysis (Fig. 6B). Then, single-trial EEG epochs were sorted according to response types, i.e. correct-related and error-related, and were averaged from each subject to compute the ERPs (Fig. 6C).

Network nodes definition (discriminative pattern)

Network nodes definition involved the following main steps (Fig. 6B): 1) cortical activities estimation; 2) Information Entropy; 3) R-square analysis; 4) SVM and 5) Back projection.

Cortical activities estimation

The conventional source localization procedure, weighted minimum norm estimation (WMNE), was used to estimate the cortical activities. For single-trial EEG epochs, three frequency-bands, i.e. theta (4–8 HZ), alpha (8–13 HZ), beta (13–30 HZ), were separately extracted by the wavelet transformation. Then cortical activities were calculated by applying linear inverse operator W to the three frequency-band signals:

$${\rm{S}}({\rm{t}})={\rm{W}}x({\rm{t}})$$

(1)

where x(t) represented the n-channels EEG data at time t and S(t) denoted corresponding cortical activities. W was obtained by:

$${\rm{W}}={\rm{R}}{A}^{T}{(AR{A}^{T}+{\lambda }^{2}C)}^{-1}$$

(2)

Here, C and R referred to covariance matrices of the noise and sources, respectively. A was the gain matrix, calculated via the Brainstorm toolbox (http://neuroimage.usc.edu/brainstorm/), and the regularization parameter, ${\rm{\lambda }}$, was calculated by:

$${\rm{\lambda }}=\frac{trace(AR{A}^{T})}{trace(C)\,\ast \,SN{R}^{2}}$$

(3)

A fixed value of 5 was used for the signal-to-noise ratio (SNR), which reflected the value in the evoked response experiments⁴⁵.

Here, a 3-shell realistic head model was adopted for EEG source activities estimation, where the conductivities for the cortex, skull, and scalp were 1.0 Ω^{− 1} m^{− 1}, 1/80 Ω^{− 1} m^{− 1}, and 1.0 Ω^{− 1} m^{− 1}respectively. The solution space was restricted to the cortical grey matter, the hippocampus, and other possible source activity areas, consisting of 15002 cubic mesh voxels with 10 mm inter-distance. The lead field matrix was calculated by the boundary element method (BEM)⁴⁶.

Rhythm Entropy

After acquiring the cortical activities S(t), the power of cortical activities in each trial was then calculated by the following equation:

$${\rm{Power}}=\sum _{t=1}^{m}S{(t)}^{2}$$

(4)

where m was the number of sample points and the information entropy was measured as follow:

$${\rm{iEn}}=-\sum _{i=1}^{3}{P}_{i}lo{g}_{2}({P}_{i})$$

(5)

where ${P}_{i}$ (i = 1, 2, 3) represented the normalized power of power_i and was calculated via divided by the sum of three frequency-band powers, i.e. theta, alpha and beta:

$${P}_{i}=\frac{Powe{r}_{i}}{{\sum }_{i=1}^{3}Powe{r}_{i}}$$

(6)

R-square analysis

R² analysis was a common criterion of separability in BCIs research⁴⁷, and often used to indicate correlation between features and class tags:

$${R}^{2}=\frac{{(E{X}_{+1}-E{X}_{-1})}^{2}}{4{\sigma }_{X}^{2}}$$

(7)

where ${X}_{+1}$ presented feature vector of target and ${X}_{-1}$ represented feature vector of non-target. ${\sigma }_{X}$ was the standard deviation. The R² value reflected the difference in the power of the two classes, with the larger R² value denoting the greater difference between two classes³⁷. For determining the threshold for further SVM classification, ten thresholds, i.e. from 0.1 times the maximum R² value to the maximum R² value and step length was set to 0.1, were selected to evaluate the performance of SVM classification. The final threshold was set to 0.6 times of the maximum R² value among all dipoles because most of the subjects got the best classification rate under this threshold in the present study^48,49. The dipoles with R² value exceeding the threshold were chosen for further SVM classification.

Support vector machine (SVM)

In the experiment, the number of correct trials was greater than that of error trials (ACC > 80%, i.e. correct trials > 96, error trials < 24). For each subject, the number of error trials was at least 20 to ensure the training sample size. Correct trials were randomly selected to make it consistent with the amount of error trials.

SVM was developed by Vapnik based on statistics learning theory (SLT). As its excellent generalization performance, SVM has been applied in a wide variety of issues. SVM had the feature of empirical risk minimization (ERM) and global optimum solution⁵⁰. We trained a SVM classifier with radial basis kernel function to extract highly discriminative brain regions. The goal of a SVM classifier with RBF kernel was to find a decision function ${\rm{f}}({\rm{x}})=w^{\prime} \varnothing (x)+b$ by solving the following optimization problem⁵¹:

$$\begin{array}{c}\mathop{\min }\limits_{w,\varepsilon }\frac{1}{2}||w|{|}^{2}+C\sum _{i=1}^{N}{\varepsilon }_{i}\\ {\rm{s}}.{\rm{t}}.{y}_{i}(w^{\prime} \varnothing ({x}_{i})+b)\ge 1-{\varepsilon }_{i}\end{array}$$

(8)

where w was the normal of the hyperplane; the function $\varnothing $ mapped the vector ${x}_{i}$ in a higher dimensional space⁵²; ${\varepsilon }_{i}$ was a measure of the misclassification errors for non-separable cases; and C traded off the empirical risk and model complexity, and was set by grid search algorithm⁵³. Here, C ranged from 10⁻⁸ to 10⁸ and the step length was set to 10^0.8.

If the SVM classifier could reflect the relationship between features and the class labels very well, the classifier was considered that it could predict the classes of new samples with good performance. Therefore, classification accuracy (CA), sensitivity (SE), specificity (SP) and area under ROC curve (AUC) were utilized to evaluate the classification performance of SVM classifier⁵⁴. At the same time, leave-one-out cross-validation (LOOCV) was applied to evaluate the generalization performance of SVM for a small sample size.

The percentage of the number of samples predicted correctly in the test set over the total samples, CA, was calculated as follows:

$${\rm{CA}}=\frac{TP+TN}{TP+TN+FP+FN}\,$$

(9)

where true positive (TP) was the number of positive samples correctly predicted and true negative (TN) was the number of negative samples correctly predicted. False positive (FP) denoted the number of positive samples incorrectly predicted and false negative (FN) denoted the number of negative samples incorrectly predicted.

SE and SP were calculated by the following formula, respectively:

$${\rm{SE}}=\frac{TP}{TP+FN}$$

(10)

$${\rm{SP}}=\frac{TN}{TN+FP}$$

(11)

SE referred to the ratio of correctly classified positive samples to the total population of positive samples, whereas SP was the ratio of correctly classified negative samples to the total population of negative samples.

Back projection

Both R² analysis and SVM classifier were used to differentiate between correct-response related and error-response related brain regions (i.e. the positions of dipoles), each feature influenced the classification via its R² value. The larger the R² value was, the greater it affected the final discrimination. However, the correlational vector R² was in a dimension-reduced space. To determine the discriminative brain areas, R² values were mapped back to the high-dimensional space (i.e. dipole space). The correlational vector in the dimension-reduced subspace can be projected back to the original feature space according to the following formula:

$$Di=Ui\ast {R}^{2}$$

(12)

For a given dipole i, the correlational representation was ${D}_{i}$ and the identity matrix was Ui in current study. Finally, the correlations were reconverted into the MNI space to obtain the discriminative regions. A threshold was required to determine brain areas that had significantly distinct characteristics between the correct- and error-response trials. For each dipole, a statistically meaningful threshold was derived by using 0.6 times of the maximum R² value among all dipoles, because most of the subjects got the best classification rate under this threshold in the present study.

Time series estimation

For averaged ERPs at scalp, cortical activities (i.e. cortical ERPs) were estimated via WMNE (details described in the above section “Cortical activities estimation”). The time series in the discriminative brain regions localized in the above session of “network nodes definition” (Fig. 6B), were computed via averaging cortical activities of the dipoles within sources respectively (Fig. 6C).

Time-varying network analysis

After obtaining discriminative sources (Fig. 6B) and time series in discriminative sources (Fig. 6C), time-varying network analysis was performed (Fig. 6D).

ADTF calculation

The multivariate adaptive autoregressive (MVAAR) model of source waves was computed by the following equation:

$$X(t)=\sum _{k=1}^{p}w(k,t)X(t-k)+\varepsilon (t)$$

(13)

where X(t) represented the cortical source wave over the entire time window, w(k, t) was the coefficients matrix of the time-varying model, which calculated by the Kalman filter algorithm, and $\varepsilon (t)$ represented the multivariate independent white noise. The symbol p denoted the MVAAR model order selected by Schwarz Bayesian Criterion^28,55.

As mentioned above, the discriminative brain areas as the cortical sources (Fig. 6B) were applied in the time-varying network analysis. After obtaining the MVAAR model coefficient (w(k, t)), H(f, t) was obtained from w(f, t), which was then transformed by Eq. (13) into the frequency domain. The H_ij element of H(f, t) described the directional information flow between the jth and the ith element at each time point t as:

$$w(f,t)\ast X(f,t)=\varepsilon (f,t)$$

(14)

$$X(f,t)={w}^{-1}(f,t)\ast \varepsilon (f,t)=H(f,t)\ast \varepsilon (f,t)$$

(15)

where $w(f,t)=\sum _{k=0}^{p}{w}_{k}(t){e}^{-j2\pi f\Delta tk}$, w_k was the matrix of the time-varying model coefficients. $X(t)$ and $\varepsilon (t)$ were transformed into the frequency domain as X(f,t) and $\,\varepsilon (f,t)$, respectively.

Defining the directed causal interrelation from the jth to the ith element, the normalized ADTF was described between (0, 1) as follows,

$${\iota }_{ij}^{2}(f,t)=\frac{|{H}_{ij}(f,t){|}^{2}}{{\sum }_{k}^{n}|{H}_{ik}(f,t){|}^{2}}$$

(16)

To obtain the total information flow from a single node, the integrated ADTF was calculated as the ratio of summation of ADTF values divided by the interested frequency bands [f1, f2]:

$${\vartheta }_{ij}^{2}(t)=\frac{{\sum }_{f1}^{f2}{\iota }_{ij}^{2}(k,t)}{f2-f1}$$

(17)

We chose average ADTF values over 4–30 Hz to acquire the final directional information flow according to the range of three frequency bands.

Surrogate data testing

The distribution of ADTF estimator under the null hypothesis of no causal interactions was not well determined, since the ADTF function had a highly nonlinear correlation with the time series where it derived. In view of this, the phases of the Fourier coefficients were independently and randomly iterated to produce a new surrogate data, which was a nonparametric statistical test²⁸. The spectral structure of the time series was retained in the process of iterating the phases of the Fourier coefficients. The shuffling procedure was repeated 200 times for each model-derived time series of each subject to establish an empirical distribution of the ADTF value under the null hypothesis of no connectivity.

Correlation analysis

We performed Pearson correlation analysis to investigate the following relationships: 1) RT and entropy values of discriminative regions; and 2) RT and ADTF values of information flow between discriminative regions. All thresholds were set at p < 0.05. Here, ADTF value of information flow was adjusted by dividing the weighted degree of network.

References

Tian, Y., Liang, S. & Yao, D. Attentional orienting and response inhibition: insights from spatial-temporal neuroimaging. Neuroscience Bulletin 30, 141–152 (2014).
Article PubMed Google Scholar
Tian, Y., Chica, A. B., Xu, P. & Yao, D. Differential consequences of orienting attention in parallel and serial search: An ERP study. Brain Research 1391, 81–92 (2011).
Article CAS PubMed Google Scholar
Cherry, E. C. Some Experiments on the Recognition of Speech, with One and with Two Ears. Journal of the Acoustical Society of America 25, 975–979 (1953).
Article ADS Google Scholar
Mesgarani, N. & Chang, E. F. Selective cortical representation of attended speaker in multi-talker speech perception. Nature 485, 233–236 (2012).
Article ADS CAS PubMed Google Scholar
O’Sullivan, J. A. et al. Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG. Cerebral Cortex 25, 1697–1706 (2015).
Article PubMed Google Scholar
Marchegiani, L., Karadogan, S. G., Andersen, T., Larsen, J. & Hansen, L. K. The Role of Top-Down Attention in the Cocktail Party: Revisiting Cherry’s Experiment after Sixty Years. International Conference on Machine Learning and Applications and Workshops 1, 183–188 (2011).
Google Scholar
Ding, N. & Simon, J. Z. Emergence of neural encoding of auditory objects while listening to competing speakers. Proceedings of the National Academy of Sciences of the United States of America 109, 11854–11859 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Ding, N. & Simon, J. Z. Neural coding of continuous speech in auditory cortex during monaural and dichotic listening. Journal of Neurophysiology 107, 78–89 (2012).
Article PubMed Google Scholar
Zion Golumbic, E. M. et al. Mechanisms underlying selective neuronal tracking of attended speech at a “cocktail party”. Neuron 77, 980–991 (2013).
Article CAS PubMed PubMed Central Google Scholar
Power, A. J., Foxe, J. J., Forde, E. J., Reilly, R. B. & Lalor, E. C. At what time is the cocktail party? A late locus of selective attention to natural speech. European Journal of Neuroscience 35, 1497–1503 (2012).
Article PubMed Google Scholar
Corbetta, M., Patel, G. & Shulman, G. L. The Reorienting System of the Human Brain: From Environment to Theory of Mind. Neuron 58, 306–324 (2008).
Article CAS PubMed PubMed Central Google Scholar
Zhang, D. et al. An independent brain-computer interface using covert non-spatial visual selective attention. Journal of Neural Engineering 7, 16010 (2010).
Article PubMed Google Scholar
Wu, Z. & Yao, D. The influence of cognitive tasks on different frequencies steady-state visual evoked potentials. Brain Topography 20, 97–104 (2007).
Article PubMed Google Scholar
Martin, S. & Christian, N. Error-related potentials during continuous feedback: using EEG to detect errors of different type and severity. Frontiers in Human Neuroscience 9, 155 (2015).
Google Scholar
Gehring, W. J., Coles, M. G., Meyer, D. E. & Donchin, E. A brain potential manifestation of error-related processing. Electroencephalography & Clinical Neurophysiology Supplement 44, 261–272 (1995).
CAS Google Scholar
Trujillo, L. T. & Allen, J. J. B. Theta EEG dynamics of the error-related negativity - Clinical Neurophysiology. Clinical Neurophysiology 118, 645–668 (2007).
Article PubMed Google Scholar
Tong, J., Lin, Q., Xiao, R. & Ding, L. Combining multiple features for error detection and its application in brain-computer interface. Biomedical Engineering Online 15, 1–15 (2016).
Article Google Scholar
Wang, F., Lin, J., Wang, W. & Wang, H. EEG-based mental fatigue assessment during driving by using sample entropy and rhythm energy. IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems. 1906–1911(2015).
Tian, Y. et al. Spectral Entropy Can Predict Changes of Working Memory Performance Reduced by Short-Time Training in the Delayed-Match-to-Sample Task. Frontiers in Human Neuroscience 11, https://doi.org/10.3389/fnhum.2017.00437 (2017).
Tian, Y. & Yao, D. Why do we need to use a zero reference? Reference influences on the ERPs of audiovisual effects. Psychophysiology 50, 1282–1290 (2013).
Article PubMed Google Scholar
Liu, T. et al. Cortical Dynamic Causality Network for Auditory-Motor Tasks. IEEE Transactions on Neural Systems and Rehabilitation Engineering 99, 1092–1099 (2016).
Article ADS Google Scholar
Yao, D. A method to standardize a reference of scalp EEG recordings to a point at infinity. Physiological Measurement 22, 693–711 (2001).
Article CAS PubMed Google Scholar
Xu, P. et al. Cortical network properties revealed by SSVEP in anesthetized rats. Scientific Reports 3, 2496 (2013).
Article PubMed PubMed Central Google Scholar
Tian, Y. et al. Predictors for drug effects with brain disease: Shed new light from EEG parameters to brain connectomics. European Journal of Pharmaceutical Sciences 110,26–36 (2017).
Tian, Y., Ma, W., Tian, C., Xu, P. & Yao, D. Brain oscillations and electroencephalography scalp networks during tempo perception. Neuroscience Bulletin 29, 731–736 (2013).
Article PubMed PubMed Central Google Scholar
Ding, J. R. et al. Altered functional and structural connectivity networks in psychogenic non-epileptic seizures. Plos One 8, e63850 (2013).
Article ADS PubMed PubMed Central Google Scholar
Singh, P., Joshi, S. D., Patney, R. K. & Saha, K. Fourier-Based Feature Extraction for Classification of EEG Signals Using EEG Rhythms. Circuits Systems & Signal Processing 35, 3700–3715 (2016).
Article MathSciNet CAS Google Scholar
Wilke, C., Ding, L. & He, B. Estimation of time-varying connectivity patterns through the use of an adaptive directed transfer function. IEEE transactions on bio-medical engineering 55, 2557–2564 (2008).
Article ADS PubMed PubMed Central Google Scholar
Li, F. et al. The time-varying networks in P300: a task-evoked EEG study. IEEE Transactions on Neural Systems & Rehabilitation Engineering A Publication of the IEEE Engineering in Medicine & Biology Society 24, 725–733 (2016).
Article Google Scholar
Choi, I., Rajaram, S., Varghese, L. A. & Shinn-Cunningham, B. G. Quantifying attentional modulation of auditory-evoked cortical responses from single-trial electroencephalography. Frontiers in Human Neuroscience 7, 115 (2013).
Article PubMed PubMed Central Google Scholar
Looney, D., Park, C., Xia, Y. & Kidmose, P. Towards estimating selective auditory attention from EEG using a novel time-frequency-synchronisation framework. International Joint Conference on Neural Networks 9, 1–5 (2010).
Google Scholar
Holroyd, C. B. & Coles, M. G. The neural basis of human error processing: reinforcement learning, dopamine, and the error-related negativity. Psychological Review 109, 679–709 (2002).
Article PubMed Google Scholar
Iannaccone, R. et al. Conflict monitoring and error processing: new insights from simultaneous EEG-fMRI. Neuroimage 105, 395–407 (2015).
Article PubMed Google Scholar
Van Veen, V. & Carter, C. S. The anterior cingulate as a conflict monitor: fMRI and ERP studies. Physiology & Behavior 77, 477–482 (2002).
Article Google Scholar
KIEHL et al. Error processing and the rostral anterior cingulate: An event-related fMRI study. Psychophysiology 37, 216–223 (2000).
Article CAS PubMed Google Scholar
Sohn, H. et al. Linear and non-linear EEG analysis of adolescents with attention-deficit/hyperactivity disorder during a cognitive task. Clinical Neurophysiology 121, 1863–1870 (2010).
Article PubMed Google Scholar
Zhang, R. et al. Predicting Inter-session Performance of SMR-Based Brain–Computer Interface Using the Spectral Entropy of Resting-State EEG. Brain Topography 28, 680–690 (2015).
Article PubMed Google Scholar
Pincus, S. M., Goldberger, L., Steven, M. & Physiologi Physiological time-series analysis: what does regularity quantify? American Journal of Physiology 266, 1643–1656 (1994).
Google Scholar
Davidson, D. J. & Indefrey, P. An event-related potential study on changes of violation and error responses during morphosyntactic learning. Journal of Cognitive Neuroscience 21, 433–446 (2009).
Article PubMed Google Scholar
Carter, C. S. et al. Anterior cingulate cortex, error detection, and the online monitoring of performance. Science 280, 747–749 (1998).
Article ADS CAS PubMed Google Scholar
Garavan, H., Ross, T. J., Kaufman, J. & Stein, E. A. A midline dissociation between error-processing and response-conflict monitoring. Neuroimage 20, 1132–1139 (2003).
Article CAS PubMed Google Scholar
Coles, M. G., Scheffers, M. K. & Holroyd, C. B. Why is there an ERN/Ne on correct trials? Response representations, stimulus-related components, and the theory of error-processing. Biological Psychology 56, 173–189 (2001).
Article CAS PubMed Google Scholar
Holroyd, C. B., Yeung, N., Coles, M. G. H. & Cohen, J. D. A Mechanism for Error Detection in Speeded Response Time Tasks. Journal of Experimental Psychology General 134, 163–191 (2005).
Article PubMed Google Scholar
Yao, D. et al. A comparative study of different references for EEG spectral mapping: the issue of the neutral reference and the use of the infinity reference. Physiological Measurement 26, 173–184 (2005).
Article ADS PubMed Google Scholar
Lin, F. H., Witzel, T., Dale, A. M., Belliveau, J. W. & Stufflebeam, S. M. Spectral spatiotemporal imaging of cortical oscillations and interactions in the human brain. Neuroimage 23, 582–595 (2004).
Article PubMed PubMed Central Google Scholar
Fuchs, M., Drenckhahn, R., Wischmann, H. A. & Wagner, M. An improved boundary element method for realistic volume-conductor modeling. IEEE transactions on bio-medical engineering 45, 980–997 (1998).
Article CAS PubMed Google Scholar
Pfurtscheller, G., Vaughan, A. T. M., Wolpaw, J. R., Mcfarland, D. & Birbaumer, N. Brain-computer interfaces for communication and control. Psychophysiology 43, 517–532 (2002).
Google Scholar
Hsu, H. H. & Hsieh, C. W. Feature Selection via Correlation Coefficient Clustering. Journal of Software 5, 1371–1377 (2010).
Google Scholar
Zhang, Y., Xu, P., Cheng, K. & Yao, D. Multivariate synchronization index for frequency recognition of SSVEP-based brain-computer interface. Journal of Neuroscience Methods 221, 32–40 (2013).
Google Scholar
Netherlands, S. Support Vector Machine (SVM). (Springer Netherlands, 2008).
Wang, L., Shen, H., Tang, F., Zang, Y. & Hu, D. Combined structural and resting-state functional MRI analysis of sexual dimorphism in the young adult human brain: an MVPA approach. Neuroimage 61, 931–940 (2012).
Article PubMed Google Scholar
Renukadevi, N. T. & Thangaraj, P. Performance Evaluation of SVM – RBF Kernel for Medical Image Classification. Global Journal of Computer Science & Technology (2013).
Liu, C., Yin, S. Q., Zhang, M., Zeng, Y. & Liu, J. Y. An Improved Grid Search Algorithm for Parameters Optimization on SVM. Applied Mechanics & Materials 644–650, 2216–2219 (2014).
Article Google Scholar
Galar, M., Fernandez, A., Barrenechea, E., Bustince, H. & Herrera, F. A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches. IEEE Transactions on Systems Man & Cybernetics Part C 42, 463–484 (2012).
Article Google Scholar
Schwarz, G. Estimating the dimension of a model. The annals of statistics 6, 461–464 (1978).
Article ADS MathSciNet MATH Google Scholar

Download references

Acknowledgements

This research is supported by the National Natural Science Foundation of China (#61671097); the Chongqing Research Program of Basic Science and Frontier Technology (No. cstc2017jcyjBX0007; No. cstc2015jcyjA10024); the Chongqing Key Laboratory Improvement Plan (cstc2014pt-sy40001) and the University Innovation Team Construction Plan Funding Project of Chongqing (CXTDG201602009).

Author information

Yin Tian and Wei Xu contributed equally to this work.

Authors and Affiliations

Bio-information College, ChongQing University of Posts and Telecommunications, ChongQing, 400065, China
Yin Tian, Wei Xu & Li Yang

Authors

Yin Tian
View author publications
You can also search for this author in PubMed Google Scholar
Wei Xu
View author publications
You can also search for this author in PubMed Google Scholar
Li Yang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived, designed the experiments and wrote the manuscript: YT. Performed the experiments, analyzed the data and wrote the first draft: W.X. Contributed reagents/materials/analysis tools: W.X., Y.T. Discussed the experiment design, analyzed the data and discussed the experiment results: Y.T., L.Y., W.X.

Corresponding author

Correspondence to Yin Tian.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tian, Y., Xu, W. & Yang, L. Cortical Classification with Rhythm Entropy for Error Processing in Cocktail Party Environment Based on Scalp EEG Recording. Sci Rep 8, 6070 (2018). https://doi.org/10.1038/s41598-018-24535-4

Download citation

Received: 13 February 2018
Accepted: 05 April 2018
Published: 17 April 2018
DOI: https://doi.org/10.1038/s41598-018-24535-4

This article is cited by

Temporal contrast effects in human speech perception are immune to selective attention
- Hans Rutger Bosker
- Matthias J. Sjerps
- Eva Reinisch
Scientific Reports (2020)
The Influence of Listening to Music on Adults with Left-behind Experience Revealed by EEG-based Connectivity
- Yin Tian
- Liang Ma
- Sifan Chen
Scientific Reports (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Mobile BCI dataset of scalp- and ear-EEGs with ERP and SSVEP paradigms while standing, walking, and running

Classification of mental workload using brain connectivity and machine learning on electroencephalogram data

Target of selective auditory attention can be robustly followed with MEG

Introduction

Result

Reaction Time (RT)

Brain regions with high discriminative power

Classification accuracy

Relationship between RT and rhythm entropy (RhEn)

Information flow between the discriminative regions

Relationship between RT and information flow

Disscussion

RhEn of discriminative cortices and RT

Information flow via time-varying networks

BCI Application

Conclusion

Material and Methods

Participants

Stimuli and Design

Scalp EEG recording and preprocessing

Network nodes definition (discriminative pattern)

Cortical activities estimation

Rhythm Entropy

R-square analysis

Support vector machine (SVM)

Back projection

Time series estimation

Time-varying network analysis

ADTF calculation

Surrogate data testing

Correlation analysis

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Temporal contrast effects in human speech perception are immune to selective attention

The Influence of Listening to Music on Adults with Left-behind Experience Revealed by EEG-based Connectivity

Comments

Search

Quick links