A Novel Transfer Learning with Organic Computing in Deep Learning for Stress Classification

ABSTRACT


INTRODUCTION
Stress triggers an individual's immune system to respond to external stimuli, resulting in both mental and physical reactions [1].Psychological inflammation can impair skin defense mechanisms and reduce immune and circulatory system effectiveness.Stress symptoms are less useful for stress analysis than non-intrusive elements like respiration rate, breathing patterns, or skin temperature [2,3].Hormone measurements are only monitored in laboratory settings, not in the human body [4].Psychological inflammation is associated with chronic health conditions such as diabetes, arthritis, and heart disease.The respiratory system plays a role in regulating hormone levels and maintaining defense and heart function.Techniques are utilized to predict and quantify hormone production [5], but the overall effectiveness of integration remains a challenge.Studies frequently use physiological signals to identify emotional states, as the sympathetic nervous system regulates emotions such as fear, anger, and panic [6].
Typically, changes in an individual's emotional state are a direct reflection of their psychological state.EDA is used to describe this phenomenon [7,8].As well, ECG has also been used for stress classification in the past decades [9].Stress classification using machine learning schemes such as Support Vector Machine (SVM), etc., has been investigated in previous years to learn various physiological signals and classify stress levels [10,11].On the contrary, such algorithms need the sophisticated and random signal processing of physiological information, which is unsuitable for designing classification frameworks using large-scale databases and the emergence of deep learning models.As a result, DL models have been extensively utilized in the field of stress classification through EDA and ECG since they process actual data and recognize the relevant characteristics with no preprocessing or attribute extraction processes [12,13].Even though DL models can learn characteristics, those models are data-hungry.Also, they cannot handle sudden concept drift.Concept drift refers to the phenomenon where the statistical properties of a dataset change over time.In the context of stress classification, it means that the patterns and relationships between physiological signals (such as EDA and ECG) and stress levels can evolve or shift due to various factors.This poses a significant challenge in real-time stress monitoring because a model trained on historical data may become less effective as new data patterns emerge.
The challenges of concept drift in deep learning-based stress classification models are the following: • Model degradation: As the relationship between physiological signals and stress levels changes, a model trained on older data may start making inaccurate predictions.This can lead to a decline in the model's performance.
• Data labeling: When dealing with concept drift, it is essential to continuously label new data to reflect the current stress levels accurately.However, obtaining real-time labeled data can be resource-intensive and time-consuming.
•Adaptation: Adapting to concept drift requires models to be dynamic and flexible.The challenge is to modify the model's structure or parameters to accommodate new data patterns effectively.This adaptation process needs to be automated and efficient.
•Real-time responsiveness: Concept drift often involves sudden changes in data patterns, and models must respond promptly to these changes.Delays in adapting to new patterns can result in inaccurate stress classifications.
To address the above-mentioned challenges, the DL models should be dynamically adjusted in response to concept drift.This ensures that the model remains accurate and responsive even as the relationships between EDA, ECG, and stress levels change over time.Hence, in this paper, a novel DTLOC model is proposed to handle concept drift and obtain better accuracy from available data by using a self-managing system for adapting the DL structure according to the error rate during the training process.Initially, the EDA and ECG signal databases are collected from the available sources.A DCNN is proposed with an OC paradigm and TL algorithm.The TL process exchanges the learned weight value or knowledge about the features of EDA and ECG among convolutional layers.An OC-based self-managing system can dynamically reconfigure the DCNN structure to solve the problem of sudden concept drift during stress classification using large-scale real-time data.Thus, this innovative approach overcomes the limitations of traditional DL models and has the potential to significantly improve stress analysis in practical applications.
The remaining article is written as follows: Section 2 reviews the research on the categorization of human stress levels.The DTLOC model is described in Section 3, and its effectiveness is presented in Section 4. Further, this study is summarized in Section 5.

LITERATURE SURVEY
Many studies aim to assess the impact of stress on an individual's life using physiological data.This section reviews the stress classification models based on machine learning and DL models using physiological data.

Stress classification using machine learning models
An ElectroOculoGraphy with Artificial Neural Network (EOG-ANN) [14] was presented for categorizing stress levels from EEG data.First, the pre-processing was conducted to remove noise from the EEG signal data using the autoregressive filtering scheme.Afterward, the time-domain characteristics were retrieved and fed to the ANN for stress prediction.But it was time-consuming for such a massive quantity of data.
1D CNN and a Multi-Layer Perceptron (MLP) [15] were designed to detect human stress.Initially, stressed and nonstressed states were distinguished by a binary classification.Then, a 3-class classification was performed to classify emotions into neutral, stressed, and amused states.But the dataset used in this model was limited, which may not be sufficient to define the overall human population.
The method of classifying EEG emotions using the LIBSVM classifier has been proposed [16].First, the Lempel-Ziv and wavelet coefficients were determined for the EEG signal.The coefficients were then classified into different emotional states by the LIBSVM.However, its success rate was lower when classifying multiple emotion classes.
Human emotion recognition was developed by learning multi-channel characteristics from the EEG signal [17].In this method, multi-channel EEG and textual feature fusion were applied in the time domain to recognize various human emotions, wherein the statistical traits were concatenated to create a feature vector.Moreover, the SVM was trained to recognize human emotions.But the training process takes a long time while increasing the number of data points.To design an Analysis of Variance (ANOVA) classifier for classifying stress levels [18].But it needs deep learning classifiers to increase the classification accuracy.A multiobjective evolutionary scheme, a fuzzy unkanked ruling generation scheme, and MLP [19] were used to analyze the database and detect the level of distress among students.But the number of instances in the database was not adequate.
To predict generalized anxiety levels based on the machine learning algorithm.In this analysis, 2-class and 3-class anxiety issues were categorized earlier by gathering the database during the COVID-19 epidemic in Saudi Arabia [20].The information was gathered from every area of the UK through an online inspection comprising queries to recognize aspects impacting anxiety levels after queries from the GAD-17, a monitoring device for generalized anxiety diseases.Then, the estimation systems were constructed by the SVM and J48 decision tree classifiers.However, as the number of classes increased, the system complexity also increased.
According to these models, it can be inferred that the machine learning models are not fit for large-scale datasets due to their high computational complexity.Additionally, they are unable to learn the comprehensive characteristics necessary for accurately classifying stress based on physiological signals.
To combat these problems, DL models have emerged for stress classification.

Stress classification using deep learning models
A deep learning-based approach [21] was developed for multimodal stress detection.This approach involved unsupervised feature learning and supervised stress classification.The unsupervised feature learning involved modality-based feature learning, which projects multimodal representations.The representation was processed using a Gated Recurrent Unit (GRU) to learn spatiotemporal features, and the resulting output was then fed into an auto-encoder for multimodal stress detection.However, the accuracy of the results was compromised due to the limited amount of data available.CNN model [22] was developed for categorizing acute cognitive stress into five distinct periods.However, it required significant computation and storage resources.
A subject-independent emotion recognition scheme from EEG data based on the Variational Mode Decomposition (VMD) and Deep Neural Network (DNN) [23].First, the VMD was applied to determine the features from the EEG data.Then, such features were classified by the DNN into different emotional states.Conversely, its training speed was extremely slow.
A method was presented for emotion recognition from EEG signals by Bara et al. [24].The zero-time windowing approach was used to extract instantaneous spectral features by utilizing the numerator group-delay function.This method allows for easy detection of epochs in all emotional states.The Quadratic Discriminant Recurrent Neural Network (QDRNN) was used to classify emotional states.However, accuracy was less because it considered only a limited signal and it did not handle the concept drift problem.
A novel approach for emotion recognition using EEG data was proposed by Gannouni [25], utilizing a three-dimensional CNN (3D-CNN).The 3D-CNN method extracts spatiotemporal features from EEG signals and captures the relationship between different channel positions by collecting data from multiple channels as input.Additionally, dimensional emotions were consolidated, saving computation time by processing multiple dimensional labels together.But, the concept drift issue may degrade the model performance.
Long Short-Term Memory (LSTM) network [26] was develoepd for categorizing stress levels from EEG data.First, the preprocessing was conducted to remove noise from the EEG signal data using the auto-regressive filtering scheme.Afterward, the time-domain characteristics were retrieved and fed to the LSTM for stress prediction.However, processing such a large amount of data was time-consuming.
Earlier DL models in the literature were incapable of addressing concept drift issues in real-time stress classification.This tends to degrade the model's adaptability and performance while varying data patterns, or the model's parameters during training.This study aims to address the concept drift problem in stress classification using a DL model by combining OC and TL strategies.

Figure 1. Architecture of DTLOC model for human stress classification
In this section, the proposed DTLOC model is explained for stress classification.In this model, 3 major processes are performed: (i) data acquisition; (ii) knowledge transfer (TL); and (iii) self-regulation (OC) for the DCNN classifier.Figure 1 illustrates the entire architecture of the DTLOC model.

Data acquisition
The first step is to obtain a publicly accessible multimodal dataset known as the WESAD (Wearable Stress and Affect Detection) database.The Trier Social Stress Test is used as a stress stimulus on 15 individuals (12 men and 3 women) during the data collection process.This data set focuses in particular on pregnant graduate students, heavy smokers, psychiatric illnesses, infectious diseases, and cardiovascular diseases.The 15 subjects examined had an average age of 27.5±2.4years.Each subject's data is linked to many selfreports that, during an affective stimulus, represent the subjective experience.This dataset includes triaxial acceleration signals obtained at 700 Hz from two different devices, such as a chest-worn device (RespiBAN professional) and a wrist-worn device, along with physiological modalities of high resolution such as ECG, EDA, etc.The Respiban is applied to the subject's chest.The respiration is monitored via a respiratory inductive plethysmograph sensor.The ECG data is recorded using a typical three-point ECG.The rectus abdomens, which enables the individual to move as freely as possible, record the EDA signal.Both individuals also recorded BVP (64Hz) and EDA (4Hz) on their non-dominant hands using the Empatica E4.The computer receives the recorded data and stores it locally for further processing.

Algorithm for TL
Input: DCNN categorizer ℎ(  ), ℎ(  ), γ1, γ2 and C Initialize   (1) = ℎ and   (2)  .Moreover, the trainer can modify the activation function using the present sample concerning some conditions.A goal of this training is to lessen the overall error.On the other hand, in this situation, if the distribution extremely alters frequently over , then the TL cannot working well.
To formulate the concept-drifting TL, a window dimension variable Pi is adopted, which is the quantity of samples obtained in the i th iteration.Additionally, the activation functions of 2 categorizers are kept.Therefore, at the t th iteration, for xt, its  ̂ is categorized by the ensemble function given in Eq. ( 8): The key issue is how to fine-tune the weight.It is evident that at the initial iteration, the DCNN-TL is recurrently 0, thus its activation function is weighted with 0, while the activation function of DTLOC is weighted with one in it.A below powerful exponential weighted modification is applied to adaptively alter the weights for the successive iterations: if mod(t, Pi)≠0:

Organic computing
Organic Computing (OC) is an approach to designing selfmanaging systems that takes inspiration from the selfregulation and adaptability found in natural systems.In OC, systems are designed to be dynamic and capable of adapting to changing conditions, similar to how living organisms adjust to their environments.The main concept is to develop selfmanaging systems that operate autonomously without constant human intervention.
In the context of the DTLOC model, OC is utilized to establish a self-managing system for classifying stress.The model can dynamically adjust its network structure and objectives in real time based on physiological signal information.This suggests that the model can adapt its configuration to effectively handle different stress conditions, similar to how a person adjusts their behaviour when faced with stress.Figure 2 represents structure of OC.
• Generalizability: The model is versatile and can be applied to classify various types of stress and emotions.It can be used with datasets of any size, making it suitable for a wide range of scenarios and applications.
• Abstraction level: The DTLOC model operates at a higher level of abstraction than traditional computational models.This means that it emphasizes objectives and goals rather than specific computational processes.This higher level of abstraction enables greater flexibility and adaptability.
•Scalability: The DTLOC model is scalable, allowing it to adapt its knowledge base as necessary.This adaptability makes it suitable for various environments and data sources, and it can continue to develop and expand to address emerging challenges.

Change management
The change management module is next to the component controller, which is responsible for identifying the need for change and deciding how to adapt.When it detects that the DTLOC's (i.e., DCNN) architecture or parameters need adjustment, it can initiate the reconfiguration process.This might involve changing the number of layers, the size of convolutional filters, other architectural elements, or objective functions.

Goal management
OC systems often operate with predefined objectives.In this case, the goal of the DTLOC model is to accurately classify stress based on physiological signals.The goal management module can guide the reconfiguration by determining which architectural or parameter changes are most likely to improve stress classification performance.
These OC modules continuously monitor the input data and system performance in real-time.If the physiological signals change, indicating different stress conditions, the system can adapt the DCNN architecture and parameters to better fit the new data distribution.

RESULTS AND DISCUSSION
The efficiency of the DTLOC model is assessed in MATLAB 2019b using the WESAD database and compared with the existing DL models: LSTM [26], DNN [23], LIBSVM [16], and CNN [22].The comparison is conducted in terms of the following metrics: •Accuracy: It is the percentage of precise classification over the total data instances tested.(11) In Eq. ( 11), TP is the quantity of distress instances precisely categorized as distress, TN is the quantity of stress instances precisely categorized as stress, FP is the quantity of stress instances categorized as distress, and FN is the quantity of distress instances categorized as stress.
•Precision: It measures the appropriately classified data instances at TP and FP rates.

Limitations, assumptions, and constraints
The DTLOC model outperforms other models in stress classification on the WESAD database, demonstrating higher precision, recall, F-score, and accuracy.It is important to consider the limitations, assumptions, and constraints that may affect the interpretation and generalizability of these findings.
•The results are based on the evaluation using the WESAD database, which is a specific dataset.The performance of the DTLOC model may not apply to other datasets that have different characteristics or data distributions.It is crucial to evaluate the model's performance on a wider variety of datasets to determine its ability to generalize.
•The DTLOC is designed to handle real-time concept drift in stress classification.The effectiveness of this model relies on the alignment between the concept drift in the dataset and real-world scenarios.The model's adaptability to different types of concept drift and its performance in dynamic, evolving environments should be further investigated.
• The model does not address overfitting issues.Overfitting can happen when a model performs extremely well on the training dataset but struggles to apply to new, unseen data.A thorough evaluation should assess both overfitting and generalization performance.

CONCLUSIONS
This paper introduces the DTLOC model, which uses DCNN with OC and TL to classify human stress levels based on psychological data.The experiments assessed the effectiveness of the DTLOC model using the WESAD database in MATLAB 2019b.The results show that the DTLOC model achieved an accuracy of 93.53%.On the WESAD dataset, the accuracy of the LIBSVM, LSTM, DNN, and CNN models were 80.89%, 82.66%, 84.74%, and 89.05%, respectively.The DTLOC model achieved precision, recall, and f-score values of 93.17%, 91.93%, and 92.55%, respectively.The values exceed those of current stress classification models.
This model can help identify individuals who are at risk of stress-related illnesses, such as anxiety, depression, and heart disease, enabling timely medical intervention.Identifying stress early can prevent post-traumatic stress disorder (PTSD) and improve overall mental health.This model has the potential to improve individuals' quality of life and enhance safety in various sectors.This model has the potential to be integrated into the cloud environment for real-time stress classification in the future.Additionally, future research can explore multi-modal fusion techniques to integrate different data sources, including social media text, images, audio, and physiological signals.This integration can lead to a more comprehensive classification of stress.

Figure 2 .
Figure 2. 3-layer Structure 3.3.1 Component control OC may involve monitoring the performance of individual components or subsystems of the DTLOC model.In this case, it could oversee the CNN architecture and parameters.If the CNN's performance starts to degrade or is not optimal for a given stress classification task, the component control module can trigger reconfiguration.
) +   ()  +  +   () +   () It is the percentage of data instances that are appropriately classified at TP and FN rates.

Figure 3
Figure 3 portrays the efficiency of various stress classification models in the WESAD database.It is observed that the effectiveness of the DTLOC model based on precision, recall, and f-score is greater than that of the other classification models due to the development of a self-management system