Time-frequency time-space long short-term memory networks for image classification of histopathological tissue

Pham, Tuan D.

doi:10.1038/s41598-021-93160-5

Download PDF

Article
Open access
Published: 01 July 2021

Time-frequency time-space long short-term memory networks for image classification of histopathological tissue

Tuan D. Pham¹

Scientific Reports volume 11, Article number: 13703 (2021) Cite this article

1251 Accesses
6 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Image analysis in histopathology provides insights into the microscopic examination of tissue for disease diagnosis, prognosis, and biomarker discovery. Particularly for cancer research, precise classification of histopathological images is the ultimate objective of the image analysis. Here, the time-frequency time-space long short-term memory network (TF-TS LSTM) developed for classification of time series is applied for classifying histopathological images. The deep learning is empowered by the use of sequential time-frequency and time-space features extracted from the images. Furthermore, unlike conventional classification practice, a strategy for class modeling is designed to leverage the learning power of the TF-TS LSTM. Tests on several datasets of histopathological images of haematoxylin-and-eosin and immunohistochemistry stains demonstrate the strong capability of the artificial intelligence (AI)-based approach for producing very accurate classification results. The proposed approach has the potential to be an AI tool for robust classification of histopathological images.

Classification of breast cancer histology images using MSMV-PFENet

Article Open access 19 October 2022

Linxian Liu, Wenxiang Feng, … Jiamiao Yang

Adapting the pre-trained convolutional neural networks to improve the anomaly detection and classification in mammographic images

Article Open access 09 September 2023

Abeer Saber, Abdelazim G. Hussien, … Alaa Allakany

Deep learning-based histopathological segmentation for whole slide images of colorectal cancer in a compressed domain

Article Open access 18 November 2021

Hyeongsub Kim, Hongjoon Yoon, … Yosep Chong

Introduction

Image analysis in pathology is an important task that helps provide pathologists with quantitative information to be discovered in complex characteristics of pathology images. Conventional pathological quantification, which is based on the expertise of pathologists, is subjective in assessment, time-consuming for the analysis of large volumes of data, and may encounter difficulties when the reproducibility of results is desired. These factors have arisen the need for automated quantification of digital pathology data.

Histology is the study of the microscopic anatomy of biological tissues, while histopathology is a field of histology that involves the study of diseased tissue. The benefits of automated image analysis of histopathological images are multi-fold¹. Not only from the perspective of the ability for rendering accurate diagnosis, but the automated analysis can also provide insights into disease mechanisms for understanding biological abnormalities, optimal clinical patient-specific treatment, and biomarker discovery.

Automated image analysis of spatial structures of histopathological images were carried out in works reported in^2,3. A type of advanced machine-learning method such support vector machines (SVM) was utilized to develop a system for classifying normal tissue and tissue lesions from liver, lung, spleen, and kidney of bovine animals into different histologic categories⁴.

As image processing and classification using deep learning has been realized as a major direction of research in medical prognostics and health management⁵, using the state-of-the-art methods in artificial intelligence (AI) for pattern classification, several deep-learning models have recently been used for classification in digital pathology^6,7. Some of these works include a self-designed convolutional neural network (CNN) model for necrosis detection in whole-slide images of gastric cancer⁸, the use of a CNN model for pathology-based prediction of survival outcome of patients with lung cancer⁹, a pre-trained CNN (Inception v3) for detecting cancer subtype or gene mutations from histopathological images of non-small cell lung cancer¹⁰, pre-trained CNNs for identifying histologic growth patterns of lung cancer¹¹, and Bayesian CNN for classifying histopathological images of colorectal cancer¹². More recently, fusion of deep-learning features was performed for classifying histopathological images of breast tissue¹³; and texture features extracted from histopathological tissue images of prostate cancer were used for image classification with support vector machines to provide Gleason scores to the patients’ whole slide images, and the results found to be better than the use of deep learning¹⁴.

In this study, recurrent neural networks that learn time-frequency and time-space features for classification of time series or sequential data¹⁵ is further developed for classifying histopathological images. The networks receive input as the combination of multiple features extracted in time-frequency and time-space domains of the time series transformed from the images. The class modeling is then designed for training the networks. Comparative results obtained from testing three public datasets of histopathological images, including haematoxylin-and-eosin (H&E) stained tissue images of colorectal cancer, H&E stained tissue images of heart failure, and immunohistochemistry (IHC) stained tissue images of rectal cancer, show the capability of achieving very accurate classification by the current approach.

Materials and methods

Image data

Three public databases are used in this study and descibed as follows.

H&E colorectal-cancer data

The colorectal-cancer (CRC) histology data used in this study were originally studied in¹⁶. The H&E stained tissue images of the CRC are publicly available at URL: http://doi.org/10.5281/zenodo.53169. The dataset consists of ten anonymized H&E stained CRC tissue slides. Tumors of both low grade and high grade were included in the dataset. The slides were first digitized, then contiguous tissue areas were manually annotated and tessellated to produce 625 non-overlapping tissue images of $150 \times 150$ pixels for each of 8 types of tissue, resulting in a set of 5000 images. There are 8 tissue types for classification, which are: (1) tumor epithelium (tumor), (2) simple stroma (stroma), (3) complex stroma (complex), (4) immune cells (lymphoid), (5) debris, (6) normal mucosal glands (mucosa), (7) adipose tissue (adipose), and (8) background (no tissue or empty).

Figure 1 shows selected samples of the H&E stained CRC tissue images of the dataset.

H&E heart-failure data

The H&E stained heart-failure tissue image dataset, which includes left ventricular tissue from 209 patients was originally studied in¹⁷. The H&E stained tissue images of human heart failure are publicly available at the following URL: https://idr.openmicroscopy.org/webclient. The dataset consists of two cohorts of patients: (1) heart failure (N = 94) and (2) without heart failure (N = 115). The heart-failure tissue was collected from patients with clinically diagnosed ischemic cardiomyopathy (N = 51) or idiopathic dilated cardiomyopathy (N = 43). The non-failing patients were organ donors without a history of heart failure. All tissue types were sectioned, stained, and scanned during the data acquisition. The whole slide image of each patient was down sampled to 5$\times $ magnification, where eleven non-overlapping images considered as the regions of interest were extracted and the tissue border was manually refined. The total number of images for the heart-failure and non-heart-failure cohorts are 1034 and 1265, respectively, resulting in a dataset of 2299 images of $250 \times 250$ pixels.

Figure 2 shows selected samples of H&E stained sections of heart-failure and non-heart-failure tissue types.

IHC rectal-cancer data

The data were obtained from a cohort of 143 patients with rectal cancer from the Swedish rectal cancer trial of preoperative radiotherapy (pRT) between 1987 and 1990¹⁸. All patients were performed locally curative resection. Among them, 77 patients received tumor resection alone, and 59 received pRT followed by surgical tumor resection. The pRT was given at a total dose of 25 Gy in 5 fractions over a median of 8 days (6–14 days) before the surgery. The surgical tumor resection was carried out in a median of 4 days (range 0–8 days) after the pRT. Samples were collected from biopsy ($n= 96$), primary cancer or surgically resected tumor ($n=136$), adjacent normal mucosa ($n= 79$), and distant normal mucosa ($n=119$). The distant normal mucosa was taken from proximal or distal margin (4–35 cm from the primary tumor) of the resected rectum and was histologically free from tumor. The adjacent normal mucosa sample was taken adjacent to the primary tumor and was histologically free from tumor. None of the patients received preoperative or adjuvant chemotherapy. The mean follow-up period or the patients was 107 months (range 0–309 months).

The whole dataset has 235 images, where the size for each image is about $2500 \times 2700$ pixels. The image subsets consist of 40 and 14 images of biopsy without pRT having survival rate greater and less than 5 years, respectively; 32 and 11 images of biopsy with pRT having survival rate greater and less than 5 years, respectively; 54 and 25 images of tumor without pRT having survival rate greater and less than 5 years, respectively; and 36 and 23 images of tumor with pRT having survival rate greater and less than 5 years, respectively. Details about the procedure for the image extraction and the publicly available dataset for investigating the prediction and prognosis of DNp73 were described in¹⁹.

Figure 3 shows selected samples of IHC stained images of the rectal cancer obtained for biopsy and primary tumor samples with and without pRT.

Image vectorization

Because LSTM networks were designed for learning order dependence in time series or sequential data, the conversion of time-independent images into time series is necessary for extracting sequential features that will be used as input into the network. To extract features of the time series of histopathological images in time-frequency and time-space domains, the color (3D) images were first converted into grayscale (2D) images. Using the grayscale images, the vectorization of a 2D image or matrix is a linear transformation that converts the image into a column vector. Specifically, the vectorization of an $M \times N$ image I, denoted as J, is the $MN \times 1$ column vector obtained by stacking the columns of the image I on top of one another, giving

$$\begin{aligned} J = [I_{11}, \dots , I_{M1}, I_{12}, \dots , _{M2}, I_{1N}, \dots , I_{MN}]^T. \end{aligned}$$

(1)

Thus, J results in a time series of image I, which is ready for feature extraction in time-frequency and time-space domains, which are described in the following sections.

LSTM learning with time-frequency and time-space features of images

LSTM networks²⁰, which are a special type of the recurrence neural network (RNN), are mainly applied for classifying time series or sequential data such as language modeling and speech-to-text translation^21,22. The LSTM network was developed to solve the vanishing gradients problem encountered by the RNN. The essence of the LSTM design is the introduction of data-dependent configurations into the RNN cells to prevent the gradient of its objective function from vanishing during the data training²⁰. As a result of the redesign, LSTM networks can be more robust and versatile than RNNs²³.

As an extension of applications of the time-frequency and time-space LSTM (TF-TS LSTM) recently introduced in literature for classifying physiological signals¹⁵, images of histopathological tissue are vectorized into time series whose TF and TS features can be extracted for learning by the TF-TS LSTM. The extractions of the TF features that are the instantaneous frequency (IF) and spectral entropy (SE), and the TS features that are the fuzzy recurrence image entropy (FRIE) and fuzzy recurrence entropy (FRE) were described in detail in¹⁵. Details on how TF and TS features are used for learning by the TF-TS LSTM can also be found in¹⁵.

The procedure for implementing the TF-TS LSTM for image classification is as follows.

1.
Vectorize an image into time series.
2.
Extract IF and SE from the time series.
3.
Construct the fuzzy recurrence plot (FRP) of the time series.
4.
Extract FRIE and FRE of the FRP.
5.
Train the LSTM with the extracted TF and TS features.

The IF of a time series, which is the average of frequencies f over time instant t, is expressed as

$$\begin{aligned} IF(t) = \frac{\int _{-\infty }^{\infty } f P(t,f) df}{\int _{-\infty }^{\infty } P(t,f) df}, \end{aligned}$$

(2)

where P(t, f) is the power spectrum.

Because Eq. (2) applies to infinitely long signal, the IF for a signal of a finite length needs to be numerically estimated. A method for estimating the IF and adopted herein was described in¹⁵.

The SE at time t is computed as

$$\begin{aligned} SE = - \sum _{m=1}^N p(t,m) \log _2 p(t,m). \end{aligned}$$

(3)

where the probability at time t and frequency m is

$$\begin{aligned} p(t,m) = \frac{P(t,m)}{\sum _f P(t,f)}, \end{aligned}$$

(4)

in which $f \in [0, fs/2]$, and fs is the sampling frequency.

Next, given a time series, an embedding dimension, and a time delay, the phase space of the corresponding dynamical system can be constructed and represented with a collection of vectors ${\mathbf{X}} = (\mathbf {x}_1, \dots , \mathbf {x}_N)$. Elements of an FRP are defined as²⁴

$$\begin{aligned} FRP(i,j) = \mu (\mathbf {x}_i,\mathbf {x}_j), \, i, j = 1, \dots , N, \end{aligned}$$

(5)

where $\mu (\mathbf {x}_i,\mathbf {x}_j) \in [0, 1]$ is the fuzzy membership of similarity between $\mathbf {x}_i$ and $\mathbf {x}_j$.

By partitioning ${\mathbf{X}}$ into c clusters using the fuzzy c-means algorithm²⁵, an FRP has the following three properties:

1.
Reflexivity:
$$\begin{aligned} \mu (\mathbf {x}_i,\mathbf {x}_i) = 1, \, i=1, \dots , N. \end{aligned}$$
(6)
2.
Symmetry:
$$\begin{aligned} \mu (\mathbf {x}_i,\mathbf {v}_q) = \mu (\mathbf {v}_q,\mathbf {x}_i), \, i = 1, \dots , N, q = 1, \dots , c. \end{aligned}$$
(7)
3.
Transitivity:
$$\begin{aligned} \mu (\mathbf {x}_i,\mathbf {x}_j) = \max [\min \{\mu (\mathbf {x}_i,\mathbf {v}_q), \mu (\mathbf {x}_j,\mathbf {v}_q)\}], q = 1, \dots , c. \end{aligned}$$
(8)

The FRIE is then computed as

$$\begin{aligned} FRIE = - \sum _{l=1}^L p_l \log _2 p_l, \end{aligned}$$

(9)

where L is the number of gray levels of the FRP (represented by the fuzzy membership grades), and $p_l$ is the probability associated with gray level l.

Using the concept of the entropy of a fuzzy set²⁶, the FRE of an FRP is calculated as²⁷

$$\begin{aligned} FRE = \sum _{i=}^N \sum _{j=1}^N - \mu (\mathbf {x}_i,\mathbf {x}_j) \, \log _2 \mu (\mathbf {x}_i,\mathbf {x}_j) - [1-\mu (\mathbf {x}_i,\mathbf {x}_j)] \, \log _2[1-\mu (\mathbf {x}_i,\mathbf {x}_j)]. \end{aligned}$$

(10)

Class modeling and performance measures

Deep-learning based class modeling

In machine learning, methods for class modeling or one-class classification²⁸ aim to recognize samples of a particular class among all other class samples by focusing on the learning from a training set containing only the samples belonging to that class. This approach is different from conventional classification methods, which learn to differentiate two or more classes with training data containing samples from all the classes.

In this study, the bi-LSTM network was used for the class modeling. The network learned samples from a particular class while creating an equal number of samples for other class(es) from a single sample for each of other class(es). This design was based on the capability of the bi-LSTM network to enhance its learning from multiple copies of the same sample. Not only this design of class modeling can enhance the learning capability of the network, it can also address the problem of data imbalance often encountered in the classification of histopathological images²⁹. This deep-learning approach is referred to as image-LSTM.

The network layer was specified with an output size = 100, fully connected layer = number of classes, and, while the sigmoid function is used for the LSTM gate activation, the network ends with a fully connected layer, a softmax layer, and a classification output layer. Training options of the bi-LSTM were set as optimizer = ‘Adam’ (adaptive moment estimation), including $L_2$ regularization factor, maximum number of epochs = 80 for the 2-class classification of the H&E CRC, H&E heart-failure, and IHC rectal cancer data, and 180 for the 8-class classification of the H&E CRC data, minimum batch size = 150, initial learning rate = 0.01, and gradient threshold = 1.

To compute the instantaneous frequency and spectral entropy, the sampling frequency fs = 300 Hz, and frequency range = [0, fs/2]. In this study, the value for the sampling frequency is arbitrarily chosen to extract the time-frequency features of the transformed time series of the images, but commonly used for physiological signals (https://physionet.org/challenge/2017/). To compute the Shannon entropy and fuzzy entropy of an FRP, embedding dimension = 1, time delay = 1, and number of clusters = 3. For consistent input to the bi-LSTM, the length of the sequences of the time-space features (FRIE and FRE) was designed to match with that of the time-frequency (IF and SE) sequences by segmenting the transformed time series into the number of segments being equal to the length of the time-frequency sequences. Each segment of the time series was then sequentially processed to obtain the time-space sequences.

Figure 4 shows the architecture and sequential procedure of the image-LSTM for classifying histopathological images.

Evaluation metrics

Accuracy (ACC), sensitivity (SEN), specitivity (SPE), precision (PRE), and $F_1$ score were adopted as statistical measures of the tissue classification performance. ACC is defined as

$$\begin{aligned} \text {ACC} = \frac{\hbox{TP+TN}}{\hbox{P+N}}, \end{aligned}$$

(11)

where TP, TN, P, and N refer to true positive, true negative, condition positive, and condition negative, respectively. In this study, for each tissue class considered as positive, TP is the number of positive samples correctly classified as positive, TN the number of correctly classified negative samples, P the total number of positive samples, and N the total number of negative samples.

SEN is defined as

$$\begin{aligned} \text {SEN} = \frac{\hbox{TP}}{\hbox{P}}. \end{aligned}$$

(12)

SPE is defined as

$$\begin{aligned} \text {SPE} = \frac{\hbox{TN}}{\hbox{N}}. \end{aligned}$$

(13)

PRE is calculated as

$$\begin{aligned} \text {PRE} = \frac{\hbox{TP}}{\hbox{TP+FP}}. \end{aligned}$$

(14)

$F_1$ score is the harmonic mean of precision and sensitivity and determined as

$$\begin{aligned} F_1 = \frac{\hbox{2TP}}{\hbox{2TP+FP+FN}}, \end{aligned}$$

(15)

where FP is the number of negative samples incorrectly classified as positive, and FN the number of positive samples incorrectly classified as negative.

For the prediction and prognosis of DNp73 in rectal cancer, survival rates of more and less than 5 years are defined as true positive rate (TPR) and true negative rate (TNR), respectively, which are calculated as

$$\begin{aligned} \text {TPR}= & {} \frac{\hbox{TP}}{\hbox{P}}. \end{aligned}$$

(16)

$$\begin{aligned} \text {TNR}= & {} \frac{\hbox{TN}}{\hbox{N}}. \end{aligned}$$

(17)

Results

To carry out the image data pre-processing for sequential feature extraction described earlier, because of the large size of the IHC slides of rectal cancer, these images were resized to $250 \times 250 \times 3$ pixels to reduce the time for subsequent computations and network training. Figures 5, 6, and 7 show examples of the conversions of images into time series, time-frequency features, and time-space features using H&E CRC, H&E heart-failure, and IHC rectal cancer datasets, respectively. These features of the images were combined to constitute multi-dimensional sequences and used as the input into the bi-LSTM for learning and classification.

Tables 1 and 2 show the classification results obtained from different methods using the H&E CRC and heart-failure data obtained as averages of ten runs of tenfold cross-validation (CV), H&E heart-failure data as averages of ten runs of twofold (for comparison with previously reported results) and threefold CVs (for comparison with previously reported results), and IHC rectal-cancer data as averages of ten runs of tenfold CV, respectively.

Table 1 Classification results.

Full size table

Table 2 Prediction and prognosis of DNp73: IHC rectal-cancer data.

Full size table

For the 2-classification (stroma and tumor) of the H&E CRC data, the image-LSTM provided perfect results in terms of accuracy, sensitivity, specificity, precision, and $F_1$ score, where applicable, and outperformed other classification methods (support vector machines and histogram = SVM-histogram, and support vector machines and local binary patterns = SVM-LBP) using the same dataset reported in literature¹⁶.

Similarly, the image-LSTM outperformed the random forest (RF) and convolutional neural network (CNN) classifiers studied in¹⁷ in testing the H&E heart-failure dataset using twofold and threefold cross-validation procedures for evaluating the performance of classifiers.

Once again, the image-LSTM provided excellent and better results (100%) than many pre-trained convolutional neural network (CNN) models recently reported¹⁹ for classifying the biopsy and tumor samples with and without pRT in terms of accuracy, and survival rates > and $\le $ 5 years.

To examine how quickly the image-LSTM accuracy was improving, and whether the network trainings could be overfitted with the training data, Fig. 8 shows the plots of the model training histories in terms of accuracy and loss using six different datasets for the tenfold CV. In general, if the loss keeps decreasing, the model could be overfitted; and when the loss becomes equal (decreasing to a point of stability), the model is considered either to be of a good fit or reaches a local minimum. By observing the plots for different datasets, the model quickly improved the accuracy and all the trainings did not result in overfitting.

Discussion

Regarding the classification of the H&E CRC data, there are much larger differences in the average accuracy obtained from the SVM-histogram (about 15% difference) and SVM-LPB (about 20% difference) classifiers for 2-class and 8-class classifications, where the latter case is lower than the former. The average accuracy provided by the image-LSTM classifier for the 8-class classification (99.96%) is almost the same as for the 2-class classification (100%). The better performance of the image-LSTM suggests that while the sequential features of the time series transformed from the original images helped increase the differentiation of the class properties, the formation of multi-dimensional sequential features allows the enhancement of the deep learning of long-term dependencies in more than one dimension. In fact, an important task of an LSTM is to decide what information to forget and keep from the cell state, and these two things are combined to make an update to the state. The time-frequency and time-space features enabled the updating process more effective, resulting in a better classifier.

The transformation of images into time series for extracting time-frequency and time-space features and class modeling using the bi-LSTM have shown the classification design effective as the results obtained from the image-LSTM outperformed the CNN, which is also a deep-learning model and used the direct input of the images, in both twofold and threefold cross-validations.

H&E stained slides are used by pathologists for disease diagnosis and patient treatment. Images of cancer tissue are routinely examined by pathologists for cancer type identification and prognosis. The power of the precise classification of H&E stained images provided by the image-LSTM offers a useful AI tool to assist pathologists in carrying out timely analysis of large volumes of images.

The prediction and prognosis power of the protein marker DNp73 in rectal cancer can be discovered with the use of the image-LSTM for differentiating both biopsy and tumor samples of the rectal-cancer patients between with and without pRT. Recent results¹⁹ showed the usefulness of several pre-trained CNN models for discovering the function of DNp73 in rectal cancer. In comparison with the pre-trained CNN models, the present classification results obtained from the image-LSTM not only confirm the recent findings but also provide a better AI tool for clinical decision making and accurate forecast of the future course of the cancer under pRT.

The high performance of the image-LSTM in classifying IHC slides can be very useful for timely biomarker discovery. Current practice in pathology relies on manual scoring of protein expression mainly based on colors to assess the capability of prediction and prognosis of the candidate protein as a biomarker. The classification accuracies based on the expressions of DNp73 with pRT and without pRT suggest the power of the present AI method with an implication that if the examination of a tissue taken from a rectal cancer patient is predicted to have a short survival ($\le $ 5 years), then the clinical decision would be to recommend the patient to be treated with pRT³⁰.

While traditional IHC analysis did not provide any predictive and prognostic information of DNp73 expression in the rectal cancer patients without or with pRT¹⁹, both predictive and prognostic power of DNp73 expression can be discovered by the current AI approach. Such a discovery is very useful for optimal treatment and clinical decision making in pRT. The AI approach reported herein is not only useful for studying the protein expression in rectal cancer patients but can also be applied for discovering biomarkers in other types of cancer. A certain advantage discovered from the findings regarding the computer implementation of the AI is that accurate classification results can be achieved without the need for the manual extraction of regions of interest from whole-slide images. Such a freedom from the manual analysis is particularly useful for guaranteeing objective and reproducible results as well as alleviating time-consuming work to help life-science researchers focus on more important aspects in their study³⁰.

Conclusion

An approach for classifying histopathological images by feeding the LSTM with sequential features extracted from time-frequency and time-space domains together with the class modeling has been presented and discussed in the foregoing sections. The key attribute is the capture of effective sequential features of texture-rich images in pathology that can leverage the power of the LSTM in learning sequential characteristics for pattern recognition. Classification results obtained from testing the new approach with both H&E and IHC image data outperformed other baseline methods. The image-LSTM appears to be a useful AI tool in the big data analysis of digital pathology for disease diagnosis, prognosis, and biomarker discovery, where the effective handling of big data can significantly contribute to modern healthcare systems³¹. The tool presented herein can be applied for classifying other types of microscope images of cells or tissues.

Data availability

The computer software written in MATLAB for the image-LSTM tool is publicly available at the author’s personal homepage: https://sites.google.com/view/tuan-d-pham/codes.

References

Gurcan, M. N. et al. Histopathological image analysis: a review. IEEE Rev. Biomed. Eng. 2, 147–171 (2019).
Article Google Scholar
Bartels, P. H., Thompson, D., Bibbo, M. & Weber, J. E. Bayesian belief networks in quantitative histopathology. Anal. Quant. Cytol. Histol. 14, 459–473 (1992).
PubMed CAS Google Scholar
Hamilton, P. W., Anderson, N., Bartels, P. H. & Thompson, D. Expert system support using Bayesian belief networks in the diagnosis of fine needle aspiration biopsy specimens of the breast. J. Clin. Pathol. 47, 329–36 (1994).
Article CAS Google Scholar
Hattel, A. et al. Development and evaluation of an automated histology classification system for veterinary pathology. J. Vet. Diagn. Invest. 25, 765–769 (2003).
Article Google Scholar
Fink, O. et al. Potential, challenges and future directions for deep learning in prognostics and health management applications. Eng. Appl. Artif. Intell.92, 103678 (2020).
Komura, D. & Ishikawa, S. Machine learning methods for histopathological image analysis. Comput. Struct. Biotechnol. J. 16, 34–42 (2018).
Article CAS Google Scholar
Dimitriou, N., Arandjelovic, O. & Caie, P. D. Deep learning for whole slide image analysis: an overview. Front. Med. 6, 264 (2019).
Article Google Scholar
Sharma, H., Zerbe, N., Klempert, I., Hellwich, O. & Hufnagl, P. Deep convolutional neural networks for automatic classification of gastric carcinoma using whole slide images in digital histopathology. Comput. Med. Imaging Graph. 61, 2–13 (2017).
Article Google Scholar
Wang, S. et al. Comprehensive analysis of lung cancer pathology images to discover tumor shape and boundary features that predict survival outcome. Sci. Rep. 8, 10393 (2018).
Article ADS CAS Google Scholar
Coudray, N. et al. Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning. Nat. Med. 24, 1559–1567 (2018).
Article CAS Google Scholar
Gertych, A. et al. Convolutional neural networks can accurately distinguish four histologic growth patterns of lung adenocarcinoma in digital slides. Sci. Rep. 9, 1483 (2019).
Article ADS CAS Google Scholar
Raczkowski, L., Mozejko, M., Zambonelli, J. & Szczurek, E. ARA: accurate, reliable and active histopathological image classification framework with Bayesian deep learning. Sci. Rep. 9, 14347 (2019).
Article ADS CAS Google Scholar
Sitaula, C. & Aryal, S. Fusion of whole and part features for the classification of histopathological image of breast tissue. Health Inf. Sci. Syst. 8, 38 (2020).
Article Google Scholar
Xu, H., Park, S. & Hwang, T. H. Computerized classification of prostate cancer Gleason scores from whole slide images. IEEE/ACM Trans. Comput. Biol. Bioinform. 17, 1871–1882 (2020).
Article Google Scholar
Pham, T. D. Time-frequency time-space LSTM for robust classification of physiological signals. Sci. Rep. 11, 6936 (2021).
Article ADS CAS Google Scholar
Kather, N. J. et al. Multi-class texture analysis in colorectal cancer histology. Sci. Rep. 6, 27988 (2016).
Article ADS CAS Google Scholar
Nirschl, J. J. et al. A deep-learning classifier identifies patients with clinical heart failure using whole-slide images of H&E tissue. PLoS One13, e0192726 (2019).
Trial, Swedish Rectal Cancer. Improved survival with preoperative radiotherapy in resectable rectal cancer. N. Engl. J. Med. 8, 980–987 (1997).
Google Scholar
Pham, T. D., Fan, C., Zhang, H. & Sun, X. F. Artificial intelligence-based 5-year-survival prediction and prognosis of DNp73 expression in rectal cancer patients. Clin. Transl. Med.10, e159 (2020).
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9, 1735–1780 (1997).
Article CAS Google Scholar
Soutner, D, & Muller, L. Application of LSTM Neural Networks in Language Modelling. In: Habernal I., Matousek V. (eds) Text, Speech, and Dialogue. TSD 2013. LNCS, vol. 8082. Springer, Berlin, 2013. https://doi.org/10.1007/978-3-642-40585-3_14
Lin, H. W. & Tegmark, M. Critical behavior in physics and probabilistic formal languages. Entropy 19, 299 (2017).
Article ADS Google Scholar
Sherstinsky, A. Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network. Physica D: Nonlinear Phenomena404, 132306 (2020).
Pham, T. D. Fuzzy recurrence plots. EPL 116, 50008 (2016).
Article ADS CAS Google Scholar
Bezdek, J. C. Pattern recognition with fuzzy objective function algorithms (Plenum Press, New York, 1981).
Book Google Scholar
de Luca, A. & Termini, S. A definition of a nonprobabilistic entropy in the setting of fuzzy sets theory. Inf. Control 20, 301–312 (1972).
Pham, T. D. Fuzzy recurrence entropy. EPL 130, 40004 (2020).
Article ADS CAS Google Scholar
Oliveri, P. Class-modelling in food analytical chemistry: development, sampling, optimisation and validation issues-A tutorial. Anal. Chim. Acta 982, 9–19 (2017).
Article CAS Google Scholar
Singh, R. et al. Imbalanced breast cancer classification using transfer learning. IEEE/ACM Trans. Comput. Biol. Bioinform. 18, 83–93 (2021).
Article CAS Google Scholar
Pham, T.D., Fan, C., Zhang, H., & Sun, X. F. AI-based five-year-survival prediction and prognosis of DNp73 expression in rectal cancer patients, Research Square, preprint at https://doi.org/10.21203/rs.3.rs-44401/v1(2020).
Raghupathi, W. & Raghupathi, V. Big data analytics in healthcare: promise and potential. Health Inf. Sci. Syst. 2, 3 (2014).
Article Google Scholar

Download references

Author information

Authors and Affiliations

Center for Artificial Intelligence, Prince Mohammad Bin Fahd University, Khobar, 31952, Saudi Arabia
Tuan D. Pham

Authors

Tuan D. Pham
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.D.P. conceptualized, designed the study, implemented the methods, and carried out the computer experiments. T.D.P. wrote the manuscript.

Corresponding author

Correspondence to Tuan D. Pham.

Ethics declarations

Competing interests

The author declares no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Pham, T.D. Time-frequency time-space long short-term memory networks for image classification of histopathological tissue. Sci Rep 11, 13703 (2021). https://doi.org/10.1038/s41598-021-93160-5

Download citation

Received: 29 April 2021
Accepted: 22 June 2021
Published: 01 July 2021
DOI: https://doi.org/10.1038/s41598-021-93160-5

This article is cited by

Pre-training in Medical Data: A Survey
- Yixuan Qiu
- Feng Lin
- Miao Xu
Machine Intelligence Research (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.