Traffic flow prediction using bi-directional gated recurrent unit method

Wang, Shengyou; Shao, Chunfu; Zhang, Jie; Zheng, Yan; Meng, Meng

doi:10.1007/s44212-022-00015-z

Traffic flow prediction using bi-directional gated recurrent unit method

Original Article
Open access
Published: 01 December 2022

Volume 1, article number 16, (2022)
Cite this article

Download PDF

You have full access to this open access article

Urban Informatics Aims and scope Submit manuscript

Traffic flow prediction using bi-directional gated recurrent unit method

Download PDF

Shengyou Wang¹,
Chunfu Shao²,
Jie Zhang³,
Yan Zheng⁴ &
…
Meng Meng ORCID: orcid.org/0000-0001-7240-6454⁵

3226 Accesses
10 Citations
1 Altmetric
Explore all metrics

Abstract

Traffic flow prediction plays an important role in intelligent transportation systems. To accurately capture the complex non-linear temporal characteristics of traffic flow, this paper adopts a Bi-directional Gated Recurrent Unit (Bi-GRU) model in traffic flow prediction. Compared to Gated Recurrent Unit (GRU), which can memorize information from the previous sequence, this model can memorize the traffic flow information in both previous and subsequent sequence. To demonstrate the model’s performance, a set of real case data at 1-hour intervals from 5 working days was used, wherein the dataset was separated into training and validation. To improve data quality, an augmented dickey-fuller unit root test and differential processing were performed before model training. Four benchmark models were used, including the Autoregressive Integrated Moving Average (ARIMA), Long Short-Term Memory (LSTM), Bidirectional Long Short-Term Memory (Bi-LSTM), and GRU. The prediction results show the superior performance of Bi-GRU. The Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE), and Mean Absolute Error (MAE) of the Bi-GRU model are 30.38, 9.88%, and 23.35, respectively. The prediction accuracy of LSTM, Bi-LSTM, GRU, and Bi-GRU, which belong to deep learning methods, is significantly higher than that of the traditional ARIMA model. The MAPE difference of Bi-GRU and GRU is 0.48% which is a small prediction error value. The results show that the prediction accuracy of the peak period is higher than that of the low peak. The Bi-GRU model has a certain lag on traffic flow prediction.

Deep Bi-directional Long Short-Term Memory Model for Short-Term Traffic Flow Prediction

Traffic Flow Prediction Model and Performance Analysis Based on Recurrent Neural Network

Traffic Flow Prediction Using Deep Learning Techniques

1 Introduction

Accurate short-term traffic flow prediction is crucial for transportation management. The main task of short-term traffic flow prediction is to forecast the next step of traffic flow based on historical traffic data (Nagy & Simon, 2018; Wu et al, 2018). The results from traffic flow prediction are served as a reference for both travel demand analysis and operation strategy development. Moreover, traffic flow prediction model is also one of the key components for smart transportation systems. Having the accurate predicted results is the fundamental basis to decide how to guide the optimal travel routes for traveler so that to reduce the traffic congestion and improve the traffic efficiency and safety.

Some classical traffic flow prediction models including Historical Average (HA) model, Autoregressive Integrated Moving Average (ARIMA) model (Van Der Voort et al, 1996), and Linear regression (LR) (Haoyi & Jing, 2011) have been well applied in practice. These models are simple in structure but have the disadvantage that the fluctuations in traffic flow can significantly affect the prediction performance. Some scholars have applied Support Vector Regression (SVR) model (Cheng et al, 2017; Wei & Liu, 2013) and Back Propagation Neural Network (BPNN) model (Kumar & Katiyar, 2013) in traffic flow prediction. These models have good applicability to complex situation, but also have certain drawbacks like complex model structure, large computational effort, and difficulty in determining model parameters.

In recent years, the rapid development of data collection and computing technologies has greatly improved the performance of short-term traffic flow prediction (Chen, Liu, et al., 2021; Wang et al, 2022). Deep learning method, which is one of the current leading techniques for short-term traffic flow prediction, has been widely proposed and applied in ITS. Deep learning method can be divided into tress branches. One branch is designed for mining spatial characteristics, such as Convolution Neural Network (CNN) (Cao & Wang, 2019), Graph Convolutional Networks (GCN) (Wang et al, 2022). One is proposed for extracting the temporal characteristics, such as Recurrent Neural Network (RNN) (Duives et al, 2019), Long Short Term Memory (LSTM) neural network, Bidirectional Long Term Memory (Bi-LSTM) neural network (Ma et al, 2022), Gate Recurrent Unit (GRU) neural network (Shu et al, 2022); Others have the generative adversarial network (GAN) (Zhu et al, 2020), AutoEncoder (Wei et al, 2019). Since short-term traffic flow prediction is estimated by learning the temporal characteristics of historical observation, which belongs to the time series prediction problem, we mainly focus on time series-related deep learning methods.

Among the methods above, RNN is proposed firstly for the extraction non-linear features of time sequences, and has the advantage of strong memory, and sharing parameters, which have been proven to greatly improve the accuracy of short-term traffic flow prediction compared with the traditional parameters models, such as HA, ARIMA and the Kalman Filter (KF) model (Gu et al, 2019; Tedjopurnomo et al, 2022). However, RNN has the disadvantage of gradient disappearance and gradient explosion, which cannot learn the long-term dependencies of the time-series data well. Therefore, to address the problem of RNN, the variant of RNN, LSTM is proposed by adding three gates (e.g. input gate, out gate, forget gate) in the hidden layer of RNN to control the retention and forgetting of information, and further memory the long-term (e.g. 12 hour, 1 day) and short-term (e.g. 1 hour, 2 hours) information of time series (Bogaerts et al, 2020; Kim & Lee, 2022; Yang, Chen, et al., 2019). Therefore, LSTM can memory more comprehensive time characteristics than RNN. LSTM has been applied to short-term traffic flow prediction, and has shown more effective prediction performance than RNN (Yang, Sun, et al., 2019). Subsequently, the Bi-LSTM is proposed combined two LSTM layers in opposite directions for mining the sequential and inverse-order time series information, which has been applied in traffic flow prediction (Ma et al, 2022). However, since the LSTM and Bi-LSTM have three gates in the hidden layer, which requires a large number of parameters and time for training and fitting, scholars streamlined the complex structure of the LSTM model and proposed GRU with two gates of hidden layer to improve the model efficiency (Shahid et al, 2020). Researchers applied the GRU on traffic flow prediction and showed a higher prediction efficient than LSTM, and a higher accuracy rate than RNN (Sun & Tao, 2020). Similar to the Bi-LSTM, Bidirectional Gated Recurrent Unit (Bi-GRU) neural network (Huang et al, 2021), which consists of two GRU layers with opposite directions, was subsequently proposed and proved to be effective in the natural language domain (Li et al, 2020). However, few literatures have applied the Bi-GRU in short-term traffic flow prediction to demonstrate its prediction performance.

As reviewed, Bi-GRU, LSTM, Bi-LSTM, and GRU belong to recurrent neural networks which play a key role in the field of time series prediction. When designing a combined model of traffic flow prediction, Bi-GRU, LSTM, Bi-LSTM, and GRU could be selected as the parts for extracting temporal characteristics. However, do these models have some similar prediction results? What are the differences in the prediction performance of these models for the traffic flow prediction problem? Is there a model among these models recommended for urban managers in terms of traffic flow prediction? To answer these questions, we calibrate and validate the Bi-GRU model using real traffic flow data, and test the prediction performance in two scenarios. In summary, the contribution of this paper is twofold: on the one hand, we apply the Bi-GRU model in short-term traffic flow prediction and discuss its prediction performance compared with LSTM, Bi-LSTM, and GRU models. On the other hand, we further explored the performance of Bi-GRU for short-term traffic flow prediction model in peak and low-peak periods on each road sections, which can demonstrate the results from both spatial and temporal aspects. The research outcome will provide references for researchers and managers in selecting traffic flow prediction models associated with recurrent neural networks.

The rest of the paper is organized as follows: Section 2 summarizes the existing literature. Section 3 explains the proposed model in detail. And in section 4, we describe the experimental data and model evaluation methods. In Section 5, the predictive performance of the proposed model is evaluated and the model output is analyzed. Finally, Section 6 presents the main conclusions and future directions of this study.

2 Literature review

2.1 Short-term traffic flow prediction

As a hot research topic in the transportation system, short-term traffic flow prediction methods have achieved rich research results in the past 50 years (Nagy & Simon, 2018). The methods of short-term traffic flow prediction can be broadly classified into traditional parametric, and non-parametric models (Kaffash et al, 2021).

The traditional parametric models have been widely applied to address short-term traffic flow prediction problems (Wu et al, 2014). In general, parametric models assume that the time-varying traffic flows obey one or several distributions and predict traffic flow by parameter fitting. Among parametric models, the ARIMA model (Williams, 2001), LR and KF model (Xie et al, 2007) have satisfactory applicability to traffic flow prediction problems. For example, Yao et al (2016) combined the K-nearest neighbors (KNN) method and the KF technique to dynamically predict real-time traffic flow. In the numerical test, the proposed model performed better than a single KNN model. Li (2020) applied the multiple linear regression model for short-term traffic flow prediction in urban. The experimental results showed that, compared with decision tree methods (Kamiński et al, 2018), The proposed model has a higher prediction accuracy of 98.48% and a shorter prediction time, always less than 0.7 seconds. In addition, some scholars have combined multiple parametric models to improve traffic flow prediction accuracy. For example, Xu et al (2017) combined KF with ARIMA to achieve traffic flow state prediction of road sections.

Although parametric models have better prediction accuracy compared with statistical models, they still cannot fully adapt to the strong randomness of traffic flow. With the rapid development of computer technology, non-parametric models have gradually occupied the dominant position in the field of short-term traffic flow prediction. Models such as K-Nearest Neighbor (KNN) model (Luo et al, 2019; Zhang et al, 2013), SVR model (Zhang et al, 2018), BPNN model, Fuzzy Neural Networks (FNN) models (Moretti et al, 2015) have been proven to give promising results in traffic flow prediction problems. For example, Sun et al (2018) proposed a fully automatic dynamic procedure KNN to predict traffic flow. The results show that the proposed model performed better than the normal KNN and seasonal ARIMA (Shu, 2005) in terms of accuracy on average. Li and Xu (2021) applied SVR for the short-term traffic flow prediction. The results were obtained from experiments that the prediction error rate was the lowest (3.22%) compared with RF and Adboost (Kanduri et al, 2018). Zhang and Qu (2021) proposed a GA-BPNN model combining an adaptive genetic algorithm (Li et al, 2004) and BPNN to predict short-term traffic flow. The results show that the average prediction error of the proposed algorithm is about 1%, and the computational accuracy is better compared with that of a single BPNN.

2.2 Recurrent neural network variants

As mentioned, short-term traffic flow prediction belongs to a time series prediction problem and the deep learning method of RNN is designed to deal with this problem and has a wide range of applications (Zhang et al, 2014). For example, Chen et al (2020) proposed an attention-based RNN model for multi-step traffic flow prediction. Experimental results demonstrated that the proposed model had good performance compared to the KNN, and sequence to sequence (seq2seq) (Zhang et al, 2019). However, RNN has several disadvantages such as gradient disappearance, gradient explosion. To deal with these problems of RNN, scholars proposed many variants based on RNN, where LSTM, Bi-LSTM, GRU, and Bi-GRU have a wide application (Tedjopurnomo et al, 2022; Zhang et al, 2021).

LSTM modifies the hidden layer of RNN to gain the advantage of long-term memory, and had been introduced to short-term traffic prediction. Yang, Chen, et al. (2019) introduced the LSTM model for short-term traffic flow prediction, and the results showed that the proposed model had certain competitiveness in short-term traffic flow predictions. Xiao and Yin (2019) proposed a hybrid LSTM neural network to predict traffic flow. The results found that the prediction error of the proposed model was less than KF and SVR. For short-term traffic flow prediction, Zheng et al (2021) proposed a deep learning based model combined the convolutional neural network (CNN) and the LSTM to extract the spatial and short-term temporal features. Extensive experimental results showed that the proposed model achieved better prediction performance compared with SVR.

Similar to LSTM, Bi-LSTM extracts the temporal characteristics by two LSTM layers in opposite directions, which have been applied in short-term traffic flow prediction. Abduljabbar et al (2021) introduced the Bi-LSTM model for short-term traffic flow prediction. The results showed that the Bi-LSTM performed better than LSTM. Li et al (2021) introduced Bi-LSTM for traffic flow prediction, and applied GRU and LR in the experiment as a comparison. The experimental findings demonstrated that the Bi-LSTM model worked best in predicting traffic flow, achieving an accuracy of 92% when temporal differences were taken into account. Xing and Liu (2022) constructed a data fusion powered Bi-LSTM model for traffic flow prediction. The results showed that the proposed model produced more accurate predictions compared with LSTM, Bi-LSTM, GRU. However, he did not evaluate the prediction performance of Bi-GRU.

GRU is another variant of RNN with fewer parameters than LSTM, which has been introduced for short-term traffic flow prediction. For example, Zhang and Kabuka (2018) applied GRU model for short-term traffic flow prediction. The results showed that the GRU performed better than ARIMA, SVR, and RF. However, he did not evaluate the performance of GRU in comparison with LSTM, and Bi-LSTM. Wang et al (2020) applied GRU and LSTM at the same time for truck traffic flow prediction. The results showed that LSTM and GRU have superior performance compared to SVR and ARIMA. In addition, The overall accuracy of LSTM was 4.10% higher than that of GRU. Dai et al (2019) applied GRU for short-term traffic flow prediction. The results showed that the proposed method outperformed CNN in terms of accuracy and stability. However, he did not test the performance of LSTM.

Compared to LSTM, Bi-LSTM, and GRU, Bi-GRU is the latest to be proposed which combined with two GRU layers in opposite directions. Bi-GRU has been introduced in many prediction problems (e.g. wind power prediction, COVID-19 cases prediction, oil rate prediction). For example, Chen, Qi, et al. (2021) applied Bi-GRU to predict wind power, and the results proved its superior prediction performance compared with LSTM and GRU. Ahuja et al (2022) used CNN and stacked Bi-GRU to predict the COVID-19 cases. The experimental result showed that the proposed model was highly reliable over the gaussian process regression model (Schulz et al., 2018). Li et al (2022) proposed a framework using Bi-GRU and sparrow search algorithm (Zhang et al, 2022) to improve the accuracy of oil rate prediction. The observations showed that the proposed method performed better than RNN, LSTM, and GRU in terms of accuracy and robustness. Other than that, Shu et al (2022) have introduced the Bi-GRU for short-term traffic flow prediction and showed that the Bi-GRU performed better than LSTM. However, he did not discuss the model performance in comparison with GRU, and Bi-LSTM. Further, few scholars pay attention to the prediction performance of Bi-GRU during peak and low-peak periods of traffic flow.

In summary, traffic flow has complex temporal relationships, and scholars mainly use the data of time series information for prediction. Bi-GRU model has been proven the good performance in short-term traffic flow prediction, but few scholars evaluate the Bi-GRU prediction performance in comparison with LSTM, Bi-LSTM, GRU under the same dataset. Therefore, this paper introduce the Bi-GRU model to capture the temporal characteristics for traffic flow prediction, and discuss its prediction performance compared with LSTM, Bi-LSTM, and GRU models. Furthermore, we explored the performance of Bi-GRU for short-term traffic flow prediction model in peak and low-peak periods on each road sections, which can demonstrate the results from both spatial and temporal aspects. The discussion in this paper will provide some references for researchers and managers in selecting traffic flow prediction models associated with recurrent neural networks. In addition, accurate traffic flow forecasts will provide useful information for urban managers to take control measures and for residents to plan their travel routes.

3 Methodology

3.1 Bi-directional gated recurrent unit (bi-GRU) model

Bi-GRU model is a variant of RNN, which have capacities to memory long-term dependencies (e.g. 1 day traffic flow information at 1 hour interval) of time series data (Wang, Shao, et al., 2021). Short-term traffic flow prediction belongs to time series prediction problem, which indicates that Bi-GRU can be applied to short-term traffic flow prediction. Bi-GRU is composed of forward GRU and backward GRU. Compared with LSTM, GRU has a less complex structure and higher computational efficiency (Greff et al, 2017). The structures of the GRU is described by Cho et al (2014), which includes the input layer, the hidden layer, and the output layer. The hidden layer is composed of the reset gate and the update gate, which is used to control the information of traffic flow from the input layer at time t and the hidden layer at time t ‐ 1 (Agarap, 2018). We define the traffic flow input data of a road section as x_t. t = (1, 2, …, n), is the number of observed traffic flow records during the period indexed in time order, where the period implies the time length of the traffic flow data recorded. The output of GRU is defined as h_t, the output of reset is defined as r_t, and the output of update gate is defined as z_t. Moreover, the reset and update gates calculate the output h_t of the current moment by the joint control of the output h_t − 1 of the previous moment and the input x_t of the current moment. The equations of reset gate and update gate are shown in Eq. (1) and Eq. (2).

$${r}_t=\sigma \left({W}_r\cdot \left[{h}_{t-1},{x}_t\right]\right)$$

(1)

$${z}_t=\sigma \left({W}_z\cdot \left[{h}_{t-1},{x}_t\right]\right)$$

(2)

where, W_r and W_z are the weights of the reset gate and the update gate respectively, and σ is the Sigmoid function, where, σ(x) = 1/(1 + e^−x). The calculation equation of output h_t is shown in Eq. (3).

$${h}_t=\left(1-{z}_t\right)\times {h}_{t-1}+{z}_t\times {\tilde{h}}_t$$

(3)

where, ${\tilde{h}}_t$ is the candidate state of GRU at time t. The calculation of ${\tilde{h}}_t$ is shown in Eq. (4).

$${\tilde{h}}_t=\tanh \left({W}_h\cdot \left[{r}_t\times {h}_{t-1},{x}_t\right]\right)$$

(4)

where, W_h is the weight of the candidate state.

As mentioned, the Bi-GRU is constructed by two unidirectional GRUs facing opposing directions (Xiong et al, 2016). The forward GRU starts from the beginning of the time series data, and the backward GRU starts form the end of the time series data. The Bi-GRU is calculated by two GRU can be formulated as Eq. (5)–(7).

$${\overrightarrow{h}}_t={GRU}_{fwd}\left({x}_t,{\overrightarrow{h}}_{t-1}\right)$$

(5)

$${\overleftarrow{h}}_t={GRU}_{bwd}\left({x}_t,{\overleftarrow{h}}_{t-1}\right)$$

(6)

$${h}_t={\overrightarrow{h}}_t\oplus {\overleftarrow{h}}_t$$

(7)

where, ${\overrightarrow{h}}_t$ and ${\overleftarrow{h}}_t$ are the state information of the forward and backward GRU, respectively. GRU_fwd is the forward GRU, and GRU_bwd is the backward GRU, the GRU function is composed of Eq. (1) - Eq. (4). ⊕ denotes concatenating the ${\overrightarrow{h}}_t$ and ${\overleftarrow{h}}_t$. Therefore, the Bi-GRU with bi-directional GRU structures can memory the traffic flow information from historical and subsequent time series data.

3.2 Comparison models

In this paper, four benchmark methods including ARIMA, LSTM, GRU, and Bi-LSTM are selected for comparison, which have the ability to mine the temporal characteristics of time series data, and have been applied to short-term traffic flow prediction in existing literature (Shuai et al, 2022; Zhao et al, 2021). The process of the model comparison part among this paper can be seen in Fig. 1.

The details of the benchmark methods are described as follows:

(1).
Autoregressive Integrated Moving Average (ARIMA) is the representative model of the traditional parameters model, which can mine temporal features by statistical approaches for traffic flow prediction (Meng et al, 2015; Xu et al, 2017). Compared with Bi-GRU, we can explore whether the prediction performance of Bi-GRU is better than that of traditional models.
(2).
Long Short-Term Memory (LSTM) is the classical variant model based on RNN (Yang, Sun, et al., 2019), which is widely used and outperform in short-term traffic flow prediction. Compared with Bi-GRU, it only considers the influence of past information on the prediction time. Meanwhile, LSTM is more complex than GRU in structure.
(3).
Gated Recurrent Unit (GRU) is the classical variant model based on RNN (Zhang & Kabuka, 2018). Compared with LSTM, it has fewer parameters and compared with Bi-GRU, it only consider the influence of past information on the prediction time.
(4).
Bidirectional Long Short-Term Memory (Bi-LSTM) contains three gating units, which are input gate, output gate and forget gate (Ali et al, 2021). The input gate controls the input transmission of traffic flow, the forget gate controls whether the information of the memory module is transmitted, and the output gate is used to determine the output of the information. Bi-LSTM has been applied to short-term traffic flow prediction, and it has been proven to achieve a high prediction accuracy. Compared with Bi-LSTM, Bi-GRU has a simpler structure and has been proved to have a considerable prediction performance in natural language applications.

It should be noted that models (2)–(4) have the same basic parameters settings: the number of hidden neurons is set to 64, the batch size is set to 10, the number of iterations is set to 200, the learning rate is set to 0.01, and the time step is set to 9. In addition, these models use the Adam optimizer to minimize the loss function during training, and the early stopping is set to to prevent overfitting. Moreover, the deep learning packages of Tensorflow and Keras (Pang et al, 2020) are used. A general-purpose programming language is provided by Python 3.6. In addition, for ARIMA, the auto_arima is applied to automatically determine the most suitable parameters. The ‘auto_arima’ is the python package of ARIMA model, which can automatic find the optimal parameters for each road section by multiple calculating the error between predicted and observed traffic flow.

3.3 Evaluation metrics

An excellent prediction model needs to have the ability to accurately capture the temporal characteristics from the historical traffic flow information. In this research, we apply three evaluation metrics, including the Mean Absolute Error (MAE), the Mean Absolute Percentage Error (MAPE) and the Root Mean Square Error (RMSE) to evaluate the prediction performance of the proposed short-term traffic flow prediction model (Kumar & Katiyar, 2013; Xue & Xue, 2018; Meng, Chang, et al., 2022). The calculation of MAE, MAPE and RMSE are shown in Eq. (8)-Eq. (10).

$$RMSE=\sqrt{\frac{1}{n}\sum \limits_{i=1}^n{\left({\hat{y}}_i-{y}_i\right)}^2}$$

(8)

$$MAPE=\frac{100\%}{n}\sum \limits_{i=1}^n\left|\frac{{\hat{y}}_i-{y}_i}{y_i}\right|$$

(9)

$$MAE=\frac{1}{n}\sum \limits_{i=1}^n\left|{\hat{y}}_i-{y}_i\right|$$

(10)

where, y_i represents the observation, ${\hat{y}}_i$ represents the predicted traffic flow, and n represents the number of traffic flow samples. Evaluation metrics are used by quantify the error between the prediction and observation. Therefore, The smaller the value of evaluation metrics the better prediction performance of the model.

4 Data

4.1 Statistical analysis

To verify the superiority of the short-term traffic flow prediction model, we use the traffic flow data for four sections of an urban expressway in Zhengzhou, China, for 5 working days from 2nd to 6th, December 2019. The time interval is 1 hour, and each section has 120 pieces of data (5 day*24 h). Particularly, the model uses the data of the first 3 days for model training, and the data of the last 2 days for model evaluation. The four road sections are numbered S1, S2, S3 and S4 respectively. The traffic flow distribution of the four road sections in 5 days is shown in Fig. 2, and the descriptive statistics of the four road sections are shown in Table 1. It can be seen from Fig. 2 that the four road sections have time periodicity, and Table 1 shows that S2 section has the highest mean and volatility of traffic flow, and S1 section has the lowest mean and volatility of traffic flow.

Table 1 Basic information statistics of traffic flow

Full size table

4.2 Augmented dickey-fuller (ADF) unit root-based stability analysis

As can be seen in Fig. 2 and Table 1, the traffic flow data are unstable, with relatively large value differences between peak and low peak periods. Therefore, before model training, we performed ADF unit root-based stability analysis and stability processing on the traffic flow data (Aylar et al, 2019). The ADF values of the four road sections are shown in Table 2.

Table 2 ADF stability parameters

Full size table

As shown in Table 2, after the logarithmic and differential processing of four road sections, the p-values of ADF are less than 0.05, which indicates that the four road sections meet the stability requirements (Zhang, 2016).

5 Results and discussion

5.1 Model performance for overall

The Bi-GRU model for short-term traffic flow prediction is trained and validated, the overall evaluation results of RMSE, MAE, and MAPE are the average of four road sections, shown in Fig. 3. Meanwhile, the prediction performance of ARIMA, LSTM, Bi-LSTM, and GRU are also shown in Fig. 3 as the comparison.

It can be seen from Fig. 3 that the RMSE, MAPE and MAE values of Bi-GRU model for the overall traffic flow prediction of the four road sections are 30.38, 9.88% and 23.35, respectively, which are 0.87, 0.47% and 0.99 lower than the RMSE, MAPE and MAE of Bi-LSTM. It indicates that the overall prediction performance of the Bi-GRU model is slightly better than the Bi-LSTM model. Moreover, the prediction performance of LSTM and GRU are comparable, but both are worse than Bi-GRU and Bi-LSTM. ARIMA has the worst prediction results, and its MAE, RMSE and MAPE are significantly higher than LSTM, Bi-LSTM, GRU, Bi-GRU. This indicates that the four variants of RNN (e.g. LSTM, Bi-LSTM, GRU, Bi-GRU) which belongs to deep learning methods are is better at mining temporal characteristics than the traditional ARIMA model.

In addition, as can be seen from Fig. 3, the traffic flow prediction errors of both Bi-LSTM and Bi-GRU are smaller than those of LSTM and GRU. This indicates that the bi-directional structure of mining historical and subsequent time series data is useful for traffic flow prediction. In addition, we find that the MAPE of GRU is 12.36%, which is 2.53% lower than that of LSTM. This indicates that the GRU has a higher prediction accuracy and efficiency than LSTM for short-term traffic flow prediction. The MAPE of Bi-GRU (9.88%) is 0.48% lower than that of GRU (10.36%), which is a small difference in terms of traffic flow prediction error.

Therefore, in different application scenarios, different models can be applied for traffic flow prediction. For example, Bi-GRU is recommended for prediction in scenarios with high prediction accuracy and limited road sections. GRU is recommended for predicting traffic flow in a large road networks that require a large number of computational scenarios, combining prediction accuracy and computational efficiency.

5.2 Model performance for road sections

The predicted evaluation results for the four road sections are shown in Table 3. Among the prediction results of the Bi-GRU model for the four road sections, the RMSE evaluation values are 32.52, 37.56, 27.92, and 23.50, respectively, which are both lower than the benchmark methods. Similarly, the MAE evaluation function values were 25.19, 27.46, 23.19, 17.55, respectively, lower than the rest of the benchmark methods. We can conclude that under the RMSE and MAE indicators, Bi-GRU model for the four road sections shows a good predictive performance.

Table 3 Prediction results for 4 road sections

Full size table

In addition, in the prediction results of the Bi-GRU model for the four road sections, except that the MAPE value of the S2 section is slightly higher than that of the Bi-LSTM, the MAPE values of the S1, S3 and S4 sections are lower than those of the other baseline models, indicating that the Bi-GRU model shows good prediction performance for most road sections. For the S2 section, the MAPE of the Bi-GRU model is 11.01% slightly higher than 10.22% of the Bi-LSTM, which may be related to the observation of the road section. The mean value and standard deviation value of S2 are 255.84 and 82.89 respectively (see Table 1), slightly higher than S1, S3 and S4. Therefore, the prediction performance of Bi-LSTM may be better than that of Bi-GRU in road sections with large flow and fluctuation.

5.3 Model performance for peak and low peak periods

We further perform statistics on the model performance during peak (17:00–19:00) and low peak (6:00–8:00) periods (see Table 4) and compared the prediction of Bi-GRU and observation on four road sections. As can be seen from Table 4, the Bi-GRU model has lower RMSE, MAPE, and MAE in the peak period than in the low peak period, which indicates that Bi-GRU performs better in predicting traffic flow in the peak period. From Fig. 4, it can be found that in both four road sections, there has a lag in the prediction compared with the traffic flow observation. The phenomenon of lag is exhibited in each road section, which indicates the universal lag characteristics of the Bi-GRU for traffic flow prediction. This observation results is line with the findings in other articles (Yin et al, 2022).

Table 4 Bi-GRU performance during peak and low peak periods

Full size table

6 Conclusion

In this paper, we applied the emerging deep learning technologies and collected traffic data occurred in urban to predict the traffic flow. Accurate traffic flow prediction can provide the useful information for urban operators to develop management measures or for residents to adjust their travel plans or routes. Furthermore, the study will enable cities to operate more efficiently and ultimately achieve the goal of an intelligent and sustainable city of the future. This research will contribute to the effective management of complex urban information. Specifically, in this paper, the Bi-GRU is applied to predict the traffic flow of urban expressways, and traffic flow data of four road sections are applied to the model for training and evaluation. Before model training, the ADF unit root test and differential processing are carried out for the four road sections for data stability. Moreover, ARIMA, LSTM, Bi-LSTM, and GRU, which has the ability to mine the temporal characteristics of traffic flow, are introduced for comparison to further evaluate the performance of the Bi-GRU model.

Through model training and validation, the overall prediction results of RMSE, MAPE and MAE of the Bi-GRU model are 30.38, 9.88% and 23.35, respectively. The prediction error of Bi-GRU is lower than that of other models, which indicates that the Bi-GRU model has the highest prediction performance. The traffic flow prediction errors of both Bi-LSTM and Bi-GRU are smaller than those of LSTM and GRU, which indicates that the bi-directional structure of mining historical and subsequent time series data is useful for traffic flow prediction. The prediction accuracy of deep learning methods (e.g. LSTM, Bi-LSTM, GRU, and Bi-GRU) is significantly higher than that of the traditional ARIMA model. The MAPE difference of Bi-GRU and GRU is 0.48% which is a small prediction error values. Therefore, the Bi-GRU is recommended for traffic flow prediction in scenarios with high prediction accuracy and limited road sections. GRU is recommended for predicting traffic flow in large road network scenarios, combining prediction accuracy and computational efficiency. In addition, this paper compares the prediction and observation of each road section. It concludes that the Bi-GRU model shows better prediction results during peak than low peak, and the proposed model has a certain lag.

There are some limitations of this study that need to be acknowledged. First, due to data limitations, we did not compare the prediction performance of Bi-GRU with the existing combination model (e.g., CNN-BiGRU (Meng, Toan, et al., 2022), DTW-BiGRU (Zou et al, 2020). Second, we should include more latest technologies in the comparison. In the future, we will apply the emerging technologies and construct the combined model to improve the model accuracy under different scenarios. Besides, we will show the prediction results for multiple methods.

Availability of data and materials

The datasets generated during and/or analysed during the current study are not publicly available due to privacy restrictions from the third party but are available from the corresponding author on reasonable request.

References

Abduljabbar, R. L., Dia, H., & Tsai, P.-W. (2021). Unidirectional and bidirectional LSTM models for short-term traffic prediction. Journal of Advanced Transportation, 2021, 1–16. https://doi.org/10.1155/2021/5589075.
Article Google Scholar
Agarap, A. F. M. (2018). A neural network architecture combining gated recurrent unit (GRU) and support vector machine (SVM) for intrusion detection in network traffic data. In Proceedings of the 2018 10th international conference on machine learning and computing, (pp. 26–30). https://doi.org/10.1145/3195106.3195117.
Chapter Google Scholar
Ahuja, S., Shelke, N. A., & Singh, P. K. (2022). A deep learning framework using CNN and stacked bi-GRU for COVID-19 predictions in India. Signal, Image and Video Processing, 16(3), 579–586. https://doi.org/10.1007/s11760-021-01988-1.
Article Google Scholar
Ali, F., El-Sappagh, S., Islam, S. M. R., Ali, A., Attique, M., Imran, M., & Kwak, K.-S. (2021). An intelligent healthcare monitoring framework using wearable sensors and social networking data. Future Generation Computer Systems, 114, 23–43. https://doi.org/10.1016/j.future.2020.07.047.
Article Google Scholar
Aylar, E., Smeekes, S., & Westerlund, J. (2019). Lag truncation and the local asymptotic distribution of the ADF test for a unit root. Statistical Papers, 60(6), 2109–2118. https://doi.org/10.1007/s00362-017-0911-y.
Article Google Scholar
Bogaerts, T., Masegosa, A. D., Angarita-Zapata, J. S., Onieva, E., & Hellinckx, P. (2020). A graph CNN-LSTM neural network for short and long-term traffic forecasting based on trajectory data. Transportation Research Part C: Emerging Technologies, 112, 62–77. https://doi.org/10.1016/j.trc.2020.01.010.
Article Google Scholar
Cao, J., & Wang, J. (2019). Stock price forecasting model based on modified convolution neural network and financial time series analysis. International Journal of Communication Systems, 32(12), e3987. https://doi.org/10.1002/dac.3987.
Article Google Scholar
Chen, C., Liu, Z., Wan, S., Luan, J., & Pei, Q. (2021). Traffic flow prediction based on deep learning in internet of vehicles. IEEE Transactions on Intelligent Transportation Systems, 22(6), 3776–3789. https://doi.org/10.1109/TITS.2020.3025856.
Article Google Scholar
Chen, Q., Wang, W., Huang, X., & Liang, H. (2020). Attention-based recurrent neural network for traffic flow prediction. Journal of Internet Technology, 21(3), 831–839. https://doi.org/10.3966/160792642020052103020.
Article Google Scholar
Chen, W., Qi, W., Li, Y., Zhang, J., Zhu, F., Xie, D., … Tang, F. (2021). Ultra-short-term wind power prediction based on bidirectional gated recurrent unit and transfer learning. Frontiers in Energy Research, 9, 808116. https://doi.org/10.3389/fenrg.2021.808116.
Article Google Scholar
Cheng, A., Jiang, X., Li, Y., Zhang, C., & Zhu, H. (2017). Multiple sources and multiple measures based traffic flow prediction using the chaos theory and support vector regression method. Physica A: Statistical Mechanics and its Applications, 466, 422–434. https://doi.org/10.1016/j.physa.2016.09.041.
Article Google Scholar
Cho, K., van Merrienboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., & Bengio, Y. (2014). Learning phrase representations using RNN encoder–decoder for statistical machine translation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), (pp. 1724–1734). https://doi.org/10.3115/v1/D14-1179.
Chapter Google Scholar
Dai, G., Ma, C., & Xu, X. (2019). Short-term traffic flow prediction method for urban road sections based on space–time analysis and GRU. IEEE Access, 7, 143025–143035. https://doi.org/10.1109/ACCESS.2019.2941280.
Article Google Scholar
Duives, D., Wang, G., & Kim, J. (2019). Forecasting pedestrian movements using recurrent neural networks: An application of crowd monitoring data. Sensors, 19(2), 382. https://doi.org/10.3390/s19020382.
Article Google Scholar
Greff, K., Srivastava, R. K., Koutnik, J., Steunebrink, B. R., & Schmidhuber, J. (2017). LSTM: A search space odyssey. IEEE Transactions on Neural Networks and Learning Systems, 28(10), 2222–2232. https://doi.org/10.1109/TNNLS.2016.2582924.
Article Google Scholar
Gu, Y., Lu, W., Qin, L., Li, M., & Shao, Z. (2019). Short-term prediction of lane-level traffic speeds: A fusion deep learning model. Transportation Research Part C: Emerging Technologies, 106, 1–16. https://doi.org/10.1016/j.trc.2019.07.003.
Article Google Scholar
Haoyi, Z., & Jing, D. (2011). Comparison of aerosol size-distributions using linear-regression, genetic algorithm, and annealing genetic algorithm. Environmental Engineering Science, 28(12), 875–880. https://doi.org/10.1089/ees.2010.0339.
Article Google Scholar
Huang, M., Zhu, M., Xiao, Y., & Liu, Y. (2021). Bayonet-corpus: A trajectory prediction method based on bayonet context and bidirectional GRU. Digital Communications and Networks, 7(1), 72–81. https://doi.org/10.1016/j.dcan.2020.03.002.
Article Google Scholar
Kaffash, S., Nguyen, A. T., & Zhu, J. (2021). Big data algorithms and applications in intelligent transportation system: A review and bibliometric analysis. International Journal of Production Economics, 231, 107868. https://doi.org/10.1016/j.ijpe.2020.107868.
Article Google Scholar
Kamiński, B., Jakubczyk, M., & Szufel, P. (2018). A framework for sensitivity analysis of decision trees. Central European Journal of Operations Research, 26(1), 135–159. https://doi.org/10.1007/s10100-017-0479-6.
Article Google Scholar
Kanduri, A., Haghbayan, M.-H., Rahmani, A. M., Shafique, M., Jantsch, A., & Liljeberg, P. (2018). adBoost: Thermal aware performance boosting through dark silicon patterning. IEEE Transactions on Computers, 67(8), 1062–1077. https://doi.org/10.1109/TC.2018.2805683.
Article Google Scholar
Kumar, M. P., & Katiyar, V. K. (2013). Short term traffic flow prediction for a non-urban highway using artificial neural network, Procedia-social Behav. Sci, 104, 755–764.
Google Scholar
Li, C., & Xu, P. (2021). Application on traffic flow prediction of machine learning in intelligent transportation. Neural Computing and Applications, 33(2), 613–624. https://doi.org/10.1007/s00521-020-05002-6.
Article Google Scholar
Li, D. (2020). Predicting short-term traffic flow in urban based on multivariate linear regression model. Journal of Intelligent Fuzzy Systems, 39(2), 1417–1427. https://doi.org/10.3233/JIFS-179916.
Article Google Scholar
Li, L., Yang, Y., Yuan, Z., & Chen, Z. (2021). A spatial-temporal approach for traffic status analysis and prediction based on bi-LSTM structure. Modern Physics Letters B, 35(31), 2150481. https://doi.org/10.1142/S0217984921504819.
Article Google Scholar
Li, M., Cai, Z., & Sun, G. (2004). An adaptive genetic algorithm with diversity-guided mutation and its global convergence property. Journal of Central South University of Technology, 11(3), 323–327. https://doi.org/10.1007/s11771-004-0066-6.
Article Google Scholar
Li, P., Luo, A., Liu, J., Wang, Y., Zhu, J., Deng, Y., & Zhang, J. (2020). Bidirectional gated recurrent unit neural network for chinese address element segmentation. ISPRS International Journal of Geo-Information, 9(11), 635. https://doi.org/10.3390/ijgi9110635.
Article Google Scholar
Li, X., Ma, X., Xiao, F., Xiao, C., Wang, F., & Zhang, S. (2022). Time-series production forecasting method based on the integration of bidirectional gated recurrent unit (bi-GRU) network and sparrow search algorithm (SSA). Journal of Petroleum Science and Engineering, 208, 109309. https://doi.org/10.1016/j.petrol.2021.109309.
Article Google Scholar
Luo, X., Li, D., Yang, Y., & Zhang, S. (2019). Spatiotemporal traffic flow prediction with KNN and LSTM. Journal of Advanced Transportation, 2019, 1–10. https://doi.org/10.1155/2019/4145353.
Article Google Scholar
Ma, C., Dai, G., & Zhou, J. (2022). Short-term traffic flow prediction for urban road sections based on time series analysis and LSTM_BILSTM method. IEEE Transactions on Intelligent Transportation Systems, 23(6), 5615–5624. https://doi.org/10.1109/TITS.2021.3055258.
Article Google Scholar
Meng, M., Shao, C. F., Wong, Y. D., Wang, B. B., & Li, H. X. (2015). A two-stage short-term traffic flow prediction method based on AVL and AKNN techniques. Journal of Central South University, 22(2), 779–786. https://doi.org/10.1007/s11771-015-2582-y.
Article Google Scholar
Meng, Y., Chang, C., Huo, J., Zhang, Y., Mohammed Al-Neshmi, H. M., Xu, J., & Xie, T. (2022). Research on ultra-short-term prediction model of wind power based on attention mechanism and CNN-BiGRU combined. Frontiers in Energy Research, 10, 920835. https://doi.org/10.3389/fenrg.2022.920835.
Article Google Scholar
Moretti, F., Pizzuti, S., Panzieri, S., & Annunziato, M. (2015). Urban traffic flow forecasting through statistical and neural network bagging ensemble hybrid modeling. Neurocomputing, 167, 3–7. https://doi.org/10.1016/j.neucom.2014.08.100.
Article Google Scholar
Nagy, M., & Simon, V. (2018). Survey on traffic prediction in smart cities. Pervasive and Mobile Computing, 50, 148–163. https://doi.org/10.1016/j.pmcj.2018.07.004.
Article Google Scholar
Pang, B., Nijkamp, E., & Wu, Y. N. (2020). Deep learning with tensorflow: A review. Journal of Educational and Behavioral Statistics, 45(2), 227–248. https://doi.org/10.3102/1076998619872761.
Article Google Scholar
Schulz, E., Speekenbrink, M., & Krause, A. (2018). A tutorial on Gaussian process regression: Modelling, exploring, and exploiting functions. Journal of Mathematical Psychology, 85, 1–16. https://doi.org/10.1016/j.jmp.2018.03.001.
Shahid, F., Zameer, A., & Muneeb, M. (2020). Predictions for COVID-19 with deep learning models of LSTM, GRU and bi-LSTM. Chaos, Solitons & Fractals, 140, 110212. https://doi.org/10.1016/j.chaos.2020.110212.
Article Google Scholar
Shu, W., Cai, K., & Xiong, N. N. (2022). A short-term traffic flow prediction model based on an improved gate recurrent unit neural network. IEEE Transactions on Intelligent Transportation Systems, 23(9), 16654–16665. https://doi.org/10.1109/TITS.2021.3094659.
Article Google Scholar
Shu, Y. (2005). Wireless traffic modeling and prediction using seasonal ARIMA models. IEICE Transactions on Communications, E88-B(10), 3992–3999. https://doi.org/10.1093/ietcom/e88-b.10.3992.
Article Google Scholar
Shuai, C., Wang, W., Xu, G., He, M., & Lee, J. (2022). Short-term traffic flow prediction of expressway considering spatial influences. Journal of Transportation Engineering, Part A: Systems, 148(6), 04022026. https://doi.org/10.1061/JTEPBS.0000660.
Article Google Scholar
Sun, A. B., & Tao, Y. (2020). SSGRU: A novel hybrid stacked GRU-based traffic volume prediction approach in a road network. Computer Communications, 160, 502–511. https://doi.org/10.1016/j.comcom.2020.06.028.
Article Google Scholar
Sun, B., Cheng, W., Goswami, P., & Bai, G. (2018). Short-term traffic forecasting using self-adjusting k-nearest neighbours. IET Intelligent Transport Systems, 12(1), 41–48. https://doi.org/10.1049/iet-its.2016.0263.
Article Google Scholar
Tedjopurnomo, D. A., Bao, Z., Zheng, B., Choudhury, F. M., & Qin, A. K. (2022). A survey on modern deep neural network for traffic prediction: Trends, methods and challenges. IEEE Transactions on Knowledge and Data Engineering, 34(4), 1544–1561. https://doi.org/10.1109/TKDE.2020.3001195.
Article Google Scholar
Van Der Voort, M., Dougherty, M., & Watson, S. (1996). Combining kohonen maps with Arima time series models to forecast traffic flow. Transportation Research Part C: Emerging Technologies, 4(5), 307–318. https://doi.org/10.1016/S0968-090X(97)82903-8.
Article Google Scholar
Wang, G., Zhang, Z., Bian, Z., & Xu, Z. (2021). A short-term voltage stability online prediction method based on graph convolutional networks and long short-term memory networks. International Journal of Electrical Power & Energy Systems, 127, 106647. https://doi.org/10.1016/j.ijepes.2020.106647.
Article Google Scholar
Wang, S., Shao, C., Zhai, Y., Xue, S., & Zheng, Y. (2021). A multifeatures spatial-temporal-based neural network model for truck flow prediction. Journal of Advanced Transportation, 2021, 6624452. https://doi.org/10.1155/2021/6624452.
Article Google Scholar
Wang, S., Zhao, J., Shao, C., Dong, C., & Yin, C. (2020). Truck traffic flow prediction based on lstm and gru methods with sampled GPS data. IEEE Access, 8, 208158–208169. https://doi.org/10.1109/ACCESS.2020.3038788.
Article Google Scholar
Wei, D., & Liu, H. (2013). An adaptive-margin support vector regression for short-term traffic flow forecast. Journal of Intelligent Transportation Systems, 17(4), 317–327. https://doi.org/10.1080/15472450.2013.771107.
Article Google Scholar
Wei, W., Wu, H., & Ma, H. (2019). An AutoEncoder and LSTM-based traffic flow prediction method. Sensors, 19(13), 2946. https://doi.org/10.3390/s19132946.
Article Google Scholar
Williams, B. M. (2001). Multivariate vehicular traffic flow prediction: Evaluation of ARIMAX modeling. Transportation Research Record: Journal of the Transportation Research Board, 1776(1), 194–200. https://doi.org/10.3141/1776-25.
Article Google Scholar
Wu, C.-J., Schreiter, T., Horowitz, R., & Gomes, G. (2014). Traffic flow prediction using optimal autoregressive moving average with exogenous input-based predictors. Transportation Research Record, 2421, 125–132. https://doi.org/10.3141/2421-14.
Article Google Scholar
Wu, Y., Tan, H., Qin, L., Ran, B., & Jiang, Z. (2018). A hybrid deep learning based traffic flow prediction method and its understanding. Transportation Research Part C: Emerging Technologies, 90, 166–180. https://doi.org/10.1016/j.trc.2018.03.001.
Article Google Scholar
Xiao, Y., & Yin, Y. (2019). Hybrid lstm neural network for short-term traffic flow prediction. Information, 10(3), 105. https://doi.org/10.3390/info10030105.
Article Google Scholar
Xie, Y., Zhang, Y., & Ye, Z. (2007). Short-term traffic volume forecasting using kalman filter with discrete wavelet decomposition. Computer-Aided Civil and Infrastructure Engineering, 22(5), 326–334. https://doi.org/10.1111/j.1467-8667.2007.00489.x.
Article Google Scholar
Xing, L., & Liu, W. (2022). A data fusion powered bi-directional long short term memory model for predicting multi-lane short term traffic flow. IEEE Transactions on Intelligent Transportation Systems, 23(9), 16810–16819. https://doi.org/10.1109/TITS.2021.3095095.
Article Google Scholar
Xiong, C., Merity, S., & Socher, R. (2016). Dynamic memory networks for visual and textual question answering. https://doi.org/10.48550/ARXIV.1603.01417.
Book Google Scholar
Xu, D., Wang, Y., Jia, L., Qin, Y., & Dong, H. (2017). Real-time road traffic state prediction based on ARIMA and Kalman filter. Frontiers of Information Technology & Electronic Engineering, 18(2), 287–302. https://doi.org/10.1631/FITEE.1500381.
Article Google Scholar
Xue, Z., & Xue, Y. (2018). Multi long-short term memory models for short term traffic flow prediction. IEICE Transactions on Information and Systems, E101D(12), 3272–3275. https://doi.org/10.1587/transinf.2018EDL8087.
Article Google Scholar
Yang, B., Sun, S., Li, J., Lin, X., & Tian, Y. (2019). Traffic flow prediction using LSTM with feature enhancement. Neurocomputing, 332, 320–327. https://doi.org/10.1016/j.neucom.2018.12.016.
Article Google Scholar
Yang, D., Chen, K., Yang, M., & Zhao, X. (2019). Urban rail transit passenger flow forecast based on LSTM with enhanced long-term features. IET Intelligent Transport Systems, 13(10), 1475–1482. https://doi.org/10.1049/iet-its.2018.5511.
Article Google Scholar
Yao, B., Wang, Z., Zhang, M., Hu, P., & Yan, X. (2016). Hybrid model for prediction of real-time traffic flow. Proceedings of the Institution of Civil Engineers Transport, 169(2), 88–96. https://doi.org/10.1680/jtran.14.00015.
Article Google Scholar
Yin, J., Ning, C., & Tang, T. (2022). Data-driven models for train control dynamics in high-speed railways: LAG-LSTM for train trajectory prediction. Information Sciences, 600, 377–400. https://doi.org/10.1016/j.ins.2022.04.004.
Article Google Scholar
Zhang, D., & Kabuka, M. R. (2018). Combining weather condition data to predict traffic flow: A GRU-based deep learning approach. IET Intelligent Transport Systems, 12(7), 578–585. https://doi.org/10.1049/iet-its.2017.0313.
Article Google Scholar
Zhang, H., Wang, Z., & Liu, D. (2014). A comprehensive review of stability analysis of continuous-time recurrent neural networks. Ieee Transactions on Neural Networks and Learning Systems, 25(7), 1229–1262. https://doi.org/10.1109/TNNLS.2014.2317880.
Article Google Scholar
Zhang, J., & Qu, S. (2021). Optimization of backpropagation neural network under the adaptive genetic algorithm. Complexity, 2021, 1–9. https://doi.org/10.1155/2021/1718234.
Article Google Scholar
Zhang, L. (2016). Performance of unit-root tests for non linear unit-root and partial unit-root processes. Communications in Statistics-Theory and Methods, 45(15), 4528–4536. https://doi.org/10.1080/03610926.2014.922985.
Article Google Scholar
Zhang, L., Alharbe, N. R., Luo, G., Yao, Z., & Li, Y. (2018). A hybrid forecasting framework based on support vector regression with a modified genetic algorithm and a random forest for traffic flow prediction. Tsinghua Science and Technology, 23(4), 479–492. https://doi.org/10.26599/TST.2018.9010045.
Article Google Scholar
Zhang, L., Liu, Q., Yang, W., Wei, N., & Dong, D. (2013). An improved K-nearest neighbor model for short-term traffic flow prediction. Procedia - Social and Behavioral Sciences, 96, 653–662. https://doi.org/10.1016/j.sbspro.2013.08.076.
Article Google Scholar
Zhang, X., Xiao, F., Tong, X., Yun, J., Liu, Y., Sun, Y., … Chen, B. (2022). Time optimal trajectory planing based on improved sparrow search algorithm. Frontiers in Bioengineering and Biotechnology, 10, 852408. https://doi.org/10.3389/fbioe.2022.852408.
Article Google Scholar
Zhang, Z., Li, M., Lin, X., Wang, Y., & He, F. (2019). Multistep speed prediction on traffic networks: A deep learning approach considering spatio-temporal dependencies. Transportation Research Part C-Emerging Technologies, 105, 297–322. https://doi.org/10.1016/j.trc.2019.05.039.
Article Google Scholar
Zhang, Z., Li, M., & Wang, Y. (2021). A customized deep learning approach to integrate network-scale online traffic data imputation and prediction. Transportation Research Part C: Emerging Technologies, 132, 103372. https://doi.org/10.1016/j.trc.2021.103372.
Article Google Scholar
Zhao, Y., Cao, J., Zhang, H., & Liu, Z. (2021). A deep learning traffic flow prediction framework based on multi-channel graph convolution. Transportation Planning and Technology, 44(8), 887–900. https://doi.org/10.1080/03081060.2021.1992180.
Article Google Scholar
Zheng, H., Lin, F., Feng, X., & Chen, Y. (2021). A hybrid deep learning model with attention-based conv-lstm networks for short-term traffic flow prediction. IEEE Transactions on Intelligent Transportation Systems, 22(11), 6910–6920. https://doi.org/10.1109/TITS.2020.2997352.
Article Google Scholar
Zhu, L., Kwong, S., Zhang, Y., Wang, S., & Wang, X. (2020). Generative adversarial network-based intra prediction for video coding. IEEE Transactions on Multimedia, 22(1), 45–58. https://doi.org/10.1109/TMM.2019.2924591.
Article Google Scholar
Zou, H., Liu, H., Zhou, T., Jiashun, L., & Zhan, Y. (2020). Short-term traffic flow prediction using dtw-bigru model. In 2020 35th youth academic annual conference of Chinese Association of Automation (YAC), (pp. 557–562). https://doi.org/10.1109/YAC51587.2020.9337579.
Chapter Google Scholar
Kim, J. H. K., & Lee, G. (2022). GPS data-based mobility mode inference model using long-term recurrent convolutional networks. Transportation Research Part C: Emerging Technologies, 135. https://doi.org/10.1016/j.trc.2021.103523.
Meng, M., Toan, T. D., Wong, Y. D., & Lam, S. H. (2022). Short-term travel-time prediction using support vector machine and nearest neighbor method. Transportation Research Record, 03611981221074371. https://doi.org/10.1177/0361198122107437.
Wang, H., Zhang, R., Cheng, X., & Yang, L. (2022). Hierarchical traffic flow prediction based on spatial-temporal graph convolutional network. IEEE Transactions on Intelligent Transportation Systems, 1–11. https://doi.org/10.1109/TITS.2022.3148105.

Download references

Acknowledgments

This study was sponsored by the National Natural Science Foundation of China (No. 52072025).

Funding

This study was sponsored by the National Natural Science Foundation of China (No. 52072025).

Author information

Authors and Affiliations

School of Traffic Management, People’s Public Security University of China, Beijing, 10038, China
Shengyou Wang
Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport, Beijing Jiaotong University, Beijing, 10044, China
Chunfu Shao
Business School, University of Bristol, Bristol, BS8 1SD, UK
Jie Zhang
School of Transportation, Southeast University, Nanjing, 211189, China
Yan Zheng
School of Management, University of Bath, Bath, BA2 7AY, UK
Meng Meng

Authors

Shengyou Wang
View author publications
You can also search for this author in PubMed Google Scholar
Chunfu Shao
View author publications
You can also search for this author in PubMed Google Scholar
Jie Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yan Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Meng Meng
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Shengyou Wang and Chunfu Shao designed the study framework and developed the methodology. Jie Zhang and Yan Zheng validated the comparison and performance. Meng Meng improved the methodology and supported the overall writing. All authors provided critical feedback and helped conduct the research, validate analysis, and write the manuscript. The authors read and approved the final manuscript.

Corresponding author

Correspondence to Meng Meng.

Ethics declarations

Ethics approval and consent to participate

Ethical approval was not sought for the present study, because it does not involve human or animal subjects. All co-authors have seen and agree with the contents of the manuscript to be submitted to Urban Informatics.

Consent for publication

We certify that the submission is original work and is not under review at any other publication. All co-authors agree with consent for publication if this paper is accepted.

Competing interests

The authors have no conflicts of interest to declare.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, S., Shao, C., Zhang, J. et al. Traffic flow prediction using bi-directional gated recurrent unit method. Urban Info 1, 16 (2022). https://doi.org/10.1007/s44212-022-00015-z

Download citation

Received: 27 September 2022
Revised: 09 November 2022
Accepted: 10 November 2022
Published: 01 December 2022
DOI: https://doi.org/10.1007/s44212-022-00015-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Traffic flow prediction using bi-directional gated recurrent unit method

Abstract

Similar content being viewed by others

Deep Bi-directional Long Short-Term Memory Model for Short-Term Traffic Flow Prediction

Traffic Flow Prediction Model and Performance Analysis Based on Recurrent Neural Network

Traffic Flow Prediction Using Deep Learning Techniques

1 Introduction

2 Literature review

2.1 Short-term traffic flow prediction

2.2 Recurrent neural network variants

3 Methodology

3.1 Bi-directional gated recurrent unit (bi-GRU) model

3.2 Comparison models

3.3 Evaluation metrics

4 Data

4.1 Statistical analysis

4.2 Augmented dickey-fuller (ADF) unit root-based stability analysis

5 Results and discussion

5.1 Model performance for overall

5.2 Model performance for road sections

5.3 Model performance for peak and low peak periods

6 Conclusion

Availability of data and materials

References

Acknowledgments

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation