1 Introduction

With the continuous growth in demand for rail transit in various cities, metro networks are rapidly expanding in structure and scale, exhibiting networked and increasingly complex characteristics [1]. As the backbone of large-scale urban transportation, urban rail transit systems have two main characteristics. First, there is strong correlation within the network: stations and lines are interconnected and influence one another. Second, the network has limited carrying capacity: each line and each node on a line has a maximum carrying capacity, and the network as a whole runs smoothly only when every node and line operates within that range. When an abnormality occurs at a node, such as a sudden surge in passenger flow, a natural disaster, a terrorist attack, or severe weather, a station may become overloaded or paralyzed outright. Because of the strong correlation within the urban rail transit network, faults can spread to surrounding stations, while the limited carrying capacity of those stations restricts their ability to absorb the redistributed load. Stations that bear loads beyond their capacity also become paralyzed through overload operation, producing a cycle of failures that can bring down an entire line or even the whole network, which poses great danger. Complex network theory, with its scale-free and small-world characteristics, has been applied in multiple fields, including power facilities, communication facilities, and transportation [2, 3]. Complex networks can intuitively describe the process of node failure and reveal the weak links in a network. Owing to their robustness, urban rail transit networks are not easily destroyed by random attacks. However, when one or more stations are attacked in unexpected situations, their operating status is easily disrupted, and the entire network may be affected as these abnormal situations propagate [4].
Some existing studies have evaluated the stability of the entire network by assuming the failure of a node or edge in the network.

However, these studies have inherent limitations: they generally simulate an attack on a station without considering the specific risk factors that cause the station to malfunction, and different types and levels of risk events affect stations differently. Additionally, the dynamic changes within the metro network over time are difficult to characterize accurately. Therefore, this paper proposes a method for predicting and estimating the cascade failure propagation trend of the subway network based on key risk sources, which overcomes these limitations.

2 Literature Review

Research on the network propagation of operational failures in rail transportation has focused primarily on three areas: text mining of hazard sources and failures, cascading failure analysis, and disaster propagation. This section reviews the literature on each in turn.

2.1 Text Mining

In recent years, natural language processing technology has matured and has been widely applied in various fields [5,6,7]. In the safety field, some scholars have extracted risk item features from texts, such as Luo [8], who proposed the preprocessing of road traffic accident reports to enhance the feature representation of risk sources, build a double-hidden-layer adaptive convolutional neural network, and identify risk sources through sample training. Li [9] extracted features from reports on high-altitude construction accidents, obtained causal feature items, causal networks, and causal sets, and displayed the results using word clouds and network structure graphs. Some scholars have also used network models to mine risk factors and make predictions, such as Xue [10], who focused on the safety accident reports of construction projects, constructed a safety network model, graded the influencing factors, and implemented graded control to verify the feasibility of the model in handling such problems. Wu [11] used R language to mine ship collision accident text reports, studied methods for processing rare professional terms, and based on the mining of causal key factors, built a Bayesian network model to predict river ship accident risks. Others have implemented risk mining and control through system construction or design management frameworks. Fa [12] used coal-mining accident reports, used text mining technology to establish a coal mine human factor analysis and classification system, extracted strong association rules among influencing factors, and proposed relevant hypotheses to identify and analyze the hierarchical structure relationship in the human–machine interaction system framework from multiple perspectives. 
Xu [13] used text mining technology, based on data from construction accident reports, designed a translation management framework, and proposed information entropy-weighted term frequency for term importance evaluation, ultimately extracting core factors affecting construction safety. Chu [14] proposed a global supply chain risk management framework based on text mining, and collected and analyzed the existing literature; the analysis results revealed the importance of content related to terms, further defining potential supply chain risk factors. Zhao [15] proposed a network news risk factor extraction method based on the latent Dirichlet allocation (LDA) model, ultimately determining 28 risk factors, analyzing the relationships among these factors, and evaluating the risk factor structure of the oil market. Later, some scholars combined text mining technology with complex networks to find the connections between accident causation items. Qiu [16] creatively combined text mining technology with complex networks to identify 52 main accident causation factors, further constructing a coal mine accident causation network, clarifying eight core factors and their associated sets, as well as seven key links. Abdhul [17] proposed an automatic, semi-supervised, and domain-independent accident report analysis method, identified specific domain keywords in complex network structures, and grouped them into topics with expert participation, using these keywords and topics for various data mining purposes. Meanwhile, other scholars have utilized text mining techniques in practical applications to achieve quantitative analysis of accidents. For example, Liu [18] extracted train derailment accident data for various track types from the Federal Railroad Administration (FRA) Railway Equipment Accident Database and statistically analyzed them based on the frequency and severity of occurrence to derive the main causes of train derailment accidents. 
Wang [19] designed a railroad safety dictionary and comprehensively applied algorithms including the trie, directed acyclic graph (DAG), Viterbi, and hidden Markov model (HMM) to extract causative keywords from accident reports, then mined association rules between the causative factors and the accidents, and combined them with a high-speed railroad derailment matching model based on external-environment risk factors to achieve an accurate, quantitative analysis of the safety situation of high-speed railroads.

The literature above applies text mining technology to extract and analyze risks or risk causes from industry accident reports, with notable results. However, such research generally relies on experts or focuses on the features of risk causation items, and the mining of risks and their causes directly from text still leaves room for improvement.

2.2 Cascading Failure

Some current research on metro network failures assumes that the failure of a node or edge does not affect other nodes or edges in the network; this is referred to as static robustness research. In real-world complex networks, however, such as urban rail transit networks, power networks, and communication networks, nodes or edges may fail due to random accidents or deliberate attacks, and the resulting load transfer can cause further nodes or edges to fail in a chain reaction; this phenomenon is known as cascading failure [20]. Experts and scholars in various fields have extensively studied the cascading failure process, and the main proposed models include the load-capacity model [21], the binary model [22], and the sandpile model [23]. Among them, the load-capacity model has had the widest impact and has been applied in empirical research and analysis of real networks.

Research on the load-capacity model focuses mainly on three basic issues: the definition of the initial load of nodes or edges, the definition of their capacity, and the method of load redistribution. Freeman [24] defined the initial load of a node as its betweenness centrality and the capacity as a linear function of the initial load, which is a reasonable and widely used characterization. However, betweenness centrality is a global quantity with high computational complexity, since calculating it requires the properties of the entire network. Later, Wang [25] and others defined the initial load of a node based on its degree and the total degree of its adjacent nodes, and proposed the new concept of the failure probability of overloaded nodes.

Currently, commonly used load redistribution methods fall into two categories: global allocation across the entire network, and nearest-neighbor strategies that allocate the load of failed nodes to the spare capacity of adjacent nodes. Li [26] posited that the information processing capacity of a node is reflected by its degree, and that effective allocation of additional capacity through vertex quotas can prevent cascading failure and effectively improve network robustness. Duan [27] proposed a cascading failure model with adjustable load redistribution range and heterogeneity, and analyzed the cascading failure conditions of the model on scale-free networks; the results showed that reasonable adjustment of the redistribution range and heterogeneity can significantly improve the robustness of complex networks. Fang [28] introduced the concept of neighbor links, proposed a load distribution method that spreads the load of failed nodes evenly over their adjacent nodes, and studied the cascading failure phenomenon on directed complex networks in a new environment. Ma [29] proposed a new load-capacity model that redefines the load distribution rule based on a node self-repair time factor, and analyzed the adjustable parameters of node capacity and the self-repair factor. Ju [30] combined node degree and betweenness centrality to redefine the load distribution of adjacent edges and studied the robustness of networks under cascading failure. Li [31] constructed a model of the urban passenger transport network in a city cluster and evaluated its anti-destructive performance under cascading failure using an improved optimal load allocation strategy weighted by actual passenger flow.

A review of the domestic and foreign research above shows that most studies of cascading failure in metro networks are based on real cities and, through analysis of network structure and cascading failure models, provide a theoretical basis and decision-making support.

2.3 Disaster Propagation

Buzna et al. [32] first proposed a model of fault propagation in a general network system that considers node recovery capability and transfer mechanisms to describe the dynamic spread and impact of disasters in complex networks; the model treats network nodes as active bistable elements with delayed interactions along directed links. Later, Buzna et al. [33] applied disaster propagation theory to study the effectiveness of different emergency strategies and optimized resource allocation based on network state and topology, evaluating the strategies by varying the network topology, delay time factors, and overall resource allocation. Hu [34] proposed a resource node attribute model based on disaster propagation theory that combines resource value, the disaster energy of each node, the disaster propagation path, and the propagation characteristics, and used the model to determine the optimal timing for disaster relief and emergency resource preparation. Yi [35] used a method for simulating multiple failure events to describe the random factors that trigger disasters. Ouyang [36] presented an improved model of redundant systems in networks and analyzed the differences in the spreading process and the role of important parameters; the results show that disaster spreading becomes slower when redundant systems exist in the network. Later, Ouyang [37] studied the impact of several redundancy strategies on controlling disaster propagation and found that an improved random network copes better with disasters, while a strategy based on total degree is the most effective way to control disaster propagation in scale-free networks. Weng [38] established a universal disaster propagation dynamic model and studied the influence of three important characteristic parameters: the self-repair factor, the delay time factor, and the noise intensity.
Xiao [39] established a dynamic model of congestion propagation in the rail transit network based on the disaster propagation dynamic model. The congestion propagation model can reflect the process of congestion propagation in the rail transit network, and the simulation process reveals the propagation law of congestion in the metro network.

In summary, early research on disaster propagation theory fully considered the evolution of faults over time, node self-recovery capability, disaster-fault-attack propagation mechanisms, and other influencing factors such as internal random noise. Based on disaster propagation theory, and combined with the characteristics of the subway network and its key hazards, a cascade failure model can be established to explore how faults propagate through the network under attacks of different forms and levels, and operational data can be used to predict the scope and severity of different risks.

3 Identification and Quantitative Treatment of Key Hazard Sources

3.1 Methods for Identifying Key Hazard Sources

Currently, the key high-risk sources in subway operations often remain unidentified. In practice, identification relies mainly on experienced experts or staff, which is highly subjective and lacks scientific, data-driven support. Establishing a key hazard identification method based on subway dispatch logs is therefore a new exploration that can identify key hazards accurately from a data perspective.

3.1.1 Data Preprocessing

The dispatch logs contain a large amount of information, but they typically consist of textual descriptions of events that cannot be directly used as objects for data mining. Therefore, data preprocessing is required, and a data processing flow as shown in Fig. 1 is designed to handle the data.

Fig. 1
figure 1

Data preprocessing process

Step 1: Cleaning the interfering data. The dispatch logs contain a large number of records that are irrelevant to operational risk events, such as normal vehicle dispatch and routine maintenance information, which carry no risk-source information. Python is therefore used, in combination with a list of common hazard sources, to filter the data and extract most of the valid records, as shown in the pseudocode in Fig. 2. To ensure the integrity of the valid data, the remaining records are screened manually so that the interfering data are cleaned completely.

Fig. 2
figure 2

Pseudocode for removing interfering data
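The cleaning step in Fig. 2 can be sketched in Python as follows; the hazard-keyword list and the log sentences are illustrative assumptions, not the actual dictionary or data used in this study.

```python
# Hypothetical keyword list standing in for the common-hazard-source dictionary.
HAZARD_KEYWORDS = {"fault", "delay", "fire", "overload", "signal failure"}

def filter_logs(records):
    """Split log records into risk-related entries and candidates for manual screening."""
    kept, uncertain = [], []
    for record in records:
        if any(kw in record for kw in HAZARD_KEYWORDS):
            kept.append(record)       # contains risk-source information
        else:
            uncertain.append(record)  # passed to manual screening for completeness
    return kept, uncertain

logs = [
    "train door fault caused a delay on the up line",
    "routine maintenance of escalator completed",
]
valid, to_review = filter_logs(logs)
# valid keeps the fault record; the maintenance record goes to manual review
```

Records that match no keyword are not discarded outright but queued for manual screening, mirroring the two-stage cleaning described above.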

Step 2: Word segmentation and stop-word removal. Word segmentation was performed on the PyCharm development platform using Python's Jieba library in accurate mode, which is better suited to text analysis. A custom dictionary of professional terminology was loaded before segmentation to improve its accuracy. After segmentation, stop words irrelevant to the research, such as punctuation, numbers, and function words like "also" and "just", were removed based on the Harbin Institute of Technology stop-word list together with a customized stop-word dictionary reflecting the professional characteristics of subway operation. The specific processing results are shown in Table 1.

Table 1 Jieba word segmentation and removing stop words
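Step 2 can be sketched as follows. In practice the segmentation itself would call `jieba.load_userdict()` and `jieba.lcut(text, cut_all=False)` (accurate mode); here the segmenter output is mocked so the sketch stays self-contained, and the stop-word set is a small illustrative stand-in for the HIT list plus the custom dictionary.

```python
# Illustrative stop-word set; the study uses the HIT list plus a custom metro dictionary.
STOP_WORDS = {"the", "a", "on", "also", "just", ",", "."}

def remove_stop_words(tokens):
    """Drop stop words, punctuation (via the set), and bare numbers."""
    return [t for t in tokens if t not in STOP_WORDS and not t.isdigit()]

# Mocked segmenter output for: "door fault on the up line, delay 3 min"
tokens = ["door", "fault", "on", "the", "up", "line", ",", "delay", "3", "min"]
clean = remove_stop_words(tokens)
# clean == ["door", "fault", "up", "line", "delay", "min"]
```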

Step 3: Data format conversion. Assume that after preprocessing, each data entry in the log file contains n words; two consecutive data entries can then be represented as shown in Eqs. (1) and (2):

$$ A_{1} = (a_{1} ,a_{2} ,a_{3} ,...,a_{n} ) $$
(1)
$$ A_{i} = (a_{n} ,a_{n + 1} ,a_{n + 2} ,...,a_{j} ) $$
(2)

in which \(A_{i}\) represents the ith data entry, \(i \in (1,m)\), and \(a_{j}\) represents the embedding of the jth word, \(j \in (1,n)\). The same word is always represented by the same \(a_{j}\); thus all words can be represented in the form \(a_{j}\).

The final storage format for the log data is shown in Table 2.

Table 2 Event description format after preprocessing of operation log

The serial number in Table 2 is the unique code of each log entry.
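One simple way to realize this conversion is to assign each distinct word a code \(a_{j}\) in order of first appearance, so identical words share the same code across entries; this is an illustrative sketch, not the paper's exact encoding.

```python
def encode_entries(entries):
    """Map each distinct word to a stable integer code (the index j of a_j)."""
    vocab = {}
    encoded = []
    for entry in entries:
        codes = []
        for word in entry:
            if word not in vocab:
                vocab[word] = len(vocab) + 1  # a_1, a_2, ... by first appearance
            codes.append(vocab[word])
        encoded.append(codes)
    return encoded, vocab

entries = [["door", "fault", "delay"], ["signal", "fault", "delay"]]
encoded, vocab = encode_entries(entries)
# encoded == [[1, 2, 3], [4, 2, 3]]: "fault" and "delay" reuse their codes
```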

3.1.2 Algorithm for Identifying Key Hazards

The classic Apriori algorithm first generates candidate item sets and their support through joining, then filters frequent item sets against a support threshold. It achieves feasible association rule extraction on large datasets, but it must repeatedly scan the transaction database, which leads to excessive I/O load on large data volumes, and the candidate sets generated during computation can grow so large that memory is exhausted and time costs rise sharply. To overcome these shortcomings, Han [40] proposed the frequent pattern growth (FP-growth) algorithm, which condenses the database into an FP-tree while retaining the association information of the item sets; the analysis then requires only two passes over the database. Even so, FP-growth still takes a long time and occupies a large amount of memory on large datasets, resulting in poor computational efficiency. Drawing on the processing ideas of FP-growth, a more efficient association rule algorithm is therefore designed in this study.

In the calculation, support and confidence serve as the judgment criteria: support measures the probability that data items occur together, while confidence is used to mine strong association rules in the text. The two indicators are calculated as shown in Eqs. (3) and (4).

$$ Support(X,Y) = P(XY) = \frac{{m_{XY} }}{{M_{all} }} $$
(3)
$$ Confidence(X \Leftarrow Y) = P(X\left| Y \right.) = \frac{P(XY)}{{P(Y)}} $$
(4)

where X and Y respectively represent different data elements, \(P\left(XY\right)\) represents the probability of X and Y occurring simultaneously, \({m}_{XY}\) indicates the frequency of X and Y occurring at the same time, and \({M}_{all}\) represents the total amount of data.

\(P\left(X|Y\right)\) represents the probability of X occurring under the condition of Y occurrence, \(P(Y)\) represents the probability of occurrence of element Y.
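As a quick illustration of Eqs. (3) and (4), support and confidence can be computed over a small set of made-up log transactions:

```python
def support(transactions, x, y):
    """Eq. (3): P(XY) = m_XY / M_all."""
    m_xy = sum(1 for t in transactions if x in t and y in t)
    return m_xy / len(transactions)

def confidence(transactions, x, y):
    """Eq. (4): P(X|Y) = P(XY) / P(Y)."""
    p_y = sum(1 for t in transactions if y in t) / len(transactions)
    return support(transactions, x, y) / p_y

# Illustrative transactions: keyword sets extracted from four log entries.
logs = [{"door", "delay"}, {"door", "delay"}, {"signal", "delay"}, {"door"}]
s = support(logs, "door", "delay")     # 2/4 = 0.5
c = confidence(logs, "door", "delay")  # 0.5 / 0.75 = 2/3
```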

The first step of the algorithm is to build the FP-tree. The entire dataset is scanned once to count the frequency of every item. Items that do not meet the support threshold are then filtered out, and the remaining items are sorted by frequency from high to low to generate the frequent item set event table. The support threshold should be chosen according to the actual data and research aims; for illustration, the support count is temporarily set to 2 here. As shown in Table 3, items such as \({a}_{16}\), \({a}_{17}\), and \({a}_{18}\) are therefore removed.

Table 3 Frequent item set event element table

After determining the frequent item set events, a header table must be built to store the occurrence frequency of all item sets, in which each entry also holds a pointer to the first node of the corresponding item in the tree. In the Python program, a dictionary is used to store the header table, and the final constructed result is shown in Fig. 3.

Fig. 3
figure 3

Construction of FP-tree
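The first pass described above, counting item frequencies, dropping items below the support count, sorting in descending order, and storing the header table in a dictionary, can be sketched as follows; the transactions and threshold are illustrative.

```python
from collections import Counter

def build_header_table(dataset, min_support=2):
    """First FP-tree pass: count items, filter by support, sort descending."""
    counts = Counter(item for transaction in dataset for item in transaction)
    frequent = {item: n for item, n in counts.items() if n >= min_support}
    ordered = sorted(frequent, key=lambda item: frequent[item], reverse=True)
    # Header table entry: item -> [support count, pointer to first tree node]
    return {item: [frequent[item], None] for item in ordered}

data = [["a5", "a10", "a2"], ["a5", "a10"], ["a5", "a16"]]
header = build_header_table(data)
# "a2" and "a16" (count 1) are dropped; "a5":3 and "a10":2 remain
```

The `None` pointer would later be linked to the first node of each item as the FP-tree is built.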

After constructing the FP-tree, frequent patterns need to be mined from it. However, when generating conditional pattern bases, the FP-growth algorithm must traverse common paths multiple times; for a large FP-tree, this occupies a large amount of computer memory and significantly prolongs computation [39, 40]. To improve efficiency and avoid excessive complexity on large datasets, the ascending FP (AFP)-tree algorithm is proposed as an improvement to FP-growth. The AFP-tree algorithm reads the FP-tree by preorder traversal, so that all conditional pattern bases of the frequent one-item sets can be obtained in a single scan of the tree. The basic steps for generating conditional pattern bases with the AFP-tree algorithm are as follows:

  1. (1)

    Build a common path (CP) with the initial value set to null. Scan node \({a}_{5}\), and the CP stores the prefix path of \({a}_{5}\). Since CP is currently empty, the conditional pattern base of \({a}_{5}\) is also empty.

  2. (2)

    Add \({a}_{5}\) to CP, then scan node \({a}_{10}\). At this point, CP stores the prefix path of \({a}_{10}\); therefore, \({a}_{5}\) is the conditional pattern base of \({a}_{10}\), with a support count of 4, denoted as \({a}_{5}\):4.

  3. (3)

    After storing \({a}_{10}\) in CP, update the content of CP as \({a}_{5}{a}_{10}\). Then, scan \({a}_{2}\). At this time, CP stores the prefix path of \({a}_{2}\), so \({a}_{5}\) and \({a}_{10}\) are the conditional pattern bases of \({a}_{2}\) with a support count of 3, denoted as \({a}_{5}\),\({a}_{10}\):3.

  4. (4)

    Store \({a}_{2}\) in CP and update CP to \({a}_{5}{a}_{10}{a}_{2}\). Then scan \({a}_{6}\) and, following the same process as in steps (2) and (3), obtain the prefix path of \({a}_{6}\). Continuing the scan, the prefix path of \({a}_{8}\) is obtained as \({a}_{5}\),\({a}_{10}\),\({a}_{2}\),\({a}_{6}\),\({a}_{9}\),\({a}_{4}\),\({a}_{7}\):1. Since \({a}_{8}\) is a terminal node, return to the most recent branching node and traverse the unexplored branch node \({a}_{9}\), updating CP to \({a}_{5}{a}_{10}{a}_{2}{a}_{6}\).

  5. (5)

    Continue scanning the other child node \({a}_{12}\) of \({a}_{9}\), obtaining the conditional pattern base \({a}_{5}\),\({a}_{10}\),\({a}_{2}\),\({a}_{6}\),\({a}_{9}\):1 for \({a}_{12}\), and updating CP to \({a}_{5}{a}_{10}{a}_{2}{a}_{6}{a}_{9}{a}_{12}\). Then scan \({a}_{15}\), obtaining the conditional pattern base \({a}_{5}\),\({a}_{10}\),\({a}_{2}\),\({a}_{6}\),\({a}_{9}\),\({a}_{12}\):1 for \({a}_{15}\).

  6. (6)

    Continuing the scan, \({a}_{15}\) is found to be a leaf node, so the algorithm returns to the unscanned branch node \({a}_{10}\). Repeating this process scans all remaining child nodes in the tree and obtains all conditional pattern bases, as shown in Table 4.

Table 4 Finding frequent patterns through conditional pattern bases

The AFP-tree algorithm scans the tree using preorder traversal; a single scan of all nodes in the FP-tree suffices to obtain the conditional pattern bases of all frequent one-item sets in the data. Both the time and space complexity of the algorithm are proportional to the number of nodes in the tree, i.e., O(n), where n is the total number of nodes in the FP-tree.
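The single preorder walk can be sketched as follows; the node structure and the tiny example tree are illustrative, not the paper's implementation.

```python
class Node:
    """Minimal FP-tree node for the sketch."""
    def __init__(self, item, count, children=None):
        self.item, self.count = item, count
        self.children = children or []

def collect_pattern_bases(node, cp, bases):
    """Preorder traversal: cp is the common path (prefix) accumulated so far;
    each visited node records cp as one of its conditional pattern bases."""
    for child in node.children:
        bases.setdefault(child.item, []).append((tuple(cp), child.count))
        collect_pattern_bases(child, cp + [child.item], bases)
    return bases

# Tiny single-branch tree: root -> a5(4) -> a10(4) -> a2(3)
tree = Node(None, 0, [Node("a5", 4, [Node("a10", 4, [Node("a2", 3)])])])
bases = collect_pattern_bases(tree, [], {})
# bases["a10"] == [(("a5",), 4)], i.e. the conditional pattern base a5:4
```

Every node is visited exactly once, matching the O(n) complexity stated above.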

Furthermore, real-time pruning is used to sort the frequent item sets in descending order of support, keeping only those that meet the support threshold and deleting the rest. This yields the non-redundant conditional FP-tree shown in Fig. 4.

Fig. 4
figure 4

Conditional FP-tree

From the mining result in Fig. 4, we can obtain a frequent pattern \(({a}_{5},{a}_{10},{a}_{2},{a}_{6},{a}_{9}:2)\), which indicates a strong association among "door", "delay", "up", "fault", and "terminal". Furthermore, we can derive (\({a}_{5}\),\({a}_{10}\):4), indicating an even stronger association between "door" and "delay". Because of the limited sample size in this example, it is not possible to describe all the association rules that may exist in the log data. With a sufficiently large sample, however, more frequent item sets can be discovered and the complete rules derived through text mining, allowing the key risk factors that cause operational risks in the subway system to be identified.

3.2 Weighted Identification of Key Risk Sources

3.2.1 Sequential Relationship Weighting Analysis

Commonly used subjective weighting methods include the analytic hierarchy process (AHP) and sequence relationship analysis. AHP is the most widely used subjective weighting method, but when many factor indicators must be judged, the pairwise comparisons become complex, which easily causes logical confusion and makes consistency checking difficult and less accurate; AHP is therefore unsuitable for this problem. Sequence relationship analysis avoids the logical problems and heavy workload caused by a large number of factors: relevant domain experts compare each key hazard source with the others, rank their relative importance, and thereby determine their subjective weights. This idea of ranking relative importance from expert experience is consistent with identifying key hazard sources through mining, and the data from the preceding analysis can substitute for the parts that would otherwise require expert judgment, further reducing the influence of subjective factors on the weighting results. The sequence relationship analysis method is therefore adopted for the subjective weighting of key hazard sources.

The main steps of the sequence relationship analysis method are as follows:

  1. (1)

    Determining the importance of the indicators and establishing their sequence relationships.

Invite relevant experts to rank the relative importance of the key hazard sources. If hazard source \(H_{i}\) is more important than hazard source \(H_{j}\), this is denoted as \(H_{i} > H_{j}\).

Sort the hazard sources in order of relative importance as \(H_{a} > H_{b} > ... > H_{m} > H_{n}\), where \(a,b,m = 1,2,...,n\).

  2. (2)

    Calculating the relative importance of adjacent hazard sources.

Once the relative importance order of the key hazard sources is determined through expert judgment, the relative importance of each pair of adjacent hazard sources must be determined. The relative importance \(R_{k}\) between hazard source \(H_{k}\) and its adjacent source \(H_{k - 1}\) can be expressed as Eq. (5), and the values for the relative importance \(R_{k}\) are given in Table 5 [43].

$$ R_{k} = \frac{{H_{k - 1} }}{{H_{k} }},\;k = n,n - 1,...,3,2 $$
(5)
Table 5 Values for relative importance of hazard sources

If the product of the relative importance values \(\Pi_{k = 1}^{n} R_{k}\) > 1.8, the cumulative importance exceeds the extreme importance degree, indicating an abnormality in the subjective judgment. In this case, \(R_{k}\) must be corrected by the correction coefficient μ according to Eqs. (6) and (7).

$$ \mu = \left( {\frac{1.8}{{\Pi_{k = 1}^{n} R_{k} }}} \right)^{{\frac{1}{n - 1}}} $$
(6)
$$ R_{k}^{\prime} = \mu \cdot R_{k} $$
(7)
  3. (3)

    Calculating the subjective weight values.

According to the relative importance degrees of the hazard sources, the weights of the key hazard sources are calculated using Eqs. (8) and (9).

$$ \omega_{n}^{\prime} = \left\{ {\begin{array}{*{20}c} {(1 + \sum\limits_{{{\text{i}} = 1}}^{n} {\prod\limits_{k = i}^{n} {R_{k} } } )^{ - 1} ,\prod\limits_{k = i}^{n} {R_{k} } \le 1.8} \\ {(1 + \sum\limits_{{{\text{i}} = 1}}^{n} {\prod\limits_{k = i}^{n} {R_{k}^{\prime} } } )^{ - 1} ,\prod\limits_{k = i}^{n} {R_{k} } > 1.8} \\ \end{array} } \right\} $$
(8)
$$ \omega_{k - 1}^{\prime} = R_{k} \omega_{k}^{\prime} \quad \left( {k = n, \, n - 1, \ldots ,2} \right) $$
(9)

Calculating \(\omega_{k}^{\prime}\) for each k finally yields the weight set \(\omega^{\prime} { = (}\omega_{1}^{\prime} ,\omega_{2}^{\prime} ,...,\omega_{n}^{\prime} {)}\).
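Eqs. (6)-(9) can be sketched in Python as follows; the ratio values are illustrative. `R[0], R[1], ...` hold \(R_{2}, ..., R_{n}\), where \(R_{k}\) relates adjacent weights as in Eq. (9) (\(\omega_{k-1}^{\prime} = R_{k}\omega_{k}^{\prime}\)).

```python
import math

def sequence_weights(R):
    """Subjective weights from n-1 adjacent-importance ratios (Eqs. 6-9)."""
    n = len(R) + 1
    if math.prod(R) > 1.8:                    # abnormal judgment: apply Eqs. (6)-(7)
        mu = (1.8 / math.prod(R)) ** (1 / (n - 1))
        R = [mu * r for r in R]
    # Eq. (8): w_n = (1 + sum of the tail products of R)^(-1)
    tail_sum = 0.0
    for i in range(len(R)):
        p = 1.0
        for r in R[i:]:
            p *= r
        tail_sum += p
    w = [0.0] * n
    w[-1] = 1.0 / (1.0 + tail_sum)
    for k in range(n - 1, 0, -1):             # Eq. (9): w_{k-1} = R_k * w_k
        w[k - 1] = R[k - 1] * w[k]
    return w

w = sequence_weights([1.2, 1.4])  # three hazard sources, illustrative ratios
# weights come out ordered from most to least important and sum to 1
```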

3.2.2 Objective Weighting Analysis

Currently, mainstream objective weighting methods in academic research include the entropy method, the CRITIC method, and principal component analysis [44]. These methods weight indicators based on the characteristics of the indicator values themselves and show good normative properties, but they still require expert evaluation and scoring before weighting.

Therefore, combined with the characteristics of the key hazard sources, the entropy method is selected for objective weighting. Since confidence values are calculated from the association rules in the data, they play the same role as expert scores in judging key hazard sources; the difference is that confidence is computed objectively from the data, whereas expert scores reflect subjective opinion. Replacing the original expert scores with confidence values therefore makes the weighting results more objective. Because subway passenger flow is affected to some extent by the seasons, the original data are divided into four seasonal categories. The entropy method is then used to determine the objective weights, with the following calculation steps:

  • Step 1: Determine the initial values of each hazard source indicator and construct the initial data matrix.

  • Step 2: Standardize the initial data. Suppose there are k key risk sources \(A_{1} ,A_{2} ,A_{3} ,...,A_{k}\), where the four seasonal values of each source \(A_{k}\) are given by Eq. (10):

    $$ A_{k} = \left\{ {a_{k1} ,a_{k2} ,a_{k3} ,a_{k4} } \right\} $$
    (10)

    Standardize the data of each key hazard source; the standardized value \(Y_{kl}\) is computed as shown in Eq. (11):

    $$ Y_{kl} = \frac{{a_{kl} - \min (a_{k} )}}{{\max (a_{k} ) - \min (a_{k} )}} $$
    (11)
  • Step 3: Calculate the information entropy of each key hazard source. According to the definition of information entropy in information theory, the information entropy \(E_{k}\) of hazard source k over its four seasonal values is determined by Eq. (12).

    $$ E_{k} = - \frac{1}{\ln 4}\sum\limits_{l = 1}^{4} {p_{kl} \ln p_{kl} } $$
    (12)

    where \(p_{kl}\) is calculated as shown in Eq. (13):

    $$ p_{kl} = Y_{kl} /\sum\limits_{l = 1}^{4} {Y_{kl} } $$
    (13)

    If \(p_{kl} = 0\), then \(\mathop {\lim }\limits_{{p_{kl} \to 0}} p_{kl} \ln p_{kl} = 0\) is taken.

  • Step 4: Determine the weights of each hazard source. According to the formula for calculating information entropy, the information entropy of each hazard source can be calculated as \(E_{1} ,E_{2} ,E_{3} ,...,E_{K}\). The weights of each key hazard source are calculated through the information entropy as shown in Eq. (14):

    $$ w_{l}^{^{\prime\prime}} = \frac{{1 - E_{k} }}{{k - \sum {E_{k} } }}(k = 1,2,...,n) $$
    (14)
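As a concrete illustration, Steps 1–4 can be sketched in a few lines of Python. This is a minimal sketch, not the paper's implementation: the input is assumed to be a matrix with one row per hazard source and one column per season, each row containing at least two distinct values so the min–max standardization is well defined.

```python
import numpy as np

def entropy_weights(data):
    """Entropy-method objective weights for hazard sources.

    data: (n_sources, n_seasons) matrix of raw indicator values,
    one row per hazard source, one column per season/quarter.
    Assumes every row contains at least two distinct values.
    """
    data = np.asarray(data, dtype=float)
    n, m = data.shape
    # Step 2: min-max standardization within each hazard source (row)
    mins = data.min(axis=1, keepdims=True)
    maxs = data.max(axis=1, keepdims=True)
    Y = (data - mins) / (maxs - mins)
    # Step 3: proportions and information entropy per source,
    # with the convention p*ln(p) -> 0 as p -> 0
    P = Y / Y.sum(axis=1, keepdims=True)
    with np.errstate(divide="ignore", invalid="ignore"):
        plogp = np.where(P > 0, P * np.log(P), 0.0)
    E = -plogp.sum(axis=1) / np.log(m)
    # Step 4: weights from entropy (lower entropy -> larger weight)
    w = (1.0 - E) / (n - E.sum())
    return w / w.sum()  # normalize so the weights sum to 1
```

A hazard source whose standardized seasonal proportions are more concentrated has lower entropy and therefore receives a larger weight.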

3.2.3 Composite Weighting Method

In weighting the key hazards, the position and role of each hazard in causing a hazardous event differ, so the weights assigned to the different hazards directly determine the accuracy of the constructed model. To better reflect the impact of key hazards on the operation of the subway network and to avoid subjective arbitrariness in the results as far as possible, the subjective and objective weights calculated above are combined using the multiplication and addition method to obtain the comprehensive weight, as shown in Eq. (15):

$$ w_{l} = \frac{{w_{l}^{\prime} \times w_{l}^{^{\prime\prime}} }}{{\sum\nolimits_{l = 1}^{n} {w_{l}^{\prime} \times w_{l}^{^{\prime\prime}} } }} $$
(15)

4 Construction of a Network Failure Propagation Model Based on Key Hazard Sources

Complex networks can effectively analyze the propagation mechanism and process of hazard sources in networks, and are currently widely used for analysis and traffic flow allocation in metro networks.

4.1 Construction of a Complex Network Based on Subway Network Topology Structure

The subway undertakes a large share of urban passenger transport. Similar to urban road traffic networks and railway networks, the subway network has some common characteristics, including (1) a large number of nodes with close connections between them, (2) a connection structure that reflects obvious self-organizing rules, and (3) various forms of local networks, such as radial and grid-shaped. At the same time, the subway network has its own characteristics, which significantly affect the failure chain in the network: (1) Hierarchical structure—the subway transportation network generally comprises three levels: infrastructure, trains, and passenger flow. (2) Dynamic imbalance of passenger flow—traffic demand shows obvious temporal and spatial imbalances, for example between suburban areas and bustling commercial areas, and between peak and off-peak periods. (3) Nonlinear correlation—infrastructure, trains, and passenger flow in the metro operation network are nonlinearly correlated and influence one another. According to the theory of disaster propagation, the occurrence and development of risk events in a complex network are transmitted and amplified by the mutual coupling in the network topology, which leads to periodic fluctuations in the stability of the entire network.

The subway topology network consists of stations and connecting lines. The Space-L and Space-P methods are the two most commonly used methods for constructing complex networks. After a comprehensive comparison, the Space-L method better expresses the actual topological structure of the subway network in geographical space and is therefore adopted to construct the metro topology. Because passenger transfers and train turn-backs significantly affect the topology network, these nodes must be handled separately. The number of transfers and the transfer time strongly influence passengers' choice of travel path. Therefore, a transfer station is split into two nodes on different lines that are connected by an edge weighted with the passenger transfer time. The transfer station then has the same name but different numbers in the network, as shown by S1 and S2 in Fig. 5. When a serious failure prevents trains from stopping or operating at a station, a turn-back station can be used to reduce the impact of the station failure on the entire network. At the same time, the topological structure of the subway network needs to be updated when a station failure occurs, as shown in Fig. 6.

Fig. 5
figure 5

Handling of transfer stations in topological network

Fig. 6
figure 6

Common station and turn-back station processing under fault condition

In Fig. 6, when normal station S4 malfunctions, normal station S5 cannot operate because it lacks a turn-back point, while the other stations on the line can still operate normally. When turn-back station S3 malfunctions, normal stations S2, S4, and S5 also cannot operate normally. In actual subway operation, the conditions of different stations and line sections differ, so each part of the subway network must be weighted to construct a weighted network, as shown in Eq. (16).

$$ \begin{gathered} N_{w} = \{ S_{i} ,E_{ij} ,W\} \hfill \\ \left\{ {\begin{array}{*{20}l} {S_{i} \, i \in [1,n] \, } \\ {E_{ij} \, i,j \in [1,n] \, i \ne j \, } \\ {W = \{ W_{{s_{i} }} (t),W_{{\overline{{s_{i} }} }} (t),W_{{e_{ij} }} (t)\} } \\ \end{array} } \right. \hfill \\ \end{gathered} $$
(16)

where \(S_{i}\) is the set of nodes and \(E_{ij}\) represents the edge connecting node \(s_{i}\) and node \(s_{j}\). \(W\) is the set of weights of the nodes and edges: \(W_{{s_{i} }} (t)\) is the weight of node \(s_{i}\), \(W_{{\overline{{s_{i} }} }} (t)\) is the transfer weight of transfer station \(\overline{{s_{i} }}\), \(W_{{e_{ij} }} (t)\) is the weight of edge \(e_{ij}\), and t is time. The degree of node \(s_{i}\) can be expressed as \(D_{i} = \sum {x_{i\_j} }\), where \(x_{i\_j}\) indicates the connection between node \(s_{i}\) and node \(s_{j}\): if node \(s_{i}\) is connected with \(s_{j}\), \(x_{i\_j} = 1\); otherwise \(x_{i\_j} = 0\).

The weight \(W\) is determined by the characteristics of the different parts of the subway network. \(W_{{e_{ij} }} (t)\) is determined by the train running time on the line between stations \(i\) and \(j\), including the running time and stopping time of the train, which are fixed and can be read from the train timetable. \(W_{{s_{i} }} (t)\) is determined by the time it takes passengers to enter and exit station \(s_{i}\). \(W_{{\overline{{s_{i} }} }} (t)\) is mainly determined by the passenger transfer time at transfer station \(\overline{{s_{i} }}\). In the event of station failures, there may be a large number of stranded passengers; therefore, the passenger transfer time is calculated taking into account the efficiency of the escalators inside the station, as shown in Eq. (17).

$$ \begin{gathered} W = \{ W_{{s_{i} }} (t),W_{{\overline{{s_{i} }} }} (t),W_{{e_{ij} }} (t)\} \\ \left\{ {\begin{array}{*{20}c} {W_{{s_{i} }} (t) = T_{i}^{\prime} + T_{i}^{^{\prime\prime}} + \rho_{i} [K_{i} (t)/O_{t} ]^{{\sigma_{i} }} ,\forall i} \\ {W_{{\mathop s\limits^{ - }_{i} }} (t) = S_{i} /V_{i} + \,1/2F_{a} + [E_{i} (t)/2n_{i} \mu_{i} - S_{i} /4n_{i} {\text{v}}_{i} ]} \\ {W_{{e_{ij} }} (t) = T_{{e_{ij} }} } \\ \end{array} } \right. \\ \end{gathered} $$
(17)

\(T_{i}^{\prime}\) represents the time passengers spend entering the station, and \(T_{i}^{^{\prime\prime}}\) the time they spend exiting it. \(\rho_{i} [K_{i} (t)/O_{t} ]^{{\sigma_{i} }}\) represents the travel delay caused to passengers by station malfunctions.

\(K_{i} (t)\) represents the passenger flow at station \(s_{i}\) at time t, and \(O_{t}\) represents the maximum capacity of the train. \(\sigma_{i}\) and \(\rho_{i}\) are two congestion delay parameters. \(S_{i} /V_{i}\) is the transfer walking time, where \(S_{i}\) is the transfer distance and \(V_{i}\) is the average speed of transferring passengers. \(F_{a}\) represents the departure interval of the trains, so \(\frac{1}{2}F_{a}\) represents the average waiting time of passengers. \([E_{i} (t)/2n_{i} \mu_{i} - S_{i} /4n_{i} V_{i} ]\) represents the transfer congestion delay, where \(E_{i} (t)\) is the maximum queuing capacity, \(n_{i}\) is the number of escalators of various types in the station, and \(\mu_{i}\) is the output rate of the automatic and pedestrian escalators. \(T_{{e_{ij} }}\) represents the train's running time on the corresponding section of the line.

Overall, when calculating the weight of passenger travel paths, the time cost of each effective path is used as the weight to construct the calculation model, as shown in Eq. (18):

$$ W_{od} = \sum\limits_{i = 1}^{n} {\left[ {W_{{e_{ij} }} (t) + W_{{s_{i} }} (t) + W_{{\overline{{s_{i} }} }} (t)} \right]} $$
(18)
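To make Eqs. (17) and (18) concrete, the three weight components and the path cost can be written as plain functions. This is a hedged sketch: the function names are illustrative, and the numeric arguments in the test are invented for demonstration, not values from the paper.

```python
def station_weight(t_in, t_out, K_t, O_t, rho, sigma):
    """W_si(t) of Eq. (17): entry time + exit time + congestion delay
    rho * (K_i(t)/O_t)**sigma caused by station malfunctions."""
    return t_in + t_out + rho * (K_t / O_t) ** sigma

def transfer_weight(S, V, F_a, E_t, n_esc, mu):
    """W_s̄i(t) of Eq. (17): transfer walking time S/V, average wait F_a/2,
    and the escalator queueing delay term E_i(t)/(2*n*mu) - S/(4*n*V)."""
    return S / V + F_a / 2.0 + (E_t / (2 * n_esc * mu) - S / (4 * n_esc * V))

def edge_weight(T_run):
    """W_eij(t) of Eq. (17): fixed section running time from the timetable."""
    return T_run

def path_cost(edge_ws, station_ws, transfer_ws):
    """Eq. (18): total time cost of one effective travel path."""
    return sum(edge_ws) + sum(station_ws) + sum(transfer_ws)
```

All times are in the same unit (e.g., seconds); a path's cost is simply the sum of the edge, station, and transfer weights it traverses.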

When calculating the shortest travel path, the occurrence of faults can cause certain stations to fail, and the spread of faults through the network can change the topology of the entire subway network; therefore, the travel cost of a given route may also change over time. In addition, the failure of a node does not mean that every affected node stops operating, so the shortest-path calculation should consider the influence of passenger flow changes on the weights. By minimizing the travel time cost, the shortest path for an OD pair can be obtained. After the faulty station and the type of fault are determined, each cycle of the cascading-fault calculation involves updating the network topology, calculating the shortest paths, and updating the passenger flows, weights, and node states. The flow chart is shown in Fig. 7.

Fig. 7
figure 7

Flowchart of shortest path calculation.

The Dijkstra algorithm is chosen to determine and process the OD considering congestion changes and cascading failures. This algorithm starts from the origin node and gradually expands outward to find the shortest path, with the destination node as the expansion endpoint. The network's node set is divided into two parts: S is the set of nodes where the shortest path has been found, and U is the set of remaining nodes where the shortest path has not yet been found. When a new shortest path is found, the nodes on the path should be added to the set S until all nodes in U are added to S. The calculation process is shown in Fig. 8.
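The expansion process described above can be written compactly with a priority queue. This is a sketch under the assumption that the weighted network is stored as an adjacency dictionary of `(neighbor, travel_time)` pairs; failed stations are modeled simply by omitting them from the dictionary.

```python
import heapq

def dijkstra(adj, origin, dest):
    """Shortest time path on a weighted metro graph.

    adj: {node: [(neighbor, travel_time), ...]}.
    Returns (total_time, path) or (inf, []) if dest is unreachable.
    """
    dist = {origin: 0.0}
    prev = {}
    visited = set()              # the set S of settled nodes
    heap = [(0.0, origin)]
    while heap:
        d, u = heapq.heappop(heap)
        if u in visited:
            continue             # stale heap entry
        visited.add(u)
        if u == dest:            # destination settled: rebuild the path
            path = [u]
            while u in prev:
                u = prev[u]
                path.append(u)
            return d, path[::-1]
        for v, w in adj.get(u, []):
            nd = d + w
            if v not in visited and nd < dist.get(v, float("inf")):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    return float("inf"), []
```

The `visited` set plays the role of S and the remaining nodes play the role of U; the loop ends as soon as the destination is settled.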

Fig. 8
figure 8

Schematic diagram of the Dijkstra algorithm for calculating the shortest path

4.2 Construction of a Network Chain Fault Propagation Model

The network fault chain propagation model for a metro system is established based on the theory of disaster propagation. The following three assumptions are made when applying this theory:

  • (1) Two node states exist in the subway network: a normal state and a fault state caused by internal events. The normal state refers to the normal operation of a node, and the fault state refers to the state when a normal node becomes a faulty node due to an internal risk event, or when the state of a nearby node changes due to the propagation of faults in other stations. When a fault node appears, the carrying capacity of that node may be reduced or even lost completely.

  • (2) The occurrence of chain faults is not only affected by internal factors of the nodes, but also by random external environmental interference.

  • (3) The station states will discretely change over time and have a certain self-recovery capability, that is, the stations themselves have a certain robustness.

According to the theory of disaster propagation [32], the attribute value of node \(S_{{\text{i}}}\) is defined as \(x_{i} (t),i \in \left\{ {1,2,3,...,n} \right\}\). The time evolution dynamics of attribute value \(x_{i} (t)\) under the combined effects of the self-recovery mechanism, disaster propagation mechanism, and other factors should satisfy Eq. (19):

$$ \frac{{{\text{d}}x_{i} (t)}}{{{\text{d}}t}} = - \frac{{x_{i} (t)}}{{\tau_{i} (t)}} + \Theta_{i} [x_{i} (t)]\left\{ {\sum\limits_{i \ne j} {\frac{{M_{ij} (t)x_{j} [t - T_{ij} (t)]}}{{f[O_{i} (t)]}}} } \right\} \times \exp \left[ {\frac{{ - \beta T_{ij} (t)}}{{\tau_{i} (t)}}} \right] + \zeta_{i} (t),\quad i,j \in 1,2,3,...,N,i \ne j $$
(19)

The left-hand side of the equation represents the trend of the node attribute value under the combined effects of the three mechanisms, and the right-hand side represents their sum; a detailed explanation is provided below:

(1) Node attribute value

The attribute value \(x_{i} (t)\) describes the state of node \(S_{{\text{i}}}\) in the network, with \(x_{i} (t) \in [0,1],\forall i\). When \(x_{i} (t) = 0\), the node is in a normal and stable state; when \(0 < x_{i} (t) < 1\), the node is in an unstable, volatile state, and the larger \(x_{i} (t)\), the more unstable the node. When \(x_{i} (t) = 1\), the node has failed. Therefore, when studying network chain faults, the weighted key hazard source value from Eq. (15) is used as the initial attribute value of the node, described by \(\varphi_{i} (t)\) as shown in Eq. (20):

$$ \varphi_{i} (t) = \left\{ {w_{l}^{1} ,w_{l}^{2} ,w_{l}^{3} ,...,w_{l}^{n} } \right\} $$
(20)

The sigmoid function [32] is used to define \(x_{i} (t)\), as shown in Eq. (21):

$$ x_{i} (t) = \left\{ \begin{gathered} 0,\varphi_{i} (t) \le \theta_{i} \hfill \\ \frac{1}{{1 + \lambda \exp \left\{ { - \frac{{\left[ {\varphi_{i} \left( t \right) - \theta_{i} } \right]\delta }}{{1 - \theta_{i} }}} \right\}}},{\text{otherwise}} \hfill \\ \end{gathered} \right. $$
(21)

in which \(\lambda = 15\) and \(\delta = 5\) [43], and \(\theta_{i}\) is the threshold fault state that the station can withstand.
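Equation (21) can be transcribed directly; the sketch below is illustrative, with the threshold \(\theta_i\) supplied by the caller and the parameter defaults taken from the values cited above.

```python
import math

def x_init(phi, theta_i, lam=15.0, delta=5.0):
    """Eq. (21): map a node's weighted hazard value phi to its initial
    attribute value x_i(t). Below the tolerance threshold theta_i the
    node is treated as fully stable (x = 0)."""
    if phi <= theta_i:
        return 0.0
    return 1.0 / (1.0 + lam * math.exp(-(phi - theta_i) * delta / (1.0 - theta_i)))
```

The mapping is zero up to the threshold and then increases monotonically toward 1 as \(\varphi_i(t)\) grows.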

(2) The self-recovery mechanism of nodes

\(- x_{i} (t)/\tau_{i} (t)\) represents the self-recovery mechanism of node \(S_{{\text{i}}}\). The self-recovery factor \(\tau_{i} (t)\) describes the time scale of the node's recovery, and \(1/\tau_{i} (t)\) can be viewed as the node's self-recovery capacity and speed. As \(\tau_{i} (t)\) increases, \(- x_{i} (t)/\tau_{i} (t)\) increases, \(- \beta T_{ij} (t)/\tau_{i} (t)\) increases, and \(dx_{i} (t)/dt\) increases. This indicates a positive correlation between \(\tau_{i} (t)\) and node failures; that is, the larger the self-recovery factor \(\tau_{i} (t)\), the weaker and slower the self-recovery, and the more likely the node is to fail. When a node is unstable, it can gradually return to a stable state through the self-recovery mechanism. The self-recovery capability is related to the stability of station facilities and equipment, the optimization of the passenger flow organization mode, and the reasonable allocation of emergency rescue resources.

(3) Mechanism of fault propagation between nodes

The second term on the right-hand side of Eq. (19) represents the fault propagation mechanism, that is, the tendency and capability of a fault to spread from one node to other nodes.

Buzna [43] pointed out that \(\Theta_{i} [x_{i} (t)]\) is nonlinear and is a smooth monotonic increasing function that is very similar to the sigmoid function commonly used in neural networks. Therefore, a sigmoid function is used to represent \(\Theta_{i} [x_{i} (t)]\), as shown in Eq. (22):

$$ \Theta_{i} \left[ {x_{i} \left( t \right)} \right] = \frac{{1 - \exp \left[ { - \alpha x_{i} \left( t \right)} \right]}}{{1 + \exp \left\{ { - \alpha \left[ {x_{i} \left( t \right) - \theta_{i} } \right]} \right\}}} $$
(22)

Here, \(\alpha\) is a gain parameter, and \(\theta_{i}\) is the threshold fault state that the station can tolerate. When \(x_{i} (t) = 0\), \(\Theta_{i} [x_{i} (t)] = 0\); when \(x_{i} (t) > 0\) and the node's fault state exceeds the station's tolerable threshold, the node's instability is transmitted to adjacent stations through trains and lines. The larger \(\alpha\) is, the faster the curve of \(\Theta_{i} [x_{i} (t)]\) changes with \(x_{i} (t)\) and the greater its sensitivity to changes. \(\beta\) is also a gain parameter. \(M_{ij} (t)\) represents the degree of influence and connection strength between node \(S_{{\text{i}}}\) and node \(S_{j}\) at time t, determined by the coupling relationship between the nodes, which includes factors such as train departure intervals, the topological characteristics of inter-station lines, and the intensity of information exchange between stations.

We usually assume that \(M_{ij} (t) = 1\) [43]; when \(M_{ij} (t)\) increases, \(dx_{i} (t)/dt\) increases, which indicates the positive correlation between \(M_{ij} (t)\) and node failures.

\(T_{ij} (t)\) is the time delay factor between node \(S_{{\text{i}}}\) and node \(S_{j}\) at time t, which can be regarded as the transmission time between the two stations and is measured by the time weight \(w_{{e_{ij} }} (t)\) of the shortest path on edge \(e_{ij}\). \(f[O_{i} (t)]\) is the degree function of station \(S_{{\text{i}}}\), where the outdegree \(O_{i}(t)\) represents the direct impact of station \(S_{i}\) on adjacent stations at time t. When \(O_{i}(t)\) increases, \(f[O_{i} (t)]\) increases and \(dx_{i}(t)/dt\) decreases, indicating a negative correlation between \(O_{i}(t)\) and node failures. \(f[O_{i} (t)]\) can be expressed as Eq. (23), where a = 1 and b = 10 [43]:

$$ f\left[ {O_{i} (t)} \right] = \frac{{aO_{i} (t)}}{{1 + bO_{i} (t)}} $$
(23)
(4) Other parameters

\(\zeta_{i} (t)\) represents random noise disturbance within the station, typically following a uniform or normal distribution. Here it is assumed to follow a uniform distribution, that is, \(\zeta_{i} (t)\sim U(0,\Delta u)\) with \(\Delta u = 0.001\).
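Putting Eqs. (19), (22), and (23) together, one discrete time step of the node dynamics can be sketched as a simple Euler update. This is a simplified sketch, not the paper's simulator: the delayed term \(x_j[t - T_{ij}(t)]\) is approximated by the current \(x_j\), and the parameter defaults follow the values cited from [43].

```python
import math
import random

def theta(x, alpha=10.0, thr=0.9):
    """Eq. (22): sigmoid switch controlling whether node i propagates."""
    return (1 - math.exp(-alpha * x)) / (1 + math.exp(-alpha * (x - thr)))

def f_deg(out_deg, a=1.0, b=10.0):
    """Eq. (23): degree function of the outdegree O_i(t)."""
    return a * out_deg / (1 + b * out_deg)

def step(x, tau, M, T, out_deg, dt=2.0, beta=0.01, du=0.001):
    """One Euler step of Eq. (19) for all nodes.

    x: attribute values; tau: self-recovery factors; M[i][j]: coupling
    strengths; T[i][j]: delay times; out_deg: outdegrees. The delayed
    x_j[t - T_ij] is approximated by the current x_j.
    """
    n = len(x)
    new_x = []
    for i in range(n):
        recover = -x[i] / tau[i]                       # self-recovery term
        spread = theta(x[i]) * sum(                    # propagation term
            M[i][j] * x[j] / f_deg(out_deg[i])
            * math.exp(-beta * T[i][j] / tau[i])
            for j in range(n) if j != i
        )
        noise = random.uniform(0.0, du)                # zeta_i(t) ~ U(0, du)
        new_x.append(min(1.0, max(0.0, x[i] + dt * (recover + spread + noise))))
    return new_x
```

Repeated calls to `step` give the time evolution of the attribute vector; values are clamped to the admissible range [0, 1].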

5 Case Study

5.1 Identification and Weighting of Key Hazards

5.1.1 Identification of Key Hazards

The metro operation dispatch log is a textual description of the actions, movements, and event status of station personnel when situations arise during subway operation; it is recorded in real time by station dispatch personnel and covers both normal and hazardous events. The log does not follow a unified format and spans many aspects of operation, including power supply, signaling, vehicles, passenger transportation, dispatch, and the objective environment under various operating conditions. The original data used for text mining are the operation dispatch logs of one operating line of a subway company from 2018 to 2020. The line's control center records the operation log, which includes the station name, date of occurrence, detailed time, event description, vehicle number, vehicle type, reporting time, reporting personnel, detailed repair time, event cause, cause subdivision, vehicle delay time, number of late departures, number of late arrivals, and vehicle operation adjustments. It also includes the "content" field, a core field that objectively records the event content. Owing to the limited length of the article and the confidentiality of the data, only selected contents from the original data are included.

The experimental platform is an Intel® Core™ i5-10210U CPU at 2.11 GHz with 16 GB of RAM, running 64-bit Windows 10. Data processing was carried out in Python using the PyCharm IDE.

Mining of hazards based on the improved Apriori algorithm

(1) Data preprocessing

First, the interference data in the original subway operation log were cleaned, and 38,465 records related to operational risk events were obtained from the 102,834 original records. The results of removing the interference data are shown in Table 6.

Table 6 Results after data cleaning

The Jieba library is used to segment the cleaned data and remove stop words, and word vector embedding is then performed to obtain the preprocessed data shown in Table 7.

(2) Data analysis

Table 7 Data description after pretreatment

The preprocessed operation log data are input into the AFP-tree algorithm, and the analysis is run in PyCharm to obtain the final transaction frequency patterns. Because the set of final patterns is large, only some are selected for display, as shown in Table 8.

Table 8 Final transaction frequency pattern

The confidence of each association rule was calculated by the AFP-tree algorithm. To make the large rule set easier to interpret, a visualization method was introduced using the R language on the RStudio platform. Initially, a low support of 0.02 and a confidence of 0.1 were set, yielding 12,257 association rules, as shown in Fig. 9a. Most of the association rules had relative support between 0.02 and 0.15 and confidence between 0.1 and 0.6. Some rules fell outside this range and are shown in lighter colors, suggesting insufficient lift and thus ineffective or weakly associated rules. Based on this analysis, the parameters were readjusted to a minimum support of 0.1 and a confidence threshold of 0.8 to further filter effective rules, resulting in the 216 association rules shown in Fig. 9b. These 216 rules were then screened to remove invalid rules with lift less than 1 and rules not containing hazard sources, resulting in 79 valid association rules, from which 27 key hazard sources were extracted.
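The screening of rules by support, confidence, and lift can be expressed as a simple filter. The sketch below uses made-up example rules for illustration; the rule tuples are not mined values from the paper.

```python
# Each rule is (antecedent, consequent, support, confidence, lift).
# These three rules are invented examples, not mined results.
rules = [
    ({"door sensor"}, {"door malfunction"}, 0.12, 0.85, 1.8),
    ({"rain"}, {"delay"}, 0.05, 0.40, 0.9),      # lift < 1: ineffective
    ({"peak hour"}, {"crowding"}, 0.20, 0.90, 2.1),
]

def screen(rules, min_sup=0.1, min_conf=0.8, min_lift=1.0):
    """Keep rules meeting the support and confidence thresholds
    whose lift exceeds 1 (i.e., genuinely positive association)."""
    return [r for r in rules if r[2] >= min_sup and r[3] >= min_conf
            and r[4] > min_lift]

valid = screen(rules)
```

With the thresholds used in the paper (support 0.1, confidence 0.8, lift > 1), only the first and third example rules survive.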

Fig. 9
figure 9

Comparison of association rules before and after adding sequential constraints

The higher the confidence of a rule, the greater the probability that the hazard source will lead to a risk event, and the more it requires focused prevention and control. The confidence levels of the hazard sources are shown in Table 9.

Table 9 Confidence value of key hazard sources

The 27 identified hazardous sources are mainly concentrated in the areas of rolling stock, signaling, and external factors, and should be subjected to focused control measures. Specifically, high-confidence and high-risk facilities and components should be controlled. Meanwhile, attention should be paid to the existence of objective hazards and the dynamics of trains and passengers at stations during operation.

5.1.2 Weighting of Key Hazard Sources

To ensure readability and due to space limitations, this section focuses on the eight key risk sources with the highest confidence levels identified in Sect. 5.1.1 and conducts a weight analysis for these sources as well as subsequent analysis on the propagation of chain failures in later sections. The specific process for assigning weights to key risk sources is described below.

(1) Subjective weighting

The subjective weighting section adopts the previously mentioned ordinal relationship method. The first step of the ordinal relationship method generally involves expert ranking of multiple factors' relative importance, followed by assigning relative importance values to adjacent factors. The confidence values of each hazard source can be obtained in the data mining process. Therefore, in the expert scoring process, the confidence values were combined with the estimated cost losses when each hazard source occurred, and the relative importance ranking of each hazard source was ultimately determined and weighted.

The estimated relative cost losses (C) when each hazard source occurs are determined by experts and are set between 0 and 1. C is multiplied by the confidence value to obtain the relative importance value of each hazard source, as shown in Table 10.

Table 10 Relative importance value of hazard sources

The relative importance ranking of the eight key hazards can be determined from Table 10 as follows:

Crowding > ATP failure > Door malfunction > Brake malfunction > Display malfunction > Passenger > ATS failure > Shield door clamp/person

The data from Table 10 are used in Eqs. (5)–(9) and normalized to obtain the results of subjective weighting for the key hazards, as shown in Table 11.

Table 11 Weighting results of key hazard sources

That is, the subjective weight \(w_{l}^{\prime}\) is:

$$w_{l}^{\prime}=(0.2018,0.1670,0.1541,0.1270,0.1143,0.0960,0.0809,0.0589)$$
(2) Objective weighting

(1) Build the original matrix

The quarterly original matrix for each hazard source is constructed in Table 12:

(2) Calculate the standardized scores of the eight key hazard sources over the four quarters, as shown in Table 13

(3) Calculate the individual P value of each hazard source from Eq. (13), as shown in Table 14; then obtain the information entropy of each key hazard source from Eq. (12), as shown in Table 15

(4) Finally, calculate the objective weights of the key hazard sources according to Eq. (14), as shown in Table 16

Table 12 The original matrix
Table 13 Standardization of key hazard source scores
Table 14 Single P value of each hazard source
Table 15 Information entropy of key hazard sources
Table 16 Objective weights of key hazard sources

The objective weight \(w_{l}^{^{\prime\prime}}\) is:

$$ w_{l}^{^{\prime\prime}} = (0.1349,0.1348,0.1248,0.1647,0.1067,0.1207,0.1012,0.1122) $$
(3) Combinatorial weighting

Combining subjective and objective weighting methods and substituting them into Eq. (15), the combined weight is obtained as follows:

$$ w_{l}= \frac{w_{l}^{\prime} \times w_{l}^{^{\prime\prime}}}{\sum\nolimits_{l=1}^{n} {w_{l}^{\prime} \times w_{l}^{^{\prime\prime}}}} = (0.21, 0.18, 0.15, 0.16, 0.09, 0.09, 0.06, 0.05) $$
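The combined weights can be checked directly from the subjective and objective weight vectors reported above by applying Eq. (15):

```python
# Subjective and objective weight vectors from Tables 11 and 16.
w_subj = [0.2018, 0.1670, 0.1541, 0.1270, 0.1143, 0.0960, 0.0809, 0.0589]
w_obj  = [0.1349, 0.1348, 0.1248, 0.1647, 0.1067, 0.1207, 0.1012, 0.1122]

# Eq. (15): elementwise product, normalized by its sum.
prod = [a * b for a, b in zip(w_subj, w_obj)]
total = sum(prod)
w_comb = [round(p / total, 2) for p in prod]
print(w_comb)  # [0.21, 0.18, 0.15, 0.16, 0.09, 0.09, 0.06, 0.05]
```

Rounded to two decimals, this reproduces the combined weight vector given in the text.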

5.2 Transmission of Subway Network Cascading Failures

5.2.1 Analysis of Complex Topological Network Characteristics of the Shanghai Metro

The network propagation model of key hazard sources constructed in Sect. 4 was applied to analyze the complex network characteristics of the Shanghai subway and to build a complex network model of the subway. A network crawler was used to extract the line and station coordinate data of the Shanghai subway network from Baidu Maps, and the results were saved as .shp files. ArcMap software in ArcGIS was used to visualize and display the data, as shown in Fig. 10.

Fig. 10
figure 10

ArcMap mapping of the Shanghai Metro network


Using the spatial join function in ArcMap, the connection relationships in the geographic coordinates were transformed into an adjacency matrix and stored in tabular form, resulting in a 508 × 508 adjacency matrix of the Shanghai Metro network. Under the Space-L method, the degree of a subway station represents the number of adjacent stations around it. For example, from Xujiahui Station, passengers can reach Shanghai Stadium and Hengshan Road Station on Line 1, Jiaotong University and Shanghai Swimming Center Station on Line 11, and Yishan Road and Zhaojiabang Road Station on Line 9. Xujiahui Station thus has six adjacent subway stations, so its degree is 6. The degree values of all 508 stations were calculated from the station adjacency matrix. Table 17 summarizes the degree values of some representative stations, most of which offer high transfer convenience (such as the large transfer hubs People's Square Station and Century Avenue Station). The average degree of the entire Shanghai Metro network is 2.38, indicating that each station can directly reach about two to three neighboring stations and that traveling by subway is relatively convenient. To quantify the probability distribution of station degrees, the distribution was plotted with Python, as shown in Fig. 11; the probability of a station having a degree of 2 is as high as 71%.
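The degree statistics can be reproduced from any adjacency matrix in a few lines. The sketch below uses a 4-station toy matrix standing in for the 508 × 508 Shanghai matrix:

```python
import numpy as np

# Toy stand-in for the Space-L adjacency matrix:
# A[i, j] = 1 when stations i and j are adjacent on some line.
A = np.array([
    [0, 1, 1, 0],
    [1, 0, 1, 1],
    [1, 1, 0, 0],
    [0, 1, 0, 0],
])

degrees = A.sum(axis=1)        # degree of each station (row sums)
avg_degree = degrees.mean()    # network average degree
# empirical degree probability distribution P(k)
ks, counts = np.unique(degrees, return_counts=True)
pk = counts / len(degrees)
```

Applied to the real 508 × 508 matrix, `avg_degree` gives the reported 2.38 and `pk` gives the degree probabilities plotted in Fig. 11.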

Fig. 11
figure 11

Probability statistics of Shanghai Metro station degrees

Table 17 Example of partial station degrees

Using Python, the node degree distribution of the subway network was calculated and the cumulative degree distribution was plotted, as shown in Fig. 12. The horizontal axis is the logarithm of the node degree, and the vertical axis is the logarithm of the probability that a station's degree exceeds that value. Figure 12 shows that the sample stations decay slowly in double logarithmic coordinates, and the cumulative degree distribution of the Shanghai subway stations basically conforms to a power law. Therefore, in L space, the degree distribution of the Shanghai subway network can be roughly described by a power law distribution, indicating that the Shanghai subway network is a scale-free network.

Fig. 12
figure 12

Cumulative degree distribution in double logarithmic coordinates

5.2.2 Analysis of Chain Failure Propagation in the Shanghai Metro Network

The OD matrix was constructed using data from the Shanghai Metro automatic fare collection system for a single day. The train timetable, maximum passenger capacity, transfer distance, average walking speed of passengers, train departure interval, maximum queuing capacity, number of escalators and stairs, and escalator and stair output rates for each station are based on statistics from the Shanghai Metro operating company. From these basic data, a weighted network of the Shanghai Metro was constructed. Then, based on the theory of disaster propagation, the number of failed nodes caused by cascading failures was simulated, and the influence of the initial node attribute value \(x_{i}(t)\), determined by the weighting of critical hazards, on disaster propagation was analyzed. The other parameters were set as follows: Δt = 2 min, a total simulation time of 60 min, \(M_{ij}(t) = 1\), \(\theta_{i} = 0.9\), \(\lambda = 0.15\), \(\delta = 5\), a = 1, b = 10, α = 10, and β = 0.01 [43].

The basic assumptions are as follows:

  1. (1)

    Two node failure modes are set, including fixed node failure (representing a pre-defined station as a failure node) and random node failure.

  2. (2)

    Two types of failure state are set: failure caused by attacks, in which case the initial node attribute value is \(x_{i}(t) = 1\), meaning the node has completely failed; and failure caused by critical hazards, in which case the initial node attribute value is determined by the results of Sect. 5.1.2.

  3. (3)

    Station failures are divided into two categories: general station failures and transfer station failures.

  4. (4)

    Different values are set for the self-recovery coefficient \(\tau_{i}(t)\).

The specific simulation steps are as follows:

  • Step 1: Initialization. Determine the initial state of the network, the network topology, the initial OD matrix, the travel paths, the travel time of each path, and the passenger volume and capacity of each station. Set t = 0 and determine the cycle interval Δt.

  • Step 2: Update the network topology. Determine the station failures and station types (regular, transfer, or terminal), determine the deletion or retention of nodes and edges in the network based on Figs. 5 and 6, and then update the entire network accordingly.

  • Step 3: Update the degree function of the stations based on the updated network topology.

  • Step 4: Update the travel path data. Based on Eqs. (17) and (18), calculate the time cost of each path.

  • Step 5: Compute the shortest path. Calculate the shortest paths using Dijkstra's algorithm, following the calculation steps in Figs. 7 and 8.

  • Step 6: Update passenger flow at each station. In each cycle interval Δt, the passenger traffic volume of each station is calculated from the OD data and the shortest paths in the network within the Δt period.

  • Step 7: Update the self-recovery capability of each station. Calculate the self-recovery term of each node based on its attribute value, Eqs. (20) and (21), and the self-recovery factor \(\tau_{i}(t)\).

  • Step 8: Update the fault propagation mechanism. Calculate the propagation mechanism of disasters in the network based on Eqs. (22) and (23).

  • Step 9: Update the attribute values of the stations. After each cycle interval Δt, update the attribute values according to Eq. (23) and count the number of malfunctioning stations. Under the combined action of the self-recovery and fault propagation mechanisms, the evolution of the attribute values over time satisfies Eq. (19).

  • Step 10: Determine whether the computation is finished. The criteria for judgment are as follows: (1) The set simulation time has ended, and the loop ends. (2) The set simulation time has not ended, but more than half of the nodes in the network have malfunctioned, causing the network to be unable to operate normally, and the cycle ends.
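The ten steps above can be sketched as a simulation loop. The following is only a minimal illustration on a toy weighted graph, not the authors' implementation: the propagation and recovery rules of Eqs. (17)–(23) are replaced by simple placeholder updates, and the parameter names are ours.

```python
import heapq

def dijkstra(graph, src):
    """Shortest travel times from src over a weighted adjacency dict (Step 5)."""
    dist = {src: 0.0}
    heap = [(0.0, src)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, float("inf")):
            continue
        for v, w in graph.get(u, {}).items():
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist

def simulate(graph, x, theta=0.9, dt=2, total=60, tau=20.0, spread=0.05):
    """Skeleton of Steps 1-10: remove failed nodes from the working topology,
    recompute shortest paths, apply placeholder propagation/recovery updates,
    and stop when time runs out or over half the nodes have failed."""
    n = len(x)
    for _t in range(0, total, dt):                     # loop over cycles of Δt
        failed = {i for i, xi in x.items() if xi >= theta}
        if len(failed) > n / 2:                        # Step 10: network collapse
            break
        # Step 2: delete failed nodes and their edges from the working topology.
        active = {u: {v: w for v, w in nbrs.items() if v not in failed}
                  for u, nbrs in graph.items() if u not in failed}
        # Steps 3-6 (placeholder): shortest paths for reassigning passenger flow.
        paths = {u: dijkstra(active, u) for u in active}
        # Steps 7-9 (placeholder): self-recovery decay plus load from failed
        # neighbours, with attribute values clamped to [0, 1].
        for i in x:
            load = spread * sum(x[j] for j in graph.get(i, {}) if j in failed)
            x[i] = max(0.0, min(1.0, x[i] - x[i] * dt / tau + load))
    return {i for i, xi in x.items() if xi >= theta}
```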

The specific calculation process is shown in Fig. 13:

Fig. 13

Flow chart of disaster propagation simulation

Simulation results were obtained and compared under the different combinations of conditions. Because of the large number of iterations, the full set of result graphs is too long to present; only representative screenshots are shown in Fig. 14. The left side shows the simulation process with random faulty nodes, and the right side shows the simulation process with specified faulty nodes.

Fig. 14

Fault node simulation process

For convenient comparison and observation, all final simulation results are summarized and presented in the form of line graphs, as shown in Figs. 15, 16, 17, and 18, in which R is random, F is fixed, A is attack, G is general station, T is transfer station, and H is hazard source.

Fig. 15

Scale of failure stations under different simulation scenarios (a)

Fig. 16

Scale of failure stations under different simulation scenarios (b)

Fig. 17

Scale of failure stations under different simulation scenarios (c)

Fig. 18

Scale of failure stations under different simulation scenarios (d)

In Fig. 15, for fixed node failure, the initial faulty node is preset as People's Square Station in all combinations. People's Square Station is an interchange station for Shanghai Metro Lines 1, 2, and 8, and plays an important role in the transportation network of Shanghai Metro. After analyzing Fig. 15, we can draw the following conclusions:

  1. (1)

    As the simulation time increases, the scale of cascading station failures under the different combinations keeps growing and then stabilizes within a fixed range. Within 20 min, the Shanghai Metro loses its normal operational function.

  2. (2)

    The number of failed stations under fixed station failure is higher than under random station failure, indicating that the network topology is more robust to random failures. However, when one or more nodes in the network are deliberately disabled, these nodes fail easily, and the disaster propagation mechanism can affect the entire network.

  3. (3)

    The number of failed stations under initial interchange station failure is higher than that under initial ordinary station failure, the cascading failure speed is faster, and the failure propagation range is wider. This means that in the subway transportation network, interchange stations have a substantial influence on the scope and intensity of failure propagation. In terms of transportation operation organization, it is necessary to pay special attention to the land use, bus connections, passenger flow organization, and other aspects of large-scale interchange stations in the network.

  4. (4)

    The largest scale of failed stations occurs under a fixed attack on an interchange station (43 failed stations), meaning that an initial failure caused by a terrorist attack on a fixed interchange station has the greatest impact on the normal operation of the Shanghai Metro network. The smallest scale occurs under a random attack on an ordinary station (31 stations), meaning that an initial failure caused by an attack on a randomly chosen ordinary station has the smallest impact on the normal operation of the network.

Figure 16 shows the fault scale of Shanghai subway stations with fixed transfer stations under different initial station attribute values for different hazards, while Fig. 17 shows the fault scale of Shanghai subway stations with random transfer stations under different initial station attribute values for different hazards. Observing Figs. 16 and 17, the following conclusions can be drawn:

  1. (1)

    In Fig. 16, the higher the weight value of the critical hazard, the larger the fault scale and the wider the impact. This indicates that the higher the initial station attribute value, the more easily the faults caused by critical hazards spread through the Shanghai subway network, typically reaching their maximum impact within 20 min. Among them, critical hazard 1 has the widest impact range, involving 37 stations, while critical hazard 8 has the smallest impact range but still affects 25 operating stations.

  2. (2)

    Figure 17 is similar to Fig. 16, but the overall fault scale is smaller. This indicates that, for the same hazard causing station faults, the fault scale under randomly occurring transfer station failures is generally smaller than under specified station failures, again showing that the Shanghai subway network is more robust under random settings.

  3. (3)

    Comparing Figs. 16 and 17, it can be found that critical hazards with higher weight values generally have a more extensive impact on the Shanghai subway network during the simulation and usually spread rapidly during two time windows, 2–4 min and 14–18 min, when the growth rate of the fault scale increases significantly. Therefore, it is particularly important to handle such problems promptly and effectively when subway operation safety incidents occur, to prevent escalation.

In Fig. 18, the influence of different self-recovery factors on the impact of Shanghai Metro transfer stations in the event of attacks or failures caused by critical hazards is fully considered. The following conclusions can be drawn from the figure:

  • (1) Under the same initial fault points, as the self-recovery capability \(1/ \tau_{i}(t)\) decreases, i.e., as the self-recovery factor \(\tau_{i}(t)\) increases, the node fault scale increases. When a node's self-recovery ability weakens, the time for the node to return to a normal state increases. Therefore, over the same period, the larger the node's self-recovery factor, the harder it is for the node's unstable state to repair itself; the disaster propagation mechanism gradually dominates, resulting in larger node fault scales.

  • (2) There is no obvious functional relationship between the fault scale and the self-recovery coefficient. When the self-recovery factor falls within certain ranges, the number of failed stations increases rapidly. For example, for fixed transfer stations under attack, when \(\tau_{i}(t) \in [34, 40]\) and \(\tau_{i}(t) \in [14, 18]\), the fault scale of several simulation combinations increases rapidly.

  • (3) The self-recovery factor is positively correlated with the cascading failure scale. Therefore, when a network experiences cascading failures, emergency resources can be deployed, the stability of station facilities and equipment strengthened, passenger flow organization optimized, and the self-recovery ability enhanced, reducing the self-recovery factor of nodes so as to control cascading failures and reduce the fault scale in the network.
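The qualitative effect in point (1) — weaker self-recovery (a larger \(\tau_{i}(t)\)) leaves more of a disturbance in place after each cycle — can be illustrated with a one-node sketch. The geometric-decay recovery rule below is our assumption for illustration, not Eq. (19):

```python
def residual_after(x0, tau, dt=2.0, cycles=10):
    """Attribute value remaining after `cycles` steps when only the
    self-recovery mechanism acts: each Δt removes a fraction dt/tau."""
    x = x0
    for _ in range(cycles):
        x -= x * dt / tau
    return x

# A larger self-recovery factor tau means slower decay of the disturbed
# state, hence more residual instability over the same period.
weak = residual_after(1.0, tau=40.0)    # weak self-recovery
strong = residual_after(1.0, tau=10.0)  # strong self-recovery
assert weak > strong
```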

6 Conclusions and Further Studies

Based on the association rule theory and text mining methods, this paper proposed the AFP-tree algorithm to mine the operation log data of the Shanghai Metro, identified the key hazards that cause operational risks, conducted a weighted analysis of the key hazards, and determined the proportion of each specific hazard in the subway network operation process. Furthermore, disaster propagation theory was introduced to investigate the propagation time and impact range of each key hazard in the subway network fault propagation process by constructing a subway network chain fault propagation model with the key hazards as the changing indicators. The specific research conclusions are as follows:

  1. (1)

    By using the AFP-tree algorithm to analyze the operation text data of the Shanghai Metro, 27 key hazards in the Shanghai Metro operation process, including high passenger flow, door malfunction, and shielding door clips, were identified. The importance of the 27 key hazards was sorted according to the confidence level, and the algorithm was proven to be effective in identifying transportation hazards and has practical guiding significance for enterprise operation safety.

  2. (2)

    Among the key hazards, the eight hazards with the highest confidence level were selected, and the subjective and objective weights were calculated by the sequence relationship method and entropy method, respectively. The eight key hazards, including high passenger flow, automatic train protection (ATP) malfunction, brake malfunction, door malfunction, display malfunction, passenger, automatic train supervision (ATS) malfunction, and shielding door clips, were weighted through combination weighting, laying a foundation for exploring the impact of key hazards on the entire Shanghai Metro network.

  3. (3)

    A subway network chain fault propagation model was constructed, and the fault propagation of the eight key hazards in the Shanghai Metro was analyzed in detail. The results showed that when a fixed transfer station was attacked, the fault scale caused by high passenger flow was the largest, with 36 affected stations; when a random transfer station was attacked, hazard events caused by shielding door clips affected the fewest stations, with 21 affected stations. The number of failed stations under the different conditions reached its maximum within 16–20 min, and the specific hazards had different impacts on the subway network. The example analysis also found that, across the different self-recovery factors, the number of failed stations increased significantly when the self-recovery factor was in the range 14–18, indicating a positive correlation between the fault scale and the self-recovery factor.

While the case study establishes the applicability and validity of the methodology presented in this paper and yields conclusions with practical value, there are limitations in both the number and the spatial scope of the collected data cases. Future research should examine the safety risk factors affecting subway operation more comprehensively, considering the circumstances of each city so as to align with the actual requirements of operational management. Additionally, the hazard mining in this paper focuses on evaluating the likelihood of concurrent hazard occurrences and the associated risk events; subsequent work will integrate an analysis of risk-loss dimensions to further refine the weight allocation of hazards in the analysis of failure propagation.