Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Examining Bayesian network modeling in identification of dangerous driving behavior

  • Yichuan Peng,

    Roles Formal analysis, Funding acquisition, Resources, Supervision, Writing – review & editing

    Affiliations Jiangsu Key Laboratory of Traffic and Transportation Security, Huaiyin Institute of Technology, Huaian, China, Key Laboratory of Road and Traffic Engineering, Ministry of Education College of Transportation Engineering, Tongji University, Shanghai, China

  • Leyi Cheng,

    Roles Data curation, Formal analysis, Methodology, Writing – original draft, Writing – review & editing

    Affiliation Key Laboratory of Road and Traffic Engineering, Ministry of Education College of Transportation Engineering, Tongji University, Shanghai, China

  • Yuming Jiang ,

    Roles Supervision, Writing – review & editing

    14191@tongji.edu.cn

    Affiliation Key Laboratory of Road and Traffic Engineering, Ministry of Education College of Transportation Engineering, Tongji University, Shanghai, China

  • Shengxue Zhu

    Roles Data curation, Funding acquisition, Supervision, Writing – review & editing

    Affiliation Jiangsu Key Laboratory of Traffic and Transportation Security, Huaiyin Institute of Technology, Huaian, China

Abstract

Traffic safety problems are still very serious and human factor is the one of most important factors affecting traffic crashes. Taking Next Generation Simulation (NGSIM) data as the research object, this study defines six control indicators and uses principal component analysis and K-means++ clustering methods to get the driving style of different drivers. Then use the Bayesian Networks Toolbox (BNT) and MCMC algorithm to realize the structure learning of Bayesian network. and parameter learning was completed through Netica software. Finally, the vehicle-based traffic crash risk model was created to conduct sensitivity analysis, posterior probability inference, and simulation data was used to detect the feasibility of the model. The results show that the Bayesian network modeling can not only express the relationship between the crash risk and various driving behaviors, but also dig out the inherent relationship between different influencing factors and investigate the causes of driving risks. The results will be beneficial to accurately identify and prevent risky driving behavior.

1. Introduction

With the development of the transportation industry, the number of cars has increased, and the situation of road traffic safety has become more severe. The "2018 Global Road Safety Report" released by the WHO pointed out that approximately 1.35 million people die from road traffic collisions every year, 3,700 people die from car accidents every day, and one person loses his life on the road every 24 seconds. The causes leading to traffic crashes are diverse and complex [13], and the influencing factors mainly include human factor, vehicle factor, roads, and environmental factor [4]. According to previous research [511], human factor is one of the most important factors. Therefore, identification of dangerous driving behaviors in a timely manner can reduce the risk of driving and improve the safety of the road traffic system.

In 1993, the driving style was first defined as the driver’s habitual driving method during driving, and it was emphasized that driving style is a unique driving attribute of each person [12]. It will affect the driver’s speed control, driving awareness, driving skills and many other aspects during driving which have a great relationship with traffic safety [1315]. Currently, there are two main types of driving style recognition. One is a subjective questionnaire survey. Li [16] et al. used a standard driver behavior questionnaire on 225 non-professional drivers to determine the number of driver style categories based on the fuzzy C-means (FCM) algorithm. Liu Jing [17] and others used the Multi-Dimensional Driving Style Scale and the General Decision Style Scale by surveying 199 drivers to study the relationship between various factors and driving style. However, questionnaire surveys are subjective and may affect the research results due to the driver’s cognitive bias. To reduce subjectivity, this article uses another method based on vehicle kinematics parameters to identify driving style. This paper uses principal component analysis for dimensionality reduction and then uses k-means clustering analysis to classify driving style, and uses the elbow method to determine the number of classifications based on the NIGSIM data set which comes from the US "Next Generation Simulation" (NGSIM) program and will be explained in detail later.

Among the related methods of risk evaluation, the commonly used methods are fuzzy evaluation method [18], risk index method [19], regression model [20], decision tree, K-means algorithm [21], bayesian method [22, 23] and neural networks [24]. Wu et al [25] took various bad driving behaviors as evaluation indicators, and obtained the main factors affecting traffic crashes through fuzzy evaluation. Zhang [26] proposed to use equivalent acceleration as the weighting index and use the driving risk index proposed by Toledo and the safety threshold to judge whether the driving behavior is safe. Taking the accident samples as the research object, Ye [27] constructed a generalized ordered logit model to estimate the distribution probability of different severity levels of crashes and identify the main factors affecting different severity levels of rollover crashes. Sheng Dong [28] established a binary logit model to perform simulation and analysis of rear-end collisions. Zhang [29] applied the CART decision tree algorithm to focus on the driving behavior to explore the impact on the severity of the consequences of the crash. Yanyong Guo et al [30] developed traffic conflict-based real-time safety models for signalized intersections using multiple indicators under the Bayesian framework. Tarek et al [31] proposed a hierarchical Bayesian peak over threshold approach for conflict-based before-after safety evaluation of Leading Pedestrian Intervals.

In this paper, we employed Bayesian network to construct a vehicle-based traffic crash risk model. Compared with methods such as fuzzy mathematics and analytic hierarchy process, the objectivity of Bayesian network is stronger. Compared with regression models, Bayesian networks can better show the correlation between different crash risk factors in complex systems while odds ratios from a logistic regression can only show the relationship between crash risk and various factors. Compared with neural networks, Bayesian networks are more explanatory. Model visualization can directly show the relationship between various influencing factors, and make inferences by setting evidence variables. The Bayesian network graphically describes the relationships between independent and dependent variables. According to the Bayesian network structure, prior probability, and the conditional probability table of each node, the probability of event occurrence can be predicted. Therefore, the Bayesian network is selected for the prediction of traffic crash risk.

2. Data preparation

2.1 Data description

In this study, we used NIGSIM data to establish a vehicle-based risk assessment model and used the simulated data to verify the model.

NIGSIM data [3234] comes from the US "Next Generation Simulation" program which collected vehicle trajectory data on us-101 southbound and Lankershim Avenue in Los Angeles, California, I-80 eastbound in Emeryville, California, and Peachtree Street in Atlanta, Georgia. And this study selected us-101 data for analysis. The simulation data were collected from a driving simulation system, which can simulate the driving environment and driving behavior of the vehicle. The driver controls the vehicle model through acquisition modules such as the keyboard and steering wheel, and adopts different response methods in different driving environments.

The us-101 data contains 25 attributes such as Vehicle ID, Frame ID, Global time, Local X, and Local Y, etc. The length of the study area is 640m, including 5 main lanes; 1 distribution lane is located between the entrance of Ventura Boulevard and the exit of Cahuenga Boulevard. Fig 1 shows the road section of us-101.

We selected the vehicles on the main lane in order to ensure the accuracy of the data, which accounts for about 97% and selected the car as the research object and then converted the British unit into the international standard unit.

The contents of the data after processing is shown in Table 1:

2.2 Risk assessment indicators

There are many factors influencing road traffic safety. As we mentioned at the beginning of the study, dangerous driving behavior is one of the leading causes of the traffic crashes. This part focuses on the identification of dangerous driving behaviors by establishing eight vehicle-based risk assessment indicators based on the collected data.

2.2.1 Car following interval control indicator.

In order to have enough reaction time to deal with unexpected accidents for the driver, a reasonable safety distance between vehicles needs to be guaranteed. The minimum distance between vehicles can be derived by analyzing the braking process of vehicles as shown in Fig 2. If the distance between front and rear vehicles is less than the minimum vehicle distance during driving, it is considered to have a certain driving risk.

(2.1)

In the picture, Xi−1, Xi means the location of the two vehicles before braking and deceleration, D represents the distance between the two vehicles, t represents the reflected time [35] of the following car which means the time from when the vehicle in front slows down to when the rear vehicle slows down, and the distance traveled by the rear vehicle during this period is reflected distance which is represented by Vit. According to the dynamic formula, the braking distance of the front vehicle is Vi−12/2ai−1, and the driving distance of the rear vehicle during braking is Vi2/2ai. Assuming that the speed and acceleration of the front and rear cars are the same before braking, the minimum vehicle spacing Dmin between them can be obtained.

(2.2)

Using the ratio ζ of the minimum vehicle distance to the actual vehicle distance as the following interval control index to evaluate the driver’s control of the vehicle distance, the calculation formula of ζ is as follows: (2.3)

2.2.2 Sharp acceleration and deceleration control indicator.

The sudden acceleration and deceleration caused by poor control will bring crash risks. Set the abrupt acceleration threshold , and abrupt deceleration threshold , and when abrupt acceleration is greater than or the abrupt deceleration is less than , it is judged that the abrupt speed change occurs. Define the continuous rapid acceleration/deceleration time threshold as T. According to the three different states of no speeding behavior, speeding behavior, and continuous speeding behavior, the risk levels are divided into three categories: low risk, normal risk, and high risk. Compared with the previous simple classification of whether there are over-speeding behaviors based on only acceleration, this classification is more detailed and accurate.

2.2.3 Frequent acceleration and deceleration control indicator.

Frequent acceleration and deceleration mainly mean that the speed of the vehicle changes frequently with a short period. Frequent acceleration and deceleration are not illegal in traffic laws. Therefore, drivers pay relatively little attention to such dangerous driving behaviors. What’s more, in the process of frequent acceleration and deceleration, fuel consumption will increase, causing environmental pollution and economic waste. Therefore, the driver should be reminded in time when the vehicle speed is detected to be unstable.

Define Q as the oscillation frequency of acceleration in time T, n means the number of acceleration changes in the time T, so the calculation formula is: (2.4)

2.2.4 Line driving control indicator.

Define D1 and D2 as the distance respectively between the vehicle and the left or right sides of the lane. Set the safety distance as D. There is a high driving crash risk when the condition D1<D/2 or D2< D/2 is satisfied. Considering that the data with a small value may be caused by two reasons: driving on the line or changing lanes, a time threshold T is set to eliminate the interference of the lane changing behavior.

2.2.5 Serpentine driving control indicator.

The serpentine driving control indicator is created to study the dangerous driving situation where the vehicle shakes frequently in a short period of time based on the distance collected by the vehicle from the left side of the lane every time.

Within time T, the sloshing frequency W of the vehicle is determined by judging the change of the distance of the vehicle relative to the left side of the lane. When the value of W is larger, the driving behavior is more dangerous. The calculation formula of W is as 2.5: (2.5)

Where k is the number of shaking in time T.

2.2.6 Speeding control indicator.

Set a speed threshold V, and when the vehicle speed exceeds the threshold V, it is judged that an overspeed behavior has occurred. Then for each overspeed behavior, calculate the overspeed duration, and define the overspeed time threshold T.

2.2.7 Frequent lane change control indicator.

Define P as the frequency of lane change in T time, the calculation formula is: (2.6)

N is the number of lane changes in T time, and the value of N increases by 1 when the number of lanes where the vehicle is located changes.

2.2.8 Driving style.

Driving style is closely related to driving safety and aggressive driving style is usually more likely to cause traffic crashes. In order to combine with the actual situation and reduce the impact of subjective questionnaire surveys on the results, this paper used vehicle kinematics data to classify driving styles.

Speed and acceleration can show the driving habits, and the frequency of lane changes and the following distance can reflect the driving personality. This article selected 13 evaluation indicators about driving style, as shown in Table 2:

Then use the principal component analysis method to reduce the dimension, and finally use the elbow method to determine the number of driving style classifications. The calculation formula of the core index SSE (sum of the squared errors) is as follows: (2.7)

Among them, Ci is the i_th cluster, p is the sample point in Ci, mi is the centroid of Ci (mean of all samples in Ci), and SSE is the clustering error of all sample, representing the quality of the clustering effect.

3. Methods

Bayesian Networks graphically describe the independent or dependent relationship between variables. According to the Bayesian network structure, prior probabilities and the conditional probability table of each node, the probability of event occurrence can be predicted, which can intuitively show the causal relationship between data variables. This method is suitable for describing the relative relationship between multiple variables in a complex system. Suppose A and B are two random variables, A = a is a certain hypothesis, and B = b is a set of evidence. Before considering the evidence B = b, the probability estimation P(A = a) of the event A = a is called the prior probability; after considering the evidence B = b, the probability estimation P (A = a) of the event A = a is called the posterior probability. Bayes’ theorem describes the relationship between the prior probability and the posterior probability. The formula is as follows: (2.8)

The construction of Bayesian network mainly has two processes: network structure and parameter learning. Structural learning mainly includes expert experience and machine learning methods. Compared with the expert experience method, the machine learning method can avoid the influence of subjective factors. Machine learning methods include scoring-based search methods, constraint-based methods, and random sampling-based methods. The basic idea of the score-based search method is to traverse all possible structures, and then use a certain standard to measure each structure to find the best structure. In 1992, Cooper and Herskovits proposed the first Bayesian scoring function, the K2 scoring function [36, 37]; in 1995, Heckerman proposed the BD scoring function, which is a generalization of the K2 function; at the same time, Heckerman proposed the BDe scoring function based on additional likelihood equivalence hypothesis [38, 39]; Bouckaert and Suzuki proposed the K3 algorithm using a scoring function based on the Minimum Description Length (MDL) principle in information theory [40, 41]. Constraint-based Bayesian network structure learning method (also known as dependency analysis method or conditional independence test method), usually uses statistical or information theory methods to quantitatively analyze the dependence relationship between variables to obtain the optimal expression of the network structure. In 1993, the SGS algorithm proposed by Spines et al. was a typical algorithm for determining the topological structure by conditional independence tests [42]; in 2000, Spines et al. enhanced the SGS algorithm and proposed the PC algorithm [43]; in 2002, Cheng combined information theory with statistical testing and proposed the TPDA algorithm [44].

Among the machine learning methods, the learning method based on score search has a large search space and low learning efficiency and it is difficult for the method based on constraints to judge the independence between nodes. Therefore, this paper uses MCMC to learn parameters based on random sampling, which has high learning efficiency and is easy to implement. Since there is no missing data in this article, in order to improve the efficiency of parameter learning, this article used a counting algorithm for parameter learning.

In MCMC, the likelihood function is given as (2.9)

Among them, xi represents the i-th sampling sample, m represents the number of samples and MC(p) stands for Markov process.

And for counting algorithm, before it begins, the net starts off in a state of ignorance. At each node, all CPT probabilities start as uniform. Only nodes for which the case supplies values for all of its parents, have their experience and conditional probabilities modified. Each of these nodes is modified as follows.

Only the single experience number, and the single probability vector, for the parent configuration which is consistent with the case is modified. The new experience number (exper’) is found from the old (exper) by: (2.10) where degree is the multiplicity of the case.

Within the probability vector, the probability for the node state that is consistent with the case is changed from probc to probc’ as follows: (2.11)

The other probabilities in that vector are changed by: (2.12) where probc is the probability of the case.

4. Results

In the driving style recognition section, we calculated the contribution rate and cumulative contribution rate of each principal component, as shown in Fig 3. According to the principle that the cumulative contribution rate reaches 85%, the first six principal components were selected to reflect the information of the original indicators sufficiently. The principal component coefficient matrix is shown in Table 3.

thumbnail
Fig 3. Principal component contribution rate and cumulative contribution rate.

https://doi.org/10.1371/journal.pone.0252484.g003

The score of the principal component according to the analysis coefficient will be used as the input for the classification and driving style recognition model later.

Then use the elbow method to determine the number of categories. When the number of clusters increases, the degree of aggregation of each cluster will also increase, and the SSE will gradually decrease. When the value of k is less than the correct number of clusters, the increase of the value of K will significantly increase the degree of aggregation of each cluster, and the decrease of SSE is larger. Conversely, when k reaches the true number of clusters, the return on the degree of polymerization obtained by increasing k will quickly decrease, so the decline of SSE will also decrease, and eventually it will tend to be gentle. Fig 4 is the SSE change graph when the number of clusters is between 1 and 7.

As it can be seen from Fig 4, when the number of clusters is greater than 3, the change in SSE tends to be flat. Combined with current research, driving styles are mainly divided into three types: calm type, general type, and aggressive type. So, the number of driving style classifications in this study is 3.

Then draw the result of driving style recognition based on the first three principal components. The recognition result is shown in Fig 5,

This article divided the driver’s driving style into three types: calm type, general type, and aggressive type according to the degree of aggressiveness. It can be seen from the figure that the three driving styles have obvious differences in the first three principal components.

According to the previous crash risk assessment indicators, all data was used as the input for constructing the Bayesian network, and the following Table 4 was obtained by discretizing the variable of each node.

thumbnail
Table 4. Bayesian network node variables and their discrete values.

https://doi.org/10.1371/journal.pone.0252484.t004

In order to facilitate the display of the Bayesian network structure, the variable names were simplified. The corresponding relationship is shown in Table 5:

The MCMC algorithm was applied to learn the Bayesian structure of the original data. Finally, there are 21 directed edges, and the DAG structure is shown in Fig 6. 0 means there is no obvious dependency between the two nodes, 1 means there is an obvious correlation between the them.

Combining the existing prior knowledge, 18 directed edges were finally determined. The visualization of the Bayesian network structure is shown in the following Fig 7.

It can be found from the above figure that after structural learning, the car following interval control, rapid acceleration and deceleration, frequent acceleration and deceleration, line driving, snake driving, speeding, and lane change frequency all have a direct impact on the type of risk. At the same time, there is a mutual influence between various factors. For example, speeding will affect the car following index, the car following index is mainly related to the vehicle distance and vehicle speed and the vehicle speed obviously affects the distance between the front and rear vehicles. Overall, the Bayesian network structure conforms to basic logical cognition.

The parameters were estimated based on counting-learning algorithm which is a kind of Bayesian learning algorithm. Build a Bayesian network based on the results of structural learning, and obtain the Bayesian network structure. 303 3 And then get the conditional probability table of each node by counting algorithm, the Bayesian network model obtained by parameter learning in Netica software is shown in Fig 8:

thumbnail
Fig 8. Bayesian network model after learning with Netica software parameters.

https://doi.org/10.1371/journal.pone.0252484.g008

Through the counting algorithm, Netica can get the probability table of each node. The probability table shows the probability relationship between the changed node and its child nodes. Taking the Sty node as an example, this node has two child nodes Sha and Fol. Table 6 shows the conditional probability table of node Sty.

5. Discussion

To further explore the results of established vehicle-based crash risk model, we analyzed and verified the model from three aspects: model sensitivity, posterior probabilistic inference, and effectiveness.

5.1 Model sensitivity

Bayesian network sensitivity analysis refers to analyzing the impact of other nodes on the target node. Through sensitivity analysis, we can identify the factors that have a greater impact on the vehicle’s crash risk and take effective measures for the factors. The results are shown as follows in Table 7.

Mutual Information is used to measure the degree of dependence between nodes. The mutual information between two nodes can indicate whether the two nodes depend on each other. From Table 7, it can be seen that the mutual information between the line driving and the risk type is the largest which can be speculated that the vehicle has a greater possibility of line driving followed by snake driving, frequent acceleration and deceleration, speeding and so on.

5.2 Posterior probabilistic inference

The Bayesian network model can be used for probabilistic inference, including calculating the posterior probability of the target node, predicting the possibility of the result, and analyzing the main influencing factors of the result when the node state has been determined. The above two posterior probabilities of inferring results from causes and inferring causes from results are called risk prediction and causal inference, respectively.

Risk prediction refers to inputting the determined status of node variables into the Bayesian network. In Netica, if a certain node variable is determined, the corresponding state will be set to 100%, and the changes of other nodes in the entire Bayesian network can be observed. As shown in Fig 9, when the rapid acceleration and deceleration indicator is at the general risk level, the existing risk increases from 38.5% to 57.7%, and the risk level increases from a certain risk to a higher risk.

thumbnail
Fig 9. Known network changes on rapid acceleration and deceleration.

https://doi.org/10.1371/journal.pone.0252484.g009

Another advantage of Bayesian network is causal inference. Causal inference refers to two-way inference through Bayesian network. It can not only calculate the probability of the target node, but also calculate the posterior probability of other nodes when the target node is determined. To find out the most probable combination of factors, this analysis is more intuitive and can prevent the most influential factors in advance.

Assuming that the risk probability is 100%, as shown in Fig 10, it can be discovered that the safety of line driving and serpentine driving are significantly reduced. The most obvious changes in indicators are: the range of high risk increases from the initial 24.9% to 64.7%. This shows that in the absence of other evidence, the most likely cause is line driving.

5.3 Effectiveness

For the judgment of the effectiveness of the crash risk model, this paper selected the risky driving process and compared it with the normal driving process from simulation data.

Combine the video of the corresponding time to verify the risk status.

Fig 11 is a screenshot of the video corresponding to the normal driving time. The vehicle speed is lower and the driving is more stable, which is consistent with the driving behavior during normal driving.

Fig 12 is a screenshot of the video corresponding to the risky driving time. During this period, driving behaviors such as frequent vehicle lane change and continuous overtaking occur, which are consistent with the driving behavior when there is an accident risk.

Then input the data into the Bayesian network to calculate the risk value, and draw a comparison chart of the risk value during the normal driving period and the risky period in Fig 13 as follows:

thumbnail
Fig 13. Vehicle operation risks in different time periods.

https://doi.org/10.1371/journal.pone.0252484.g013

It can be seen from Fig 13 that during the operation of the vehicle, the risk state will change with time. The risk value in the risk driving process shown in the figure above fluctuates between 50-100%. For the first 0.5s, the risk value is close to 90% which can be considered that there is a higher risk in this time period. After 0.5s, as the driver correcting the behavior, the risk is reduced and remains at about 50%. The risk of normal driving process fluctuates around 10%, which is much lower than the former one, indicating that the risk is relatively low during normal driving.

Verification results show that the model is effective for vehicle-based crash risk analysis.

6. Conclusion

Based on the NIGSIM data set, this paper has developed 8 indicators to comprehensively identify dangerous driving behaviors. Compared with previous studies, this paper considered more types of dangerous driving behaviors, which can ensure the safety of vehicle operation better.

Based on the vehicle kinematics data, PCA algorithm was used for dimensionality reduction, and improved K-means algorithm was used for driving style classification. Compared with commonly used questionnaire survey methods, this method has better feasibility and objectivity.

This paper proposes to use Bayesian network to build a vehicle operation risk assessment model. Compared with commonly used methods such as fuzzy mathematics and neural network, Bayesian network is more objective and explanatory, and can analyze the correlation between various factors in depth. Through sensitivity analysis of eight factors, the mutual information between driving on the line and risk types was the largest, reaching 44.59%. It is more likely for vehicles to run on the line, which has the greatest impact on the risk types and is the most sensitive. Through causal inference, when the risk probability is 100%, the safety of line pressing and snake driving is significantly reduced. Among them, the driving index of pressing line is in a high risky range, which increases from the initial 24.9% to 64.7%. This shows that, in the absence of other evidence, the most likely cause of the risk is driving on the line, which is consistent with the conclusion of sensitivity analysis, thereby providing a research basis for the prevention of dangerous driving behavior. Once the risk level reaches a higher level, the model infers the most likely driving behavior that causes the danger based on the posterior probability, and reminds the driver to respond. Finally, the validity of the model is tested to verify that the model is effective for vehicle operation risk analysis, so that it provides a direction for the prevention of dangerous driving behaviors to fundamentally reduce the driving risk. This article only studied the risk of single-vehicle crashes, and did not comprehensively consider the interaction between other motor vehicles, which could be considered in the subsequent research.

References

  1. 1. Lu J, Xie X, Zhang R. Focusing on appraisals: how and why anger and fear influence driving risk perception[J]. Journal of Safety Research, 2013, 45(6):65–73.
  2. 2. Eboli L, Mazzulla G, Pungillo G. The influence of physical and emotional factors on driving style of car drivers: A survey design[J]. Travel Behaviour & Society, 2017, 7:43–51.
  3. 3. Ellison A B, Greaves S P, Bliemer M C. Driver behaviour profiles for road safety analysis[J]. Accident Analysis & Prevention, 2015, 76:118–132.
  4. 4. Zhang Q, Ge Y, Qu W, et al. The traffic climate in China: The mediating effect of traffic safety climate between personality and dangerous driving behavior[J]. Accident Analysis & Prevention, 2018, 113:213–223.
  5. 5. Broyna E. Human factor and traffic safety[J]. AUTOBUSY – Technika Eksploatacja Systemy Transportowe, 2017, 18(7-8):49–52.
  6. 6. Batishcheva O, Ganichev A. Measures to improve traffic safety at road junctions[J]. Transportation Research Procedia, 2018, 36:37–43.
  7. 7. Huvarinen Y, Svatkova E, Oleshchenko E, et al. Road Safety Audit[J]. Transportation Research Procedia, 2017, 20(Complete):236–241.
  8. 8. Zhide Zhang. A Data Analysis Method of Automobile Driving Behavior[J]. Science & Technology Vision, 2017, 000(001):173–174.
  9. 9. Ding Chen. Research on Abnormal Driving Behavior Recognition Technology Based on Vehicle Dynamic Monitoring Data[D]. Beijing Jiaotong University, 2015.
  10. 10. Weng Licheng. Research on Dangerous Driving Behavior Recognition Based on Multivariate Data[D]. Tongji University, 2019.
  11. 11. Luo Yi. Highway driving behavior recognition based on GPS data[D]. Harbin Institute of Technology, 2017.
  12. 12. Elander J, West R, French D. Behavioral correlates of individual differences in road-traffic crash risk: an examination method and findings.[J]. Psychological Bulletin, 1993, 113(2):279–94. pmid:8451335
  13. 13. Simons-Morton B G, Klauer S G, Ouimet M C, et al. Naturalistic Teenage Driving Study: Findings and lessons learned. Journal of Safety Research, 2015, 54, 41–44. pmid:26403899
  14. 14. Tak S, Kim S, Yeo H. Development of a Deceleration-Based Surrogate Safety Measure for Rear-End Collision Risk. IEEE Transactions on Intelligent Transportation Systems, 2015, 16(5), 2435–2445.
  15. 15. Saiprasert C, Pholprasit T, Thajchayapong S. Detection of Driving Events using Sensory Data on Smartphone[J]. International Journal of Intelligent Transportation Systems Research, 2017, 15(1):17–28.
  16. 16. Li P, Shi J, Liu X. Driving Style Recognition Based on Driver Behavior Questionnaire[J]. Open Journal of Applied ences, 2017, 7(4):115–128.
  17. 17. Jing Liu,Long Sun.The influence of driver’s decision-making style on driving style[J].Ergonomics,2015,21(04):18–21.
  18. 18. Jin L, Xian H, Niu Q, et al. Research on safety evaluation model for in-vehicle secondary task driving.[J]. Accident Analysis & Prevention, 2015, 81:243–250.
  19. 19. Moghaddam A M, Ayati E. Introducing a risk estimation index for drivers: A case of Iran[J]. Safety Science, 2014, 62(1):90–97.
  20. 20. Pande A, Chand S, Saxena N, et al. A preliminary investigation of the relationships between historical crash and naturalistic driving[J]. Accident Analysis & Prevention, 2017, 101(APR.):107–116.
  21. 21. Lin G, Zhou J B, Dong S, et al. Analysis of Urban Road Traffic Accidents Based on Improved K-means Algorithm[J]. China Journal of Highway and Transport, 2018.
  22. 22. Jean-Nicola R., Thomas S., Frederic D., etc. Risk Level Assessment for Rear-End Collision with Bayesian Network[J]. ScienceDirect, IFAC Papers OnLine 50-1, 2017, 12514–12519.
  23. 23. Guo Y., Li Z., Liu P., Wu Y. Modeling correlation and heterogeneity in crash rates by collision types using full Bayesian random parameters multivariate Tobit model. Accident Analysis & Prevention, 128, 164–174. pmid:31048116
  24. 24. Chong L, Abbas M M, Flintsch A M, et al. A rule-based neural network approach to model driver naturalistic behavior in traffic[J]. Tramsportation Research Part C Emerging Technologies, 2013, 32(jul.):207–223.
  25. 25. Jun Wu, Lu Zhang, Qinghua Gao. Application of Fuzzy Evaluation in the Influence of Expressway Driving Behavior on Traffic Accidents[J]. Journal of Inner Mongolia Agricultural University (Social Science Edition),2007(02):164–165+170.
  26. 26. Zhang Wei, Wang Wenjun, Cheng Bo. Identification method of drivers’ bad driving behavior[C]// 2011 Automotive Safety Technology Conference. 2011.
  27. 27. Zhifang Ye, Wenzhou Jin, Zuogan Tang. Estimation of injury degree in rollover accident based on generalized ordered logit model[J]. Henan Science,2019,37(01):119–126.
  28. 28. Dong S, Zhang M, Li Z. Risk Analysis of Vehicle Rear-End Collisions at Intersections[J]. Journal of advanced transportation, 2020, 2020(3):1–11.
  29. 29. Zhang Fuming. Research on the impact of taxi drivers’ driving behavior on accidents based on CART decision tree [C]. Chinese Association for Science and Technology, Ministry of Transport of the People’s Republic of China, Chinese Academy of Engineering. Proceedings of the 2019 World Transport Conference (Part 2). China Science and Technology Association, Ministry of Transport of the People’s Republic of China, Chinese Academy of Engineering: China Highway Society, 2019: 1030-1038.
  30. 30. Guo Y, Sayed T, Essa M. Real-time conflict-based Bayesian Tobit models for safety evaluation of signalized intersections[J]. Accident Analysis & Prevention, 2020, 144:105660.
  31. 31. Yg A, Ts B, Lai Z C. A hierarchical bayesian peak over threshold approach for conflict-based before-after safety evaluation of leading pedestrian intervals - ScienceDirect[J]. Accident Analysis & Prevention, 147.
  32. 32. Juan Zhou, Yulong He, Jingjing Tian. Analysis of vehicle speed characteristics based on NGSIM trajectory data[J]. Transportation Technology and Economy,2019,21(03):45–48+61.
  33. 33. Xiangran Lin. Research on lane-level vehicle lane-changing strategy based on NGSIM data[J]. Integrated Transportation,2019,41(04):120–126.
  34. 34. Dongliang Liu, Qiushuang Wang. The method of obtaining instantaneous vehicle speed based on NGSIM data[J]. Journal of Jilin University (Engineering and Technology Edition), 2018,48(01):330–335.
  35. 35. Lanxin Geng, Xinnan Liu, Daxue Liu. Research on Driver Response Time[J]. Energy Conservation and Environmental Protection in Transportation, 2015(02):25–29.
  36. 36. Cooper G.F., Herskovits E. A Bayesian Method for the Induction of Probabilistic Networks from Data. Machine Learning 9, 309–347 (1992).
  37. 37. Cooper G F, Herskovits E. A Bayesian Method for Constructing Bayesian Belief Networks from Databases[C]// UAI ’91: Proceedings of the Seventh Annual Conference on Uncertainty in Artificial Intelligence, University of California at Los Angeles, Los Angeles, CA, USA, July 13-15, 1991. 1991.
  38. 38. Heckerman D, Geiger D, Chickering D M. Learning Bayesian Networks: The Combination of Knowledge and Statistical Data[J]. Machine Learning, 1995, 20(3):197–243.
  39. 39. Heckerman D. A Bayesian Approach to Learning Causal Networks. 2013.
  40. 40. Bouckaert R R. Probabilistic network construction using the minimum description length principle[J]. Lecture Notes in Computer Science, 1993, 747:41–48.
  41. 41. Suzuki J. A Construction of Bayesian Networks from Databases Based on an MDL Principle[J]. Uncertainty in Artificial Intelligence, 1993.
  42. 42. Spirtes P, Glymour C, Scheines R. Causation, Prediction, and Search, 2nd Edition[J]. 2001.
  43. 43. Spirtes P, Meek C. Learning Bayesian networks with discrete variables from data. 1995.
  44. 44. Jie C, Greiner R, Kelly J, et al. Learning Bayesian networks from data: An information-theory based approach[J]. Artificial Intelligence, 2002, 137(1–2):43–90.