Analyzing urban traffic demand distribution and the correlation between traffic flow and the built environment based on detector data and POIs

Purpose: This paper aims to determine the spatiotemporal characteristics of urban traffic flow and their correlation with the built environment, using SCATS (Sydney Coordinated Adaptive Traffic System) and POI (point of interest) data from Shenyang, China. Methods: A standard analysis framework based on these data is proposed. The study analyzes the spatiotemporal distributions of traffic volume and the built environment influence factors determined by the geographical detector. An improved gravity model using simple structural parameters (number of lanes and road length) is proposed to estimate traffic flows at the day and peak hour scales for specific flow ranges. Results: The peak hours of different intersections and roads are heterogeneous, which reveals trip time flexibility. The correlation between peak hour flows and day flows is significant in the multidimensional analysis. The investigation of lanes yields further conclusions: in this case, when the numbers of lanes of intersections and roads exceed 14 and 4 respectively, lane resources are wasted to a great extent. There is also a certain correlation among the built environment factors. The proposed gravity model establishes a connection between the structure and function of urban roads. Conclusions: Flexible work times and places would be effective methods to reduce traffic congestion. Day flows could be estimated from a traffic survey of peak hour flows, especially in developing cities. Traffic flow is mainly concentrated in a relatively small part of the city's roads. Because the maximum service traffic volumes exhibit segmentation, the maximum optimal number of lanes of intersections and roads should be reconsidered with a view to better performance and utilization of the network. The effect of the number of lanes on service traffic volumes is found to be more significant than that of the other factors. Our conclusions will be helpful for policy-makers and for sustainable urban planning.


Introduction
Some studies have analyzed the relationship between the built environment and traffic systems, such as traffic behaviors [16][17][18][19][20], the association between network structure and road safety [13,21,22], and the correlation between traffic congestion and different attributes of urban land use [23][24][25]. Although they found strong empirical evidence of such correlations, limited research has investigated the impact of the built environment on traffic flow, or the complicated relationships between them, at the level of the network. One reason is that research data, a scarce resource in this domain, often consist of location data and other data [24,25]. Trajectory, traffic state or other data in these studies are in a sense sample data and do not cover the traffic flow of different traffic modes, which restricts their ability to reflect overall traffic flow conditions. Many existing studies therefore remain limited by the lack of a whole-network investigation. The deficiency is also common in European transport studies. For example, conclusions on the spatial distribution of traffic flow in existing studies were derived from travel time and taxi data [26,27], not from the traffic volume of all transport modes. Moya-Gómez and García-Palomares studied changes in automobile accessibility over the course of the day, as caused by congestion of the road network in eight European cities [28]; however, it is not a direct investigation of trip time. Meanwhile, the correlation between traffic flow and the built environment is seldom analyzed in European cases. Therefore, this correlation needs to be analyzed with more empirical evidence, especially from traffic detector data. The authors have also conducted a preliminary exploration of road network structure characteristics [29][30][31][32][33] and found that the number of lanes matters greatly for traffic, an emphasis that differs significantly from existing studies.
The lane is the carrier of urban traffic flow and plays a significant role in transportation systems. However, to our knowledge, empirical research involving lanes at the level of the network is still very limited. Research on the network structure is just a small step; a more important task is to establish the connection between the structure and function of the network, i.e., to determine how to predict or infer the operation law of the whole system from structural measures after quantitative observation of the structure of the network. Unfortunately, compared with the description of the network structure, research on this topic has developed slowly [10]. Our research is based on SCATS and POI data, which describe the whole traffic state in detail. The detector data from SCATS are complete and can show the real traffic burden or demand. The beauty of real data lies in this capability. Therefore, the first and most important contribution of the paper is to help researchers understand these characteristics and correlations as derived from the whole population rather than from samples, before modeling analysis and engineering application. Overall, we will answer the following two questions in the paper.
(1) Based on the empirical population evidence, what are the temporal and spatial characteristics of traffic demand in the road network, and how can they be illustrated systematically?
(2) In our case, does a correlation between traffic flow and the built environment exist? If so, is it the same as in existing studies?
Based on this simple idea, we analyze the SCATS data and the built environment data. Our empirical analysis could offer a more comprehensive understanding of the temporal and spatial distribution characteristics of urban transportation demand, and further reveal the relationships between structure and function information. The results of this study provide an empirical and theoretical reference for the network analysis and management of urban road traffic, and a basis for exploring how universal these findings are by conducting a similar analysis for European and other cities.

Traffic flow data description
In this paper, the functional information of the urban road traffic network is extracted from the SCATS (Sydney Coordinated Adaptive Traffic System) [34] in Shenyang, China. In 2014, the city, one of the biggest in the northeast region of China, had a population of 8.29 million and 1.46 million cars. As a form of spatial distribution of travel demand, traffic flow is selected as the basic parameter to reflect the function information of the urban road traffic network. The SCATS system covers a total of 525 intersections in the main urban area of the city, as shown in Fig. 1. These include T-intersections, crossing intersections and five-way intersections, with 3-24 inlet lanes per intersection.
Considering the regularity of residents' travel patterns, we randomly selected the data of a typical intersection and its western entry road for one week (July 29, 2014 - Aug. 4, 2014). In this example, the traffic flow data exhibited time similarity across days, as shown in Fig. 2, so the day with the maximum traffic demand is selected as the research object. The Pearson correlation coefficient is calculated as

$$R = \frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{\sqrt{\sum_{i=1}^{n}(x_i-\bar{x})^2}\,\sqrt{\sum_{i=1}^{n}(y_i-\bar{y})^2}} \quad (1)$$

where n is the total number of samples and i refers to a specific sample; x and y are the variables, and $\bar{x}$ and $\bar{y}$ are the means of the corresponding variables, respectively. If the two variables are positively linearly correlated, then 0 < R ≤ 1. If the two variables are negatively linearly correlated, then −1 ≤ R < 0. If there is no linear correlation between the two variables, then R = 0. Generally, if |R| > 0.8, the two variables are considered to have a strong linear correlation. At this typical intersection, the change trends and values of the 15-min flow time series during the continuous week are consistent. Fig. 2(a) exhibits the correlation between variables directly. The corresponding west entrance road has similar characteristics, as shown in Fig. 2. Table 1 shows the total traffic volume and the correlation coefficients of the 15-min flow time series of the intersection on different days. In terms of total volume, the traffic demand of this intersection is relatively stable except on Sunday (Aug. 3, 2014), and the maximum value occurred on August 1, 2014. The correlation coefficients between different dates were all R > 0.95, indicating that the flow time series of different dates have obvious time similarity. The average correlation coefficient of the flow time series of the corresponding western inlet road is 0.9692, which also indicates obvious time similarity.
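The day-to-day similarity check described above can be sketched in a few lines of Python. This is a minimal illustration of the Pearson coefficient only; the two series are hypothetical 15-min counts, not SCATS data, and the function name `pearson_r` is our own.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical 15-min counts for one intersection on two days (illustrative only).
day_a = [120, 180, 260, 310, 280, 220, 190, 210]
day_b = [115, 170, 250, 300, 290, 230, 185, 200]

r = pearson_r(day_a, day_b)
print(f"R = {r:.4f}, strong linear correlation: {abs(r) > 0.8}")
```

Applying the same function to every pair of days would give a correlation matrix of the kind summarized in Table 1.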
In view of the maximum traffic demand of August 1, 2014, the traffic of the road network on that day is selected for the following analysis.
Data quality and detector condition were checked. According to the statistics, 318 intersections produced output data on that day, 63 of which had all detectors working normally. There are 521 one-way segments with complete data, 64 of which have normal detectors in both directions.

Built environmental influence factors
The primary built environmental factors affecting the traffic state or travel behaviors can be divided into traffic-related and land-use-related factors [11,35,36]. The geographical detector [37] was introduced to assess the built environmental parameters that may be responsible for the road traffic state [24]. Zhang et al. defined the power of determinant (PD) to determine whether a spatial factor may be responsible for the clustering results of the traffic state [24]. PD is defined as

$$PD = 1 - \frac{1}{n\sigma^2}\sum_{i=1}^{k} n_{D,i}\,\sigma_{D,i}^2 \quad (2)$$

where $n_{D,i}$ is the number of samples in subregion i of the determinant D, and n is the total number of samples, $n=\sum_{i=1}^{k} n_{D,i}$, where k is the number of subregions. $\sigma^2$ is the global variance of an influence factor in the study region, and $\sigma_{D,i}^2$ is the divisional variance of subregion i. The value range of PD is [0, 1]; a larger value indicates that the factor has stronger determinant power. Zhang et al. investigated the relationship between traffic congestion and the built environment based on taxi GPS data from Shanghai, China; the built environment factors and their PD values are shown in Table 2, which presents the explanatory power of the factors. Num_bus (0.130) has the highest PD, i.e., a higher number of bus stations per 100 m along a road segment is associated with a higher possibility of congestion, because a higher density of bus stations reflects greater commuting volume along the road segment. Considering the feasibility of data collection and the PD values, the following factors were chosen for further analysis in this study: Num_bus (0.130), Rd_type (0.105), Dist_hosp (0.091), and Num_scho (0.084). Instead of the simple description of road type in Zhang's work (1 for a primary road and 2 for a secondary road), this paper uses the lane number, whose importance was shown in our previous studies [29][30][31][32][33]. The complete traffic demand data from SCATS are also more convincing than the taxi data used in Zhang's work.
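As a minimal sketch of how a PD value could be computed, the snippet below assumes the standard geographical-detector form, PD = 1 − (Σ n_i σ_i²) / (n σ²), with population variances. The function name `power_of_determinant` and the sample values are hypothetical and are not taken from Zhang et al. or from the Shenyang data.

```python
from statistics import pvariance


def power_of_determinant(values, strata):
    """PD of a categorical factor (strata) for a numeric indicator (values).

    Assumes population variances; a stratum with one sample contributes zero.
    """
    if len(values) != len(strata) or not values:
        raise ValueError("values and strata must be non-empty and equal in length")
    global_var = pvariance(values)
    if global_var == 0:
        raise ValueError("PD is undefined when the indicator is constant")
    groups = {}
    for value, stratum in zip(values, strata):
        groups.setdefault(stratum, []).append(value)
    within = sum(len(g) * pvariance(g) for g in groups.values())
    return 1.0 - within / (len(values) * global_var)


# Hypothetical congestion index per road segment, stratified by a built
# environment class (for example, a binned number of bus stations).
index = [0.2, 0.3, 0.25, 0.7, 0.8, 0.75]
bus_class = ["low", "low", "low", "high", "high", "high"]
print(f"PD = {power_of_determinant(index, bus_class):.3f}")
```

A PD close to 1 means the strata explain almost all of the variance of the indicator; a PD close to 0 means the factor has little determinant power.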
Correspondingly, this study extracted 8643 POIs (points of interest), including bus stations, hospitals and schools, using web-crawler software, as shown in Fig. 3.

Research methods
Based on the analysis above, the analytical flow of this study is shown in Fig. 4. Section 2 describes the data and methods used in the study. In section 3, the spatiotemporal characteristics of traffic demand are analyzed in two dimensions (time and space) using the actual road network data of Shenyang, China. Because traffic demand varies during the day, peak periods are investigated first. In addition to the peak time distributions at the scales of the morning, the evening and the whole day, the correlations between peak hour flows and day flows for specific peak periods are also considered. From the spatial point of view, we show the traffic flow distribution of roads with different lane numbers in the urban street network. Subsequently, the correlation between traffic flow and the built environment (Num_bus, lane number, Dist_hosp, Num_scho) is investigated. Finally, we present the research conclusions and a discussion of future work in section 4.

Traffic demand analysis
The analysis object is the city's traffic data on August 1, 2014. Because some detectors are missing or faulty, it is difficult to obtain the traffic flow of all intersections; thus the results represent a relative relationship. This section first analyzes the temporal distribution of traffic demand at three scales: the whole day, the morning peak and the evening peak.
As the bottlenecks of the urban traffic system, intersections play a significant role in transportation operation. Analyzing the peak hour and flow distributions of an intersection helps to gain a deeper understanding of the temporal operation characteristics of urban traffic flow. As shown in Fig. 3, the daily traffic flow profile of urban roads usually has a saddle shape. Let $q_{hi}$ denote the traffic volume in the i-th hour. There are peaks in the morning and afternoon, and each corresponding hour is called a peak hour. The traffic volume within the peak hour is called the peak hour flow $q_{hm}$. The peak hour flow ratio is defined as

$$\frac{q_{hm}}{Q} \quad (3)$$

In eq. (3), Q is the full-day flow, that is, $Q=\sum_{i=1}^{24} q_{hi}$.

We screened the detectors and selected the segments that are fully covered by normally working detectors. According to the statistics, 521 one-way segments meet the research requirement, with 1~7 lanes each. There are 64 two-way segments with normal data output.
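The peak hour flow ratio can be illustrated with a short Python sketch. It assumes hourly counts for one day; the counts below are hypothetical and only chosen to give a saddle-shaped profile, and the function name is our own.

```python
def peak_hour_flow_ratio(hourly_flows):
    """Return (index of the peak hour, peak hour flow / full-day flow)."""
    total = sum(hourly_flows)
    if total <= 0:
        raise ValueError("total day flow must be positive")
    peak_index = max(range(len(hourly_flows)), key=lambda i: hourly_flows[i])
    return peak_index, hourly_flows[peak_index] / total


# Hypothetical hourly counts (veh/h) for hours 0..23, illustrative only.
hourly = [60, 40, 30, 30, 50, 120, 300, 520, 480, 400, 380, 390,
          400, 390, 380, 400, 450, 500, 420, 330, 260, 200, 140, 90]

hour, ratio = peak_hour_flow_ratio(hourly)
print(f"peak hour starts at {hour:02d}:00, peak hour flow ratio = {ratio:.4f}")
```

With 15-min detector counts, the same idea would apply to a sliding window of four consecutive intervals instead of fixed clock hours.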

Correlation analysis between traffic flow and built environment
The correlation analysis in this part includes two objects, i.e. the intersections and roads.
To study the relationship between intersection traffic flow and the built environment, the aforementioned factors were first analyzed in the context of the intersections. Given that intersection traffic flow comes from the adjacent road segments, the number of lanes becomes the only analysis factor. In this section, we investigated the correlation between the actual maximum capacity and the number of lanes. The number of approach detectors, the peak hour of the day and the corresponding flow data of each intersection were counted. The traffic analysis report on major Chinese cities for the third quarter of 2014 (http://report.amap.com/download_city.do) indicates that Shenyang was at the top of the list of key cities for both peak-time and all-day congestion. Therefore, Shenyang is a typical case for analysis in China. The traffic function of the intersections at the network level is measured by the peak hour flow of the full day. Moreover, considering the lower traffic demand of some roads, the 30th percentile of the average lane hour flows (227 veh/(h·ln)) is selected as the threshold value to remove intersections under lower traffic pressure. The detector integrity rate is defined as

$$\xi_j=\frac{n_j}{N_j}\times 100\% \quad (4)$$

where $n_j$ is the number of actual detectors at intersection j and $N_j$ is the actual number of approach lanes. The sample size U is assumed to represent a set S of intersections. Other influence factors are considered in the part on roads.
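The sample screening can be sketched as follows. The sketch assumes that the detector integrity rate is the ratio of working detectors to approach lanes, and it combines the 227 veh/(h·ln) threshold with the 80% integrity bound used later for the degree-of-saturation data; whether the two filters were applied together in this way is our assumption. The class, function names and records are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Intersection:
    name: str
    detectors: int         # n_j, detectors that actually report data
    approach_lanes: int    # N_j, approach lanes at the intersection
    lane_hour_flow: float  # average lane hour flow, veh/(h*ln)


def integrity_rate(x):
    """Detector integrity rate n_j / N_j as a fraction in [0, 1]."""
    if x.approach_lanes <= 0:
        raise ValueError("approach_lanes must be positive")
    return x.detectors / x.approach_lanes


def select_samples(intersections, min_integrity=0.8, min_lane_flow=227.0):
    """Keep intersections with enough detectors and enough traffic pressure."""
    return [x for x in intersections
            if integrity_rate(x) >= min_integrity
            and x.lane_hour_flow >= min_lane_flow]


# Hypothetical records, illustrative only.
sample = [
    Intersection("A", 12, 12, 410.0),
    Intersection("B", 9, 12, 380.0),   # integrity 75%: dropped
    Intersection("C", 10, 12, 150.0),  # low traffic pressure: dropped
    Intersection("D", 8, 10, 227.0),
]
kept = [x.name for x in select_samples(sample)]
print(kept)
```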
To analyze the relationship between road traffic and POIs, we combined the roads with the same names and obtained 42 roads. The flow of each combined road is the maximum value among its segments. In the seventeenth century, Newton proposed that the gravitational force between any two objects is proportional to the product of their masses and inversely proportional to the square of the distance between them. The gravity model has since become widely used in the analysis of spatial interaction. The improved gravity model is

$$Q_{ij}=K\,\frac{M_i^{\alpha}M_j^{\alpha}}{D_{ij}^{\gamma}} \quad (5)$$

where K is a constant; M is the fitness, which refers to the intrinsic properties of the nodes and indicates the ability to get an edge; D is generally defined as the Euclidean distance, but it can also represent other physical quantities, such as time; and the values of the two exponents α and γ depend on the network's dependence on node fitness and geography [38]. A classic application of the gravity model in transportation planning is the trip distribution step of the four-stage model, in which the trips between two traffic zones are directly proportional to the numbers of trip productions and attractions and inversely proportional to the traffic impedance between the origin and destination. Previous studies have validated the applicability of the gravity model in network flow analysis, including highway systems [39], airport systems [40][41][42] and rail systems [43]. Related research on urban traffic flow focuses on human mobility among towns or cities [44][45][46][47]; however, to our knowledge, no study has used the gravity model to estimate traffic flow between two adjacent intersections from the perspective of spatial interaction.
Here, M is defined in turn as the degree [10], the improved degree [29] and the lane number of the connecting road (i.e., the estimated road). The distance function is described in the forms of the power function, the exponential function and the combination function [48]. The interactions Q between adjacent intersections are investigated in the fitting experiments. Q includes the total day flows of two-way roads and the peak hour flow of the direction with larger traffic demand. When the fitness (M) is defined as the two-way lane number and the combination function is selected for D, eq. (6) holds within a specific flow range.
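To illustrate how gravity-model parameters might be calibrated, the sketch below fits the simplest power-law form, Q = K (M_i M_j)^α / D^γ, by ordinary least squares on logarithms. This form is an assumption made for illustration and is not the paper's eq. (6), which uses a combination distance function with more parameters. All function names and data are hypothetical, and the demonstration data are synthetic.

```python
import math


def solve3(a, b):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system: predictors are collinear")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, 3):
            f = m[r][col] / m[col][col]
            for c in range(col, 4):
                m[r][c] -= f * m[col][c]
    x = [0.0, 0.0, 0.0]
    for i in (2, 1, 0):
        s = sum(m[i][c] * x[c] for c in range(i + 1, 3))
        x[i] = (m[i][3] - s) / m[i][i]
    return x


def fit_gravity(samples):
    """Fit ln Q = ln K + alpha*ln(Mi*Mj) - gamma*ln D; returns (K, alpha, gamma).

    samples: iterable of (M_i, M_j, D_ij, Q_ij) with strictly positive entries.
    """
    samples = list(samples)
    rows = [(1.0, math.log(mi * mj), -math.log(d)) for mi, mj, d, _ in samples]
    ys = [math.log(q) for _, _, _, q in samples]
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    aty = [sum(r[i] * y for r, y in zip(rows, ys)) for i in range(3)]
    ln_k, alpha, gamma = solve3(ata, aty)
    return math.exp(ln_k), alpha, gamma


def predict(k, alpha, gamma, mi, mj, d):
    """Estimated flow between two nodes under the assumed power-law form."""
    return k * (mi * mj) ** alpha / d ** gamma


# Synthetic demonstration: generate flows from known parameters, then refit.
geometry = [(2, 2, 300.0), (3, 2, 500.0), (4, 4, 800.0),
            (1, 2, 250.0), (3, 3, 1200.0), (2, 4, 400.0)]
synthetic = [(mi, mj, d, predict(100.0, 0.8, 0.3, mi, mj, d))
             for mi, mj, d in geometry]
k_hat, alpha_hat, gamma_hat = fit_gravity(synthetic)
print(f"K = {k_hat:.3f}, alpha = {alpha_hat:.3f}, gamma = {gamma_hat:.3f}")
```

Fitting in log space keeps the problem linear; a nonlinear least-squares fit on the original scale would weight large flows more heavily and could give different parameters on real data.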
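In the formula, α, β, γ, η, k and K are parameters. The indices of root mean square error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE) and correlation coefficient (R) were calculated to determine the accuracy and agreement between the observed and estimated values:

$$\mathrm{RMSE}=\sqrt{\frac{1}{n}\sum\left(Q_{ij}-Q'_{ij}\right)^2},\qquad \mathrm{MAE}=\frac{1}{n}\sum\left|Q_{ij}-Q'_{ij}\right|,\qquad \mathrm{MAPE}=\frac{100\%}{n}\sum\left|\frac{Q_{ij}-Q'_{ij}}{Q_{ij}}\right|$$

where the sums run over all observed pairs; n is the total number of observed (and forecasted) values; $Q_{ij}$ refers to the observed daily flows (or peak hour flows) between intersections i and j; and $Q'_{ij}$ refers to the corresponding estimated values.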
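A minimal sketch of the three error indices, using their standard definitions, is given below; the correlation coefficient R follows the Pearson definition given earlier. The function name and the numbers are hypothetical.

```python
import math


def error_metrics(observed, estimated):
    """Return RMSE, MAE and MAPE (in percent) for paired observations."""
    if len(observed) != len(estimated) or not observed:
        raise ValueError("inputs must be non-empty and equal in length")
    n = len(observed)
    errors = [o - e for o, e in zip(observed, estimated)]
    rmse = math.sqrt(sum(err * err for err in errors) / n)
    mae = sum(abs(err) for err in errors) / n
    # MAPE is undefined when an observed value is zero.
    mape = 100.0 * sum(abs(err) / abs(o) for err, o in zip(errors, observed)) / n
    return {"RMSE": rmse, "MAE": mae, "MAPE": mape}


# Hypothetical observed and estimated daily flows (veh/d), illustrative only.
observed = [42000.0, 38500.0, 51000.0, 29800.0]
estimated = [40500.0, 39900.0, 49200.0, 31000.0]
print(error_metrics(observed, estimated))
```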

Traffic demand spatiotemporal distribution characteristics

Temporal distribution characteristics analysis
Among the 318 intersections in Shenyang, the peak hours of the day were mainly concentrated in the morning peak (07:00-09:15) and the evening peak (16:15-18:15), as shown in Fig. 5(a). The trip peak periods are similar to those of European cases [28]. The two peaks together accounted for 82.22% of the peak hours. The maximum flow of the day mainly occurred in the morning peak: the ratio of the frequency of the morning peak to that of the evening peak was 3.18:1, and the morning peak accounted for 62.54% of all peak hours of the day. The average peak hour flow ratio of the whole day was 0.0844. The average peak hour flow ratio ranged from 0.0654 to 0.1087 according to the time interval statistics.
As shown in Fig. 5(b), the average flow distributions and change trends of the peak hours were consistent with those of the day flows of the corresponding sample sets, especially during the morning peak and the evening peak, and the correlation coefficient R was as high as 0.9821. A linear correlation was found between the day flow and the peak hour flow of individual intersections, as shown in Fig. 5(c). The fitted linear model relates Q, the day flow of the intersection, to q, the peak hour flow of the intersection; the number of samples is 318. In addition to the time distribution of all-day traffic, the morning and evening peaks were inspected separately, as shown in Figs. 6 and 7. The morning average peak hour flow ratio was 0.0831, slightly less than that of the full day. The average peak hour flow ratios of the time interval segments ranged from 0.0646 to 0.1510 (the second largest ratio is 0.1087). The correlation coefficient between the average daily flows and the average flows of the morning peak hours in the corresponding samples was 0.8704. The average flow ratio of the evening peak hour was 0.0733, which was significantly less than those of the full day and the morning peak. The average peak hour flow ratios of the time interval segments ranged from 0.0605 to 0.2131 (the second largest ratio was 0.1075). The correlation coefficient between the average daily flows and the average flows of the evening peak hours was 0.9073. The flows of the morning peak and the evening peak followed the same changing trend. Because the bottleneck of urban traffic flow is the intersection, the time distribution of the road was not discussed here. The average peak hour flow ratio of the road was 0.0801, which was the 63rd percentile. The 93rd percentile was 0.1003.

Fig. 6 Morning peak time characteristics: (a) peak hour distribution and APHR; (b) correlation between average peak hour flow and average daily total flow
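As a rough illustration of how a surveyed peak hour flow could be scaled to a day flow, the sketch below divides by the average peak hour flow ratio reported for intersections (0.0844) and brackets the result with the reported range of interval averages (0.0654 to 0.1087). This is a simple proxy, not the fitted linear model of Fig. 5(c), whose coefficients are not reproduced here. The function names and the surveyed flow are hypothetical.

```python
def estimate_day_flow(peak_hour_flow, ratio=0.0844):
    """Scale a peak hour flow to a day flow using a peak hour flow ratio."""
    if not 0 < ratio <= 1:
        raise ValueError("ratio must be in (0, 1]")
    return peak_hour_flow / ratio


def day_flow_range(peak_hour_flow, low_ratio=0.0654, high_ratio=0.1087):
    """Lower and upper day-flow estimates; a higher ratio gives a lower estimate."""
    return (estimate_day_flow(peak_hour_flow, high_ratio),
            estimate_day_flow(peak_hour_flow, low_ratio))


# Hypothetical surveyed peak hour flow (veh/h), illustrative only.
surveyed_peak = 3500.0
low, high = day_flow_range(surveyed_peak)
print(f"day flow ~ {estimate_day_flow(surveyed_peak):.0f} veh/d "
      f"(range {low:.0f} to {high:.0f})")
```

The width of the bracket shows why a locally calibrated ratio or regression is preferable to a single citywide average when one is available.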
This section reviewed the consistency of trends between peak hour flows and full-day flows in three dimensions: the full day, the morning and the afternoon. For example, the full-day service traffic volumes could be estimated from the peak hour flow ratio, because the peak hour volume is a typical item of a traffic survey.

As shown in Fig. 8(b), the ranges (max-min) of the total flows of roads with the same number of lanes first gradually expanded and then decreased. These findings show that the actual traffic function of different roads is quite different, despite their having the same road structure. This makes possible the refined design of traffic control strategies and the further optimization of transportation resources. Note that there is only one normal one-way road with 7 lanes; this road is ignored in the range analysis. The distribution of peak hour flows is similar to that of the daily flows.

Spatial distribution characteristics analysis
Similar to the intersections, the correlation between the daily flows and peak hour flows of roads was analyzed. A significant linear relationship was found, as shown in Fig. 8(c). In the linear model, Q' is the day flow of the road, q' is the peak hour flow of the road, and the number of samples is 521.
In addition, the total traffic volume of each road type was obtained by multiplying the number of segments with a given number of lanes by the average of their daily total flows. The subgraph of Fig. 8(a) shows the cumulative probability of daily flow with increasing number of lanes. We find that urban traffic flows are mainly concentrated in a small number of roads. Small and medium-sized roads, approximately 66% of all roads, carried about 38% of the traffic flow, while the remaining 34% of medium and large roads served approximately 62% of the traffic. One-lane roads, which made up 42.54% of the total number of segments, accounted for 14.64% of the traffic; their traffic function was roughly equal to that of arterial roads (five, six and seven lanes in one direction), which made up 5.19% of all roads and served 13.40% of the traffic demand. The service traffic volumes of one-way two-, three- and four-lane roads were 23.02%, 25.11% and 23.82%, respectively, all exceeding 20%. These three kinds of roads made up 23.76%, 17.69% and 10.81% of all roads, respectively. This finding from real traffic data in China is a powerful supplement on street hierarchies to Lammer's work [26] on German cities, which used travel time and betweenness centrality to reflect the real flows, Jiang's European taxi case [27] and Huang's Wuhan case [49]. Fig. 9 shows that the traffic flows in the two directions of Qingnian Street, an important arterial road connecting the north and south of the city, were relatively close, at 54,639 veh/d and 53,714 veh/d. The flow ratio was 0.8-1.

After the intersections with lower traffic demand are filtered out, U_1′ = 222. The difference between samples U_1 and U_1′ is that the average flows of the latter are slightly greater than those of the former, while the opposite holds for the frequency. Moreover, they have the same correlation with the entry lane number. U_2, U_3 and U_4 are the cases of filtering out the intersections.
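The concentration of traffic on a minority of roads can be cross-checked from the lane shares quoted in this section. The short sketch below accumulates the segment and traffic shares by number of lanes in one direction; only the dictionary layout and the function name are our own.

```python
# Shares quoted in the text, in percent, keyed by lanes in one direction.
segment_share = {"1": 42.54, "2": 23.76, "3": 17.69, "4": 10.81, "5-7": 5.19}
traffic_share = {"1": 14.64, "2": 23.02, "3": 25.11, "4": 23.82, "5-7": 13.40}


def cumulative(shares):
    """Running total of the shares in insertion order, rounded to 2 decimals."""
    total = 0.0
    result = {}
    for key, value in shares.items():
        total += value
        result[key] = round(total, 2)
    return result


cum_segments = cumulative(segment_share)
cum_traffic = cumulative(traffic_share)
for key in segment_share:
    print(f"up to {key} lanes: {cum_segments[key]:6.2f}% of segments, "
          f"{cum_traffic[key]:6.2f}% of traffic")
```

One- and two-lane roads together give about 66% of segments and about 38% of traffic, which matches the split stated above.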
According to the relationship shown in Fig. 10 between the number of lanes and the mean of traffic flow, the number of inlet lanes in the intersections was linearly positively correlated with the peak hour flow. In general, the peak hour load flow (actual capacity) at the intersection increased with the increase of the number of inlet lanes. Different sets of samples, however, revealed that the maximum carrying capacity for each type of individual was optimal when entry lane number was 14, namely, a marginal effect of entry number of lanes existed in the urban road network. The marginal effect is that the increase of the traffic capacity will gradually decrease when the other inputs are fixed. Since the flow is still larger when the number of entry lanes is 15, it can also be considered.
In SCATS, Degree of Saturation (DS), which refers to the ratio of effectively used green time to the total available green time, is utilized to evaluate the saturated state of the traffic control system [50]. Similar to the previous study, we acquired the DS data of the intersections with ξ j ≥ 80% and removed outliers whose phase number is significantly less than the illustrated number in the system. The distribution of the average degree of saturation in ascending order for each intersection is shown in Fig. 11. The figure indicates the average DSs of different intersections and phases are larger in peak hours, with the average values of 77.08% and 76.09% respectively.
In addition to signal control, another important factor affecting the capacity of intersections is lane function division. Traffic signals are among the most common traffic facilities on city roads, and the commonly used signal colors are red, green and yellow. During the green period, vehicles arriving at the intersection can go straight through, turn right or turn left (unless other traffic signs forbid a movement). When the yellow light starts, vehicles are prohibited from entering the intersection and wait in line until the next green light. Because the signal periodically releases or interrupts the traffic flow of a given direction, vehicles in a given lane pass through the intersection during part of the time and wait for the green signal, or for the vehicles ahead to be released, during the rest. Under signal control, traffic flows at a signalized intersection that conflict in space can thus be separated in time. The lane group is an important analysis object when calculating the capacity of a single intersection. From the perspective of network traffic flow analysis, however, most intersections are in a state of saturation or even oversaturation during peak time. Moreover, each approach usually has lanes for three directions (straight, left and right). Therefore, when the total number of lanes is fixed, the specific lane combinations are not analyzed further in the comparison among multiple intersections in the urban road network.

Correlation between the road traffic flow and the built environment
From the perspective of traffic flow, however, the significant correlation among traffic flow, bus stations, hospitals and schools that was found from a speed analysis [24] did not occur. Dist_hosp also showed no correlation with the other factors, so it was replaced by the number of hospitals within 500 m. This discrepancy may result from differences between the research cases and analysis indicators. Despite this discrepancy, we found new correlations among the built environment factors. Figure 12(a) exhibits the positive linear correlation between road length and the number of bus stations along the roads. Figure 12(b) shows the correlation among the number of hospitals within 500 m, the number of schools within 500 m and the degree. Figure 12(c) shows the correlation between the degree and the number of schools within 500 m. Here, the degree is the number of roads connected to a road, and it is a basic indicator in network science [10]. Although a simple and clear correlation between traffic flow and these factors has not been found in this case, we think such a correlation should exist once city systems reach a mature stage of development.
When we examined the prediction results of the gravity model, we found that eq. (6) holds within a specific flow range when the fitness is defined as the two-way lane number and the combination function is selected for D.

Conclusions and discussions
In this paper, real data of Shenyang, China was taken as an example to study the urban traffic flow spatial-temporal characteristics and its relationship with the built environment; and some interesting findings were obtained. The conclusions were derived from empirical data analysis from the perspectives of time and space. The temporal characteristics focus on the trip time flexibility and the trip quantity variability of city traffic demand. The spatial aspect focuses on the difference of road utility at the network level, i.e., the road utilization rate. The potential important findings were elaborated in figures and models.
In terms of the temporal distribution of traffic demand, the peak hours of different intersections and roads were found to be heterogeneous, revealing trip time flexibility. The primary trip peaks were the morning and evening peaks (07:00-09:15 and 16:15-18:15). Citizens' commuting behaviors determine this phenomenon; however, we found that the trip quantity of the morning peak is larger than that of the evening peak under fixed traffic demand (average peak hour flows are 40,956 and 33,989 vehicles for the morning and evening peaks, respectively). The peak period of the day mainly occurs in the morning, accounting for approximately three quarters of peak hours. This indicates that, after work, the variety of people's destinations and routes places less traffic burden on the roads. Therefore, flexible work times and places are an effective way to reduce the number of vehicles and improve traffic conditions. Considering the influence of routes and trip times on the traffic state and the imbalance of the network flow distribution, the study of traffic signal control strategies should emphasize the time difference and the signal optimization of routes with a heavy traffic burden. In addition to traffic control, another link that must be strengthened is the traffic information service based on GIS-T (Geographic Information System for Transportation). It will be more important in the coming era of autonomous vehicles. After studying the traffic flow of the intersections and roads, we found a characteristic range and typical value of the average peak hour flow ratio. The range was 0.06~0.10, and 88% of the intersections and 93% of the roads fall within this interval. The average peak hour flow ratio is about 0.08 (0.0844 for the intersections, 0.0801 for the roads). Since the correlation between peak hour flows and day flows is significant, day flows could be estimated from a traffic survey of peak hour flows.
This estimation is more important for developing cities because of the lack of data collection equipment. Moreover, even if the roads have similar road structure with the same number of lanes, the actual traffic functions of different roads are quite different. The traffic flow is found to be concentrated in a relatively small part of city roads. The small and partial medium-sized road segments account for 66% of all roads, but only cover approximately 38% of the day service traffic, and the large and partial medium-sized road segments (34% of the whole) account for 62% of the traffic.
Built environment influence factors (Num_bus, Rd_type, Num_hosp, Num_scho) were considered in the correlation analysis with traffic flow. We found that the effect of the lane number on the service traffic volumes of the intersections and roads is more significant than that of the other factors. The lane number has a significant positive linear correlation with the average service traffic flow. For both roads and intersections, the greater the number of lanes, the greater the number of vehicles served. However, the maximum values of traffic flow revealed that the service capacity differs: there is a segmentation feature. Namely, in both cases, the optimal network function is achieved at a certain number of lanes. The case results indicate that the maximum numbers of lanes of intersections and roads should be 14 and 4, respectively. The latter, 4 lanes, is merely a reference because the utility of roads is also determined by the green time or split. In this context, we should reconsider the road diet [51] from the point of view of better performance and utilization rate of the road network. The discovery of the optimal lane number provides new insight and a reference for urban planning and traffic design. Other factors were not found to be strongly correlated with traffic flow. However, correlations among these factors were revealed, such as road length with the number of bus stations, the numbers of hospitals and schools with degree, and degree with the number of schools. Finally, we proposed an improved gravity model to estimate the traffic flow at the day and peak hour scales for specific flow ranges. This model represents a new approach to investigating traffic flow using simple structural parameters (number of lanes and length of a road). The results of this study provide quantitative support for the spatiotemporal characteristics of urban traffic flow and their relationship with the built environment.
The study could provide a reference for current traffic management and, in the form of empirical evidence, help determine how to reduce the waste of road resources. However, the results are perhaps only valid in this case; thus more data from other cities are required to explore whether there is a universal rule. It would be interesting to explore how universal our findings are by conducting a similar analysis for European and other cities, so that we can have a better understanding of urban transport systems. The proposed analysis method and the subsequent results will be important references for European transport studies of trip demand distributions and of the correlation between traffic flow and the built environment. The spatiotemporal characteristics of urban traffic flow and their relationship with the built environment must be further investigated in future studies. Beyond vehicle-level traffic demand, drivers' route selection characteristics based on trajectory data will also be the subject of our next research.