1 Introduction

Electrified railways have constantly raised interest since they are considered one of the most energy-efficient modes of transportation. It is known from the report of [1] that the railway sector is efficient when compared to other means of transportation: the railway sector has a 9% market share in the transportation of passengers and goods in the European Union and this is achieved with a final energy consumption of 2%, in comparison with other sectors. The considerable power requirements of the railway transportation system lead the railway operators to pursue actions to increase the energy efficiency, reduce the energy consumption bill and increase the infrastructure capacity, [2, 3].

This work is framed within the IN2STEMPOFootnote 1 project objectives—Innovative Solutions in Future Stations, Energy Metering and Power Supply—specifically in the smart power supply activities. Inserted in the Shift2Rail European program [4], the smart power supply activities seek to contribute to the development of a railway power grid in a fully interconnected and communicating system, [5]. In the smart power supply target of IN2STEMPO, one goal is to study Flexible AC Transmission Systems (FACTS) to exploit the full potential of the 25 kV, 50 Hz rail power supply system. Specifically, one of the FACTS under study is the Railway Interline Power Flow Converter (RIPFC) [6]. The RIPFC comprehends a power transfer device (PTD) in the neutral zone (NZ), providing double-side feeding to the catenary branch and respective trains, and reactive power on both sides to stabilize the catenary voltage.

The objective of this paper is to statistically evaluate the effects of the inclusion of a PTD in the neutral zone of an electrified railway line. Mainly, the active power balancing in the two traction power substations (TPS) and the system power losses are under consideration in this study.

The first contribution of this paper is a simulation framework to statistically evaluate the proposed PTD compensation strategy (where the framework is capable to be extended to additional statistical analysis). Then, the second contribution is the evaluation of two scenarios, with different PTD characteristics, when they are considered design constraints (namely, the catenary branch length and the sparsity/density of trains in the TPS).

This paper is divided into seven sections. Sect. 2 addresses a scientific literature review on the topologies for power transfer devices and on the control strategies. Section 3 presents the architecture for the simulation model necessary to implement the required analysis. Then, Sect. 4 covers reduced scenario simulations to illustrate the advantages of this strategy, in terms of active power balancing and system power losses. Section 5 covers the universe of parameters randomly analyzed with the Monte Carlo analysis methodology. Section 6 presents the results and the discussion on the. Finally, the conclusions are presented in Sect. 7.

2 Literature review

Electrified railways, DC or AC, must deal with several power quality issues, either in the railway subsystem itself and in the transmission/distribution system operator (TSO/DSO) supply [7,8,9,10]. In the past, some passive solutions helped to alleviate these issues but their performance was dependent on different operation parameters and has poor dynamics. Then, the use of power electronics based converters for power quality increasing in railway systems has entered into the domain and is continuously increasing [11,12,13]. Their purpose is quite large and addresses different power quality issues in railway systems: transformer losses, catenary voltage stability, substation active power balancing, peak shaving, three-phase current balancing, reactive power compensation [13,14,15]. It should be mentioned that the static frequency converter (SFC) system is a kind of “one fits all” regarding all the referred power quality issues but this system is a stand-alone solution, interesting for new lines, [12, 16,17,18]. Since the large majority of railway systems is based on some type of transformer topology, the SFC system will not be further considered in this paper.

Modern railway line voltage stabilization is made using a static VAR compensator (SVC), located near the neutral zone [19, 20] , or using mobile reactive power compensation strategies [3, 21, 22]. The railway power conditioner system (RPC) is employed in conjunction with V/V transformers and offers very good performance regarding 3-phase power balancing and power factor correction [10, 23, 24]. Similarly, the co-phase system [25,26,27,28] , has similar characteristics in terms of an optimized 3-phase interface. Maintaining the objective of guaranteeing a high level of power quality at the 3-phase supply, additional solutions are employed, e.g. the STATCOM [29,30,31].

In recent years, a strong effort is being made to introduce renewables and storage in railway systems. This tendency intends the satisfaction of several main objectives: i) alleviate the substations load and then avoid peak loads (peak shaving) and, at the same time, reduce the system losses [32,33,34]; ii) turn the railway system an even more environmental friendly [35]; and iii) provide the means for regenerative energy storage [36, 37].

There are many factors such as weather conditions, driving style, schedule, and stop time in the stations which alter the actual characteristics of train circulation concerning pre-defined ideal conditions. All of these factors have some randomness degree [38].

The position in the sector, speed and acceleration of the train define both the consumed power and the voltage drop in the line and respective losses and the power factor in the substation. Therefore, it is not possible to carry out a fully deterministic analysis of the various operating parameters of the railway subsystem [39, 40].

Thus, several aspects of the power flow analysis regarding both the railway substations and the catenary are better characterized from a probabilistic analysis [41,42,43]. Analyses of feeder current profile, [44, 45], harmonic propagation in DC [46] or AC railways [47], and the voltage profile in the catenary [39, 48], have been performed using probabilistic methods. The analysis of data from both real exploration and dedicated simulators allows defining the probability density functions associated with factors that contain a probabilistic character such as the number of trains in a section, the position, the consumed power, or the power in the substation [41, 49].

The PTD is an additional element in the railway system and the analyses of its effects, both on the balance of active power in substations and on the redistribution of power losses in the system, are the main objectives of this article. As, the railway system becomes more complex, data clustering and probabilistic analysis are used, in particular, the Monte Carlo method. The following section addresses the subsystem models, with particular focus on the developed simulation framework.

3 Subsystem models

Let us consider a generic 1\(\times\)25 kV, 50 Hz double-track railway line as illustrated in Fig. 1 including a PTD in the ND. In this configuration, and commonly, each traction substation is electrically isolated from an adjacent one where this electric separation is made in the ND.

Fig. 1
figure 1

Simplified representation of the inclusion of a power transfer device on a neutral zone

This paper considers the inclusion of a power converter (a PTD) in the ND, having a back-to-back configuration, as illustrated in Fig. 2, being capable of independently control the reactive power injected in each side of the two sections (\(Q_1\) and \(Q_2\)) and transfer active power between the two sides (\(P_{\text{ptd}}\)). To limit the size of the problem, the work in this paper is only focused on the management of active power where the power factor of the PTD is unitary.

Fig. 2
figure 2

Simplified architecture of the electronic converter to perform the power transfer device function

The specific converter structure can be based on different approaches but the most common is a multilevel one, built with cascaded H-bridges, neutral-point clamped, or modular multilevel converters. The transformers are designed according to the power level to be transferred between sections, the reactive power injected in the ND, and the voltage and current levels in the converters. Also, a parameter of high relevance in the system design either in the transformer and in the converters is the efficiency level. This parameter ensures that the power losses value is kept at a minimum and thus does not compromise the overall losses when using the PTD, particularly in the active power transfer mode.

3.1 Architecture of the simulation framework

The simulation model of a railway power system is a particular power flow analysis, where the electric model is dynamic. Specifically, the electric model depends on the number of trains and the position of each one (absolute/relative position). Then, the power flow analysis of a railway power system is a snapshot simulation, where each simulated scenario has a specific matrix for the branches and the buses of the power system.

This work requires the development of a simulation framework, available as open-source in the repositoryFootnote 2. The simulation framework purpose is to dynamically generate the branches and bus matrices of the power system based on the definition of the physical conditions. Then, this framework has two functional highlights: for each scenario, the physical characteristics can be established (either fixed or randomly defined), with the automatic power system modeling, and the scenario can be automatically evaluated (using a power flow solver).

The implementation approach depends on the usage of an object-oriented language, where each simulation scenario follows a class of objects capable to implement the functional highlights.

The desired functionality was modeled in a MATLAB simulation framework where the power flow analysis is made using the MatPower tool. The simulation framework has the following functional requirements:

  1. 1.

    Must be capable of simulating all trains of two or four branches of each traction substation (respectively for single and double track lines).

  2. 2.

    Must be implemented in an object-oriented approach, enabling the analysis of an array of simulation scenarios targeting multi-core parallelization. Each simulation scenario is one object, having intrinsic properties and methods.

  3. 3.

    Each simulation scenario (or object) must consider:

    • All traction substations – TPS\([1,2,\ldots ,T]\) – in adjacent pairs;

    • All branches of each TPS – br[1, 2] or br[1, 2, 3, 4];

    • All trains in each branch of each TPS – \(T[1,2,\ldots ,N]\);

    • All power transfer devices existing in the whole line in the analysis – PTD\([1,2,\ldots ,P]\).

  4. 4.

    The essential methods of the class must be:

    • Dynamic setup of all TPS branches;

    • Dynamic addition of N trains to a branch (placed in specific positions, with specific apparent power consumption);

    • Dynamic setup of power transfer devices;

    • Power flow analysis and automatic output generation;

    • Automatic graphical generation of the electric diagram.

Let us consider a simple scenario having two traction substations, each one feeding a single-track 1\(\times\)25 kV railway line, where both TPS are interconnected through a power transfer device, and the respective two branches (20 km and 15 km, respectively) have three trains each in different positions and with different apparent power. This scenario can be simulated in the proposed simulation framework. Then, the automatic graphical generation output is illustrated in Fig. 3.

Fig. 3
figure 3

Resultant implemented simulation model of the simple scenario (P stands for active power with units in MW, and Q reactive power with units in MVAr, while pos means position with units in km, and asc ascending direction.)

The output visible in Fig. 3 can be automatically generated by the simulation framework.

Since there are no trains in the left branch of the TPS1, the branch impedance (br[2]: 2–3) corresponds to the impedance of a 20 km catenary line. The traction power transformer of TPS1 is represented by branch br[1]: 1–2. The automatic electric model generator ensures that the sum of the impedances of the right branch is equal to the total impedance of a 20 km catenary line (br[7] + br[8] + br[9] + br[3] = br[2]).

Regarding the PTD, the adopted model comprises two series branches with a middle bus representing the constant losses. The first branch (br[13]: 4–100) represents the series impedance of the PTD step-down transformer; the subsequent branch (br[14]: 100–101) is associated to the inverter conduction losses. The constant power losses in bus 100 represent the transformer constant power losses (no-load losses) together with the inverter constant power losses.

The simulation framework follows the UML diagram of Fig. 4. Each electric_model object has several properties (like the characteristics of the TPS, the information of train positions and respective power consumptions, the characteristics of the PTD, and the relevant parameters for the power flow solver). Also, each electric_model object has several methods, which are the functions of the class, with examples like: i) electric_model::generate_random_trains() to start a new scenario with random train positions and respective power consumptions; ii) electric_model::run_powerflow() to perform the power flow analysis of the scenario; or iii) electric_model:: attach_power_transfer_device() to include a PTD to the scenario for optimization of the TPS power consumption. Additional methods were also developed to load and save the simulation state, or to plot the resulting electric diagram of the scenario.

Fig. 4
figure 4

UML diagram of the resultant implemented simulation model

A new scenario is evaluated with the sequential launch of the methods of electric_model, as illustrated in Fig. 5. The initial step is to configure the electric parameters and the two TPS under study. Then, the trains are generated (number of trains, active power, and power factor of each one) for both TPS and the power flow is calculated. Finally, the PTD is attached to the simulation framework with a specific optimization strategy, the power flow is launched again and the final results are extracted. This procedure is then evaluated as a scenario of two TPS without and with the PTD in the ND.

Fig. 5
figure 5

Example of usage of an electric_model object to evaluate a new scenario

In the example of the application of the flowchart of Fig. 5, the main objective is to balance the active power in each TPS, with the compensation power in the power transfer device being defined by the method electric_model::tps_power_optimization(), where half of the power is transferred in the NZ. In this flowchart, the first part (in green) is to prepare the object for the power flow analysis: first is defined the electric parameters (TPS traction transformer impedance, catenary impedance, among others), and then is created a new physical simulation scenario (where the number of trains can be random or fixed, as well as the respective apparent power and position in the branches, among other properties). The second part (in coral and blue) targets the specific simulation presented in this paper: the evaluation of the installation of the PTD in the NZ, in terms of power losses, where is obtained the knowledge of the system state before and after the installation.

To evaluate a universe of simulation scenarios, a second handler class was developed, which follows the UML of Fig. 6.

Fig. 6
figure 6

UML diagram of the resultant implemented simulation model

The purpose of this handler class is to hold the array of electric_model_objects and to extract the results on the performed analysis. The main method of this class is the eval_compensation(type,max_iter), where all the new simulation scenarios are created, the random and fixed parameters of each scenario are generated and the power flow analysis is performed, following the flowchart of Fig. 5.

This section was detailed with the implementation steps of the simulation framework capable to evaluate the inclusion of the PTD for a universe of simulation scenarios. The Appendix section is also presented two details of the graphical user interface, in Figs. 26 and 27, where the scenario random generation and the electric parameters menus are presented, respectively.

4 Reduced scenario simulation

Before entering a large-scale simulation analysis, it is useful to analyze and validate some simple scenarios, in which the quantitative and qualitative analysis should validate a priori known conclusions.

This section evaluates two sets of simulation scenarios, starting with scenarios comprising two TPS and one train, and then scenarios having two TPS and two trains (each one supplied by a different TPS), using the simulation framework presented in the previous section.

Fig. 7 illustrates a simple simulation of one train and two TPS connected by a PTD.

Fig. 7
figure 7

Illustration of the reduced scenario simulation: a physical representation; b electric diagram

Let us consider the following reduced scenario analysis, consisting of N scenario simulations, as in (1), where the power consumption and position of this train are randomly distributed following specific probability distribution functions (PDF).

$$\begin{aligned} {\left\{ \begin{array}{ll} P_{\text{train}}[1,2,...,N] \in [0 ~ 8] &{} \text {in MW}\\ D_{\text{train}}[1,2,...,N] \in [0 ~ 1] &{} \text {in }\% \text { of } d_{\text{cat,total}}^{\text{tps}1} \\ \end{array}\right. } .\end{aligned}$$
(1)

For all N scenarios, the train power factor is fixed (\(pf = 0.98 ~ ind.\)), the branch lengths are fixed (\(d_{\text{cat,total}}^{\text{tps}1}=30\) km; \(d_{\text{cat,total}}^{\text{tps}2}=30\) km), the parameters of the TPS (\(Z_{\text{tps}}\)) and the catenary are fixed (\(R_{\text{cat}}^{*} = 0.15\, {\Omega }/{\text{km}}\), \(L_{\text{cat}}^{*} = 1.43 \, {\text{mH}}/{\text{km}}\) and \(C_{\text{cat}}^{*} = 12.4 \, {\text{nF}}/{\text{km}}\), where \(Z_{\text{cat}} =Z_{\text{cat}}^{*} ~ d_{\text{cat,total}}^{\text{tps}}\)), and the parameters of the PTD are fixed (\(R_{T,\text{ptd}} = 0.47 \, \Omega\) and \(X_{T,\text{ptd}} = 0.94\,\Omega\) for the PTD transformer, \(R_{I,\text{ptd}} = 0.47\,\Omega\) and \(L_{I,\text{ptd}} = 0 \, \text{mH}\) for the PTD inverter, and the no-load power losses are zero).

Considering that the PTD operates to perfectly balance the traction power between the two TPS, then the formulation of the system power losses can be illustrated in Fig. 8.

Fig. 8
figure 8

Formulation of the electric diagram for the one-train reduced scenario analysis

Fig. 9
figure 9

Power losses evaluation with and without the inclusion of a PTD, for the one-train reduced scenario analysis, as function of the normalized train position (\(\alpha\))

Assuming \(|V_{\text{s}1}| = |V_{\text{s}2}|\), \(Z_{\text{tps}1} = Z_{\text{tps}2}\) and near-equal power factors, with a perfect power balance obtained with the PTD, the train current is divided in half, and with the same catenary length of the two branches, this results in the invariant power losses of Fig. 9 when it concerns to the \(\alpha\) value.

This constant power losses, \(P_{z_l + z_r}\), are the sum of the catenary, TPS and PTD losses, where half of the train current is associated to the \(Z_{\text{tps}}+\alpha Z_{\text{cat}}\) impedances, and the other half of current passes through the \(Z_{\text{tps}}+ Z_{\text{cat}} + 2 Z_{\text{ptd}} + (1-\alpha )Z_{\text{cat}}\) impedances.

Then, if the train is only supplied by the TPS1 with a current \(I_{\text{s}1}\), the power losses can be estimated:

$$\begin{aligned} P_{\text{loss}}^{\text{left}} = |I_{\text{s}1}|^2 \left( R_{\text{tps}}+\alpha R_{\text{cat}}\right) \end{aligned}.$$
(2)
Fig. 10
figure 10

System power losses difference before and after the inclusion of the PTD, as function of the train position and power consumption with 10 000 scenarios simulated

On the other side, if the PTD forces the division of the currents in the two TPS, where \(I_{\text{s}1}=I_{\text{s}2}= \nicefrac {I}{2}\), and considering unitary power factors in TPS1 and TPS2, then:

$$\begin{aligned} P_{\text{loss,ptd}}^{\text{left}}= & {} \left( \frac{|I_{~}|}{2}\right) ^2 \left( R_{\text{tps}}+\alpha R_{\text{cat}}\right) \end{aligned},$$
(3)
$$\begin{aligned} P_{\text{loss,ptd}}^{\text{right}}= & {} \left( \frac{|I_{~}|}{2}\right) ^2 \left( R_{\text{tps}}+(1-\alpha )R_{\text{cat}}+R_{\text{cat}}+2~R_{\text{ptd}}\right) \end{aligned}.$$
(4)

The intersection point, where \(P_{\text{loss}}^{\text{left}} = P_{\text{loss,ptd}}^{\text{left}} + P_{\text{loss,ptd}}^{\text{right}}\), can be obtained as following:

$$\begin{aligned} \alpha _0 = \frac{-R_{\text{tps}} + R_{\text{cat}} + R_{\text{ptd}}}{2~R_{\text{cat}}} \end{aligned}.$$
(5)

Then, for the considered assumptions, this results in \(\alpha _0 \approx 0.6\). This value means that the adoption of a PTD with the objective of power balancing leads to a minimization of the system power losses if the train is located between \(\alpha _0\) and the NZ. The closer the train is to the NZ, the higher the advantages in terms of the system losses, as expected.

Fig. 10 presents the evaluation of the difference of the system power losses for the N scenarios of (1). Each point of the graph corresponds to a simulated scenario (without and with the PTD), and the lines represent the equipotential values where the power losses difference (“after-before” the PTD) is the same. Thus, negative values mean that the PTD action reduced the power losses in the system.

In the simulation results of Fig. 10, it is visible that the actual \(\alpha _0\) value is close to the theoretic value of \(\alpha _0 \approx 0.6\), from expression in (5). This result is obtained with the balancing in the two TPS visible in Fig. 11.

Fig. 11
figure 11

Simulation results for the N=10 000 scenarios of Eq. (1): a active power in TPS1 and TPS2 before the inclusion of PTD; b Active power in TPS1 and TPS2 after the inclusion of PTD; c histogram analysis of the difference of power between TPS1 and TPS2; d histogram analysis after the inclusion of PTD

The previous reduced scenario analysis is useful to evaluate the advantages expected from the inclusion of the PTD. First, it should be highlighted that the major compensation strategy is in the balancing of the two TPS. The example presented can reduce an unbalance from 8 MW to a maximum of 0.45 MW (with the majority of the unbalance values closer to 0 MW, thus a perfectly balanced system in terms of power consumption).

Fig. 12 presents the results for a new testbed of scenarios, where are considered two trains in each TPS branch in addition to the parameters of the previous reduced scenario simulation. Both trains of all N=10000 scenarios follow the expression in (1).

Fig. 12
figure 12

Simulation results for N scenarios of Eq. (1) for two trains: a active power in TPS1 and TPS2 before the inclusion of PTD; b Active power in TPS1 and TPS2 after the inclusion of PTD; c histogram analysis of the difference of power between TPS1 and TPS2; d histogram analysis after the inclusion of PTD

The results after the inclusion of the PTD show final power unbalance between the two TPS (±0.45 MW, corresponding to 5.5 % of the maximum transferred power), which are caused by the compensation strategy of the PTD. Essentially, the simulation framework performs an initial power flow analysis for both TPS without the influence of the PTD. Based on the difference of power in the two TPS, this difference is used as a setpoint for the power transfer device and the simulation framework is subject to a final power flow analysis. Only with an iterative process is possible to reduce the final unbalance, which, in terms of implementation, means the need for a feedback loop if a perfect balance is required.

A preliminary analysis of the system power losses before and after the inclusion of the PTD might not justify the adoption of the power converter. Only in a hypothetical scenario where the majority of trains are located near the NZ (where \(\alpha > \alpha _0\)) is justified the adoption of PTD. For obvious reasons, the train position follows a uniform PDF across the entire railway line. Then, as illustrated by Fig. 10, the majority of the scenarios increase in the system power losses, when comparing the power losses before and after the inclusion of the PTD.

When considering the PTD constant power losses, the results of Fig. 10 are affected by an offset. For example, if these power losses are higher than 0.15 MW, all of the simulation scenarios increase the system power losses.

This section leads to addressing the two metrics of analysis when is considered the inclusion of the PTD: the system power losses and the minimization of unbalance in the TPS power. For the following sections, any discussion on results must consider an analysis on both metrics. As an example, even if every scenario of analysis increases the system power losses, the advantages of adopting a PTD lie in balancing both TPS.

5 Monte Carlo analysis

This section presents the details of the simulation scenarios, specifically on the input parameters. Considering the diagram of Fig. 1, a simplified analysis of the adoption of a PTD (to improve the railway infrastructure) can consider two catenary branches of two subsequent TPS.

The universe of simulation scenarios must cover several parameters variation, such as:

Fig. 13
figure 13

a Line geographic relative coordinates (different x and y scales) and b branch lengths (in ascending order) of a 250 km railway line with 7 TPS

  • branch number of trains, \(\in {\mathbb {N}}\);

  • branch length, \(\in {\mathbb {R}}\);

  • TPS power profile, \(\in {\mathbb {R}}\);

  • train power consumption, \(\in {\mathbb {R}}\);

  • train power factor, \(\in {\mathbb {R}}\);

  • train position, \(\in {\mathbb {R}}\);

The surface of this universe of simulation scenarios is, at least, \(S \in {\mathbb {N}}~{\mathbb {R}}^5\).

To reduce the universe of simulation scenarios, some assumptions must be made based on the Monte Carlo statistical analysis. Being the Monte Carlo term associated with the process of modeling and simulating a system that is affected by randomness [50], this analysis requires that each of the parameters follows an appropriate probability distribution functions (PDF).

With the generation of several random scenarios (where the parameters follow PDF functions), and the collection of relevant statistics, thus it can be assessed the performance of a decision policy or the value of an asset [50], or in this work, the decision of the installation of a PTD. The usage of a specific railway line and a dataset of measurements enable the fitting of the PDF for each of the parameters, as shown in the following statistical analysis.

5.1 Statistical analysis

In the following, it is considered the relevant characteristics of a 250 km railway line with seven substations. The first aspect analyzed in this work is on the catenary branch length, where the distance of each branch is visible in Fig. 13.

The branch distance of Fig. 13 does not follow a specific PDF: it could be highlighted that the catenary branch distance is uniformly distributed between 6 km and 30 km.

Despite the uniform distribution of the length of the catenary branch, it would be interesting the analysis of clustered lengths. It could be argued that long lines can have much more branches and, then, a normal PDF function would be appropriate. Then, in this work, two clusters of line lengths can be defined, where \(\mu\) is the average and \(\sigma\) is the standard deviation, following:

  • Short catenary branch lengths, following a normal PDF with \({\mathcal {N}}(\mu =15, \sigma =2.5)\);

  • Long catenary branch lengths, following a normal PDF with \({\mathcal {N}}(\mu =21, \sigma =2.5)\).

Statistical analysis for the respective clusters is presented in the histogram graphs of Fig. 14.

Fig. 14
figure 14

Result histogram of 10000 branch lengths for the two defined clusters

Furthermore, this work considers two types of TPS, each of them having a randomly defined number of trains, and a randomly defined power consumption profile.

The first assumption is to define clusters of types of TPS. The rationale behind this assumption is the following: the number of trains throughout the day will define the power consumption requirements of this TPS. For example, a TPS covering a dense metropolitan area will be more likely to have more trains and higher power consumption than a TPS covering a sparse rural area.

The seven TPS of the 250 km railway line under analysis have the histogram in Fig. 15 regarding the number of trains, after the analysis of an extensive time-window concerning a specified time-table. From the results presented, two clusters of trains can be considered:

Fig. 15
figure 15

Frequency of trains in a railway line covered by 7 TPS. The frequency corresponds to the number of seconds of a day where the TPS is feeding t trains

Fig. 16
figure 16

Frequency of trains for the two clusters and respective fitting: a dense TPS; b sparse TPS

Fig. 17
figure 17

Histogram graph of active power consumption for all 7 TPS of a 250 km railway line

  • Dense area, covered by TPS1, TPS2, TPS6 and TPS7;

  • Sparse area, covered by TPS3, TPS4 and TPS5.

The two clusters are presented in the blue bars of Fig. 16. Based on the available data, the number of trains for both types of TPS follows a Poisson PDF, defined as:

  • Dense area, with \({\mathcal {P}}ois(\lambda =1.4)\);

  • Sparse area, with \({\mathcal {P}}ois(\lambda =0.3)\);

where the Poisson PDF is given by (6) and \(\lambda\) is equal to the expected value of k.

$$\begin{aligned} F(k,\lambda ) = \frac{\text{e}^{-\lambda } \lambda ^k}{k!} \end{aligned}.$$
(6)

Then, this can be better illustrated in the fitting bar graph of Fig. 16.

Regarding the TPS power consumption, a similar analysis can also be made in Fig. 17.

Considering the two clusters of TPS (dense and sparse areas), the power consumption is also directly related to this cluster, as visible in Fig. 17.

Based on the available data, the clustered TPS power consumption can be approximated to a log-normal PDF, as follows:

  • Dense area, with \({\mathcal {L}}ognormal(\mu =\ln (5\mathrm {e}{6}), \sigma =0.75)\);

  • Sparse area, with \({\mathcal {L}}ognormal(\mu =\ln (1\mathrm {e}{6}), \sigma =1.00)\);

where the log-normal PDF is given by

$$\begin{aligned} F(x,[\mu ~\sigma ]) = \frac{1}{x \sigma \sqrt{2 \uppi }} \exp {\left( -\frac{(\ln {x}-\mu )^2}{2\sigma ^2}\right) } \end{aligned}.$$
(7)

The histogram of the clustered TPS power consumption and respective fitting is visible in Fig. 18.

Fig. 18
figure 18

Histogram graph of active power consumption for the two clusters of TPS and respective fitting: (a) dense TPS; (b) sparse TPS

As stated, the data of power consumption presented in Fig. 18 were fitted to the proposed log-normal PDF. However, this approximation should be taken with caution, since the performed statistical hypothesis tests from [51] do not prove that the data fits the distribution with a high degree of confidence.

Regarding the train power consumption, from the available data is visible that this parameter does not follow any PDF. Even if the authors in [41] shows a possible fitting of multiple normal PDF, the histogram results of Fig. 19 lack the fitting for specific distribution functions. Then, this work assumes a uniform PDF for the train power to comprise the diversity of types of trains present in the dataset of more than 300 train journeys.

Fig. 19
figure 19

Histogram graph of all train apparent power consumption. In this histogram, the frequency is in a logarithmic scale

Regarding the train power factor, from the analysis of all available data, is visible a possible dependence of the train power factor as a function of the train power, as illustrated in Fig. 20.

Fig. 20
figure 20

Evaluation of the train power factor as function of apparent power

For lower train power consumption values, the train power factor varies from 0.4 to 1. However, and as expected, for higher train power consumption values, the train power factor is closer to unitary.

Then, a bivariate PDF is fitted, where the power factor follows a normal distribution function and the \(\mu\) and \(\sigma\) values depend on the train power consumption. Specifically, as illustrated in Fig. 21, it was approximated these values to sigmoid functions.

Fig. 21
figure 21

Fitting of \(\mu\) and \(\sigma\) values of the normal PDF for the power factor as function of train power

Fig. 22
figure 22

Resultant randomly generated train power scenarios (active power plus power factor) for proposed variable \(\mu\) and \(\sigma\) values

The resultant bivariate surface after applying the \(\mu\) and \(\sigma\) curve values is presented in Fig. 22.

5.2 Scenario generation algorithm

The previous analysis is essential to define an automatic scenario generation that covers the universe of possible scenarios of a railway electrification line. Fig. 23 proposes an algorithm to generate a new scenario.

Fig. 23
figure 23

Algorithm for the generation of a randomly distributed scenario

The first step is to generate a random length cluster, following a Bernoulli PDF for each TPS branch, resulting in four equally distributed scenarios:

  • TPS1 \(\in \{short,~long\}\);

  • TPS2 \(\in \{short,~long\}\).

Then, the length of the branch follows a normal PDF, as stated in the \(\mu\) and \(\sigma\) values that led to Fig. 14.

Furthermore, the third step is to generate a random type of TPS, or TPS cluster, following a Bernoulli PDF as well, resulting in four equally distributed scenarios:

  • TPS1 \(\in \{sparse,~dense\}\);

  • TPS2 \(\in \{sparse,~dense\}\).

From the definition of the TPS cluster, the number of trains is generated in step four, based on the Poisson PDF that led to the results presented in Fig. 16.

The fifth step is the generation of the TPS total power consumption, which depends on the TPS cluster. Both sparse and dense TPS power consumptions follow a log-normal PDF, where the \(\mu\) and \(\sigma\) parameters are the ones that led to Fig. 18.

Then, this TPS power consumption is associated with the number of trains on the branch, where the power of each train is generated using a uniform PDF between a minimum and a maximum, and a final gain is applied to all train power consumptions to ensure that the sum of all individual powers is equal to the TPS power consumption selected in step five.

Using a normal PDF for each train, step seven generates a random power factor using the estimated train power consumption and following the \(\mu\) and \(\sigma\) parameters that led to the results in Figs. 21 and 22.

Fig. 24
figure 24

Evaluation of \(\mu\) and \(\sigma\) values for 1 million simulated scenarios

Finally, for each TPS branch, based on the number of trains and the TPS branch length, the position of each train is estimated using a uniform PDF (a minimum distance between trains is ensured in this phase).

5.3 Monte Carlo metrics

In previous subsections, all the possible parameters that affect the analysis of the inclusion of a PTD in a railway line were extensively analyzed. As stated in the previous reduced scenario section, the scenarios where the advantages (in terms of system losses) justify the adoption of a PTD are when the train is near the NZ. However, the train position follows uniform PDF. The type of conclusions to be made should be: “With the adoption of a PTD, during more than 60% of the operation time-frame, the inclusion of the PTD leads to an increase of system power losses”, and not “Only if all trains are closer to NZ during the operation timewindow, is justified the adoption of the PTD”.

The Monte Carlo analysis must consider the analysis of the two parameters—the TPS length cluster and the TPS density cluster—that are fixed for a specific railway line. This analysis allows the identification of the relevant characteristics of a railway line where the study of the system power losses is essential to support this identification. The Monte Carlo analysis performed in this work will be focused on listing the combination of characteristics of a railway line (short or long TPS branches and dense or sparse TPS clusters) that contribute to a higher number of advantageous scenarios.

Regarding the number of scenarios necessary to be evaluated, let us consider a repetitive simulation of 1000 randomly generated scenarios, throughout 1000 iterations (thus a total of one million scenarios), as illustrated in Fig. 24. This figure presents the result metric for the mean value of the power loss difference (losses after minus losses before the installation of PTD).

The mean value converges to a power loss difference of 11.62 kW after the inclusion of a PTD. It is visible in the results of Fig. 24 that if only 1000 scenarios are considered for analysis, the average can vary from 7.33 kW to 16.97 kW, which might be unacceptable, (±41.5 % maximum error, when compared to the 1 million average). If there are considered 10 000 scenarios, from Fig. 24, the same value can vary from 10.3 to 12.51 kW (±9.4 % maximum error, when compared to the 1 million average). When 100 000 scenarios are considered, the maximum error is reduced to ±1.72 %, from a value variation from 11.4 to 11.8 kW.

Thus, only with a considerable amount of simulations is possible to increase the confidence level on the conclusions. The minimum number of simulations required and the computational needs to achieve the conclusions should be appropriately balanced. It could be argued that with this analysis, the degree of confidence of the results generated through 100 000 scenarios is better than 2 %.

Regarding the sigma value, similar convergence conclusions can be made. Both the average and variance values are also useful to evaluate the percentage of scenarios where the inclusion of the PTD is advantageous, as illustrated in Fig. 25.

Fig. 25
figure 25

Histogram of the power loss difference values for 1 million simulated scenarios

Since the PTD for the 1 million simulated scenarios is modeled without constant power losses (the PTD only has conduction power losses), there is a considerable percentage of scenarios where the PTD is advantageous in terms of system power losses.

The percentage value of the advantageous scenarios can be obtained from the definition of the cumulative distribution function for the normal PDF, following:

$$\begin{aligned} \varPhi (x \le 0) = \frac{1}{2} \left[ 1 + \text {erf}\left( \frac{-\mu }{\sigma \sqrt{2}}\right) \right] \end{aligned},$$
(8)

where \(\text {erf}(z)\) is the Gauss error function, given by

$$\begin{aligned} \text {erf}(z) = \frac{1}{\sqrt{2 \uppi }} \int _{0}^{z} \text{e} ^{\nicefrac {-t^2}{2}} \, \text{dt} \end{aligned}.$$
(9)

As example, for the given histogram of Fig. 25, the percentage value of the advantageous scenarios, \(\varPhi (x \le 0)\), is \(39.7\%\).

The final results presented in the following section are obtained with the simulation of 100 000 scenarios (and a different PTD).

6 Results and discussion

This section addresses the evaluation of the PTD inclusion, for 100 000 different scenarios. In particular, the comparison is on the different clusters of TPS and branch lengths.

Two different analyses are made with different power transfer devices: i) a PTD with typical permanent power losses, with a value of 50 kW corresponding to 20 % of the total power losses; and ii) a PTD with reduced permanent power losses, 25 kW corresponding to 10 % of the total PTD power losses. The power losses at nominal power are the same, 2 % corresponding to 250 kW of power losses (when the PTD is in nominal operation, transferring 12.5 MW from one section to the other).

The results for 100 000 scenarios in the following subsections are presented in Tables 14, where each row comprises 10 000 simulated scenarios. Each cell of Tables 14 refer to approximately 10000\16 scenarios.

6.1 Power transfer device with typical power losses

The results were obtained with a PTD with a nominal power of 12.5 MVA , where the total nominal losses are 2 % and 20 % of the total nominal losses are constant ones.

Table 1 Evaluation of system power losses difference (after minus before, average, in kW) with and without the inclusion of PTD for different clusters: TPS density and branch lengths

Table 1 presents the average of the system power losses difference. The global average value is 55.8 kW. This means that in the average of the simulated scenarios, the inclusion of a PTD result in a power balance between two TPS with the disadvantage of increasing the losses in an average amount of 55.8 kW. Then, from the results of Table 1, further conclusions can be obtained:

  • If one TPS has a short branch and is a high density one, and the other TPS has a long branch and sparse density, then the average system power losses difference is near 10 kW higher than the global average (columns 6 and 11);

  • If one TPS has a long branch and high density, and the other TPS has a short branch and sparse density, then the average system power losses difference is around 6.3 kW lower than the global average (columns 7 and 10).

Table 2 Evaluation of system power losses difference (after minus before, variance, in kW) with and without the inclusion of PTD for different clusters: TPS density and branch lengths

In addition to the average value, Table 2 presents the variance of the system power losses difference. Based on the application of the expression in (8) using the \(\mu\) and \(\sigma\) values from Table 1 and Table 2, respectively, Table 3 presents the percentage of favorable scenarios where is advantageous the installation of a PTD in terms of system power losses.

Table 3 Percentage of favorable scenarios \(\varPhi (x \le 0)\), where the inclusion of PTD with typical power losses is beneficial for different clusters: TPS density and branch lengths

From the results of Table 3, it is identified the best scenario (column 7) where the probability of having favorable scenarios is the highest (15.1 %). This scenario is when TPS1 is dense and long and TPS2 is sparse and short. Then, more limited conclusions regarding the sparsity/density can be made, as follows:

  • Due to the low \(\sigma\) values in the scenarios having both sparse TPS, Table 3 shows a near-zero probability of having favorable scenarios (columns 13 to 16);

  • The probability of having favorable scenarios if one of the TPS is dense is 9.1 % higher than having both TPS sparse (columns 5 to 12);

  • If both TPS are dense, the probability of having favorable scenarios is 11.6 % higher than having both TPS sparse (columns 1 to 4).

Regarding the clustering associated with the TPS lengths, other conclusions are listed:

  • The probability of having favorable scenarios if both TPS branches are short is 4 % (columns 1, 5, 9 and 13);

  • The probability of having favorable scenarios if both TPS branches are long is 9.2 % (columns 4, 8, 12 and 16);

  • If both TPS are dense and if one TPS branch is different from the other, the probability of having favorable scenarios is 13.1 % (columns 2 and 3);

  • If one TPS is dense and short, and the other TPS is sparse and long (columns 6 and 11), the probability of having favorable scenarios is 6.2 %;

  • If one TPS is dense and long, and the other TPS is sparse and short (columns 7 and 10), the probability of having favorable scenarios is 14.2 %.

6.2 High-efficiency power transfer device

A second analysis was made with a more efficient PTD. The total nominal losses are still 2 % of the nominal power of 12.5 MW. The constant power losses are 10 % of the nominal power losses.

Table 4 presents the percentage of favorable scenarios where is advantageous the installation of a PTD.

Table 4 Percentage of favorable scenarios \(\varPhi (x \le 0)\), where the inclusion of high-efficiency PTD with reduced constant power losses is beneficial for different clusters: TPS density and branch lengths

Similar to the analysis made for the PTD with typical power losses, the percentage results in Table 4 show the best scenario being the same (column 7), where the probability of having favorable scenarios is the highest (24.2 %).

The conclusions for the sparsity/density are the following:

  • The scenarios having both sparse TPS show an average of 0.65 % in the probability of having favorable scenarios (columns 13 to 16);

  • The probability of having favorable scenarios if one of TPS is dense is 19.3 % higher than having both TPS sparse (columns 5 to 12);

  • If both TPS are dense, the probability of having favorable scenarios is 20.8 % higher than having both TPS sparse (columns 1 to 4).

The conclusions for the clustering associated with the TPS lengths are the following:

  • The probability of having favorable scenarios if both TPS branches are short is 12.8 % (columns 1, 5, 9 and 13);

  • The probability of having favorable scenarios if both TPS branches are long is 16.8 % (columns 4, 8, 12 and 16);

  • If both TPS are dense and if one TPS branch is different from the other, the probability of having favorable scenarios is 22.7 % (columns 2 and 3);

  • If one TPS is dense and short, and the other TPS is sparse and long (columns 6 and 11), the probability of having favorable scenarios is 17.9 %;

  • If one TPS is dense and long, and the other TPS is sparse and short (columns 7 and 10), the probability of having favorable scenarios is 23.4 %.

When compared to a PTD with typical constant power losses, Table 4 shows a considerable increase in all the probabilities for having favorable scenarios, as expected. This increase is on average 10.5 percentage points (scenarios 1 to 12, with 1.2 percentage points in standard deviation).

Since the constant power losses of this PTD analysis are 25 kW (half of the typical PTD), if both TPS are sparse (columns 13 to 16), there is a slight increase in the probability of having favorable scenarios (in the neighborhood of 1 %).

6.3 Discussion

The analysis previously made is useful if the study of the implementation of a PTD in the NZ is strictly subject to the system power losses. If is needed an analysis on the best location to install one PTD from a universe of possible alternatives, then the first analysis is to evaluate the branch lengths and the TPS density. In particular, when compared to all possible scenarios, it is concluded that the best scenarios are the ones where one TPS is dense and long, and the other TPS is sparse and short (columns 7 and 10) to achieve the highest probability of having favorable scenarios (14.2 % for PTD with typical constant losses, or 23.4 % for highly efficient PTD).

In an opposite situation, if both TPS are sparse, from the results achieved, it is expected that the probability of having favorable scenarios is residual (near-zero for the typical PTD, and near 1 % for PTD with reduced losses), leading to the conclusion that, if there is a choice, it is not justified the installation of a PTD in this cluster of TPS. As visible in Fig. 16, for this scenario, the probability of not having any train in both TPS can be estimated using the Poisson PDF for the sparse area. This value is around 54 %, wherein in all operation scenarios, the PTD only accounts for the unfavorable constant losses. Also, in this scenario, the probability of having one train either in one of the TPS branches is around 33 %, and considering Fig. 10 of the atomic scenario, from all of these scenarios, only 40 % of these are advantageous.

It should be highlighted that the inclusion of the PTD installation in the railway electrification system leads always to a more balanced railway power supply. Despite the major focus of the results presented in this paper are on the system power losses, it is clear that once this system is installed, the power demand of the trains in a catenary branch is distributed among the adjacent TPS.

The compensation strategy illustrated in this work depends on the measurement of the power consumption in both TPS. Then, in a practical implementation of this strategy, it might be required a real-time communication system to transfer the measurements remotely between both TPS and the PTD control system.

Finally, this work does not address the control of the PTD reactive power. As stated in [3], a reactive power compensation device can improve the power quality in the railway electrification, specifically with the stabilization of the catenary voltage (allowing to an increase in the infrastructure capacity). In the limit scenario presented in [3], the catenary power losses can be reduced by more than 50 % if the reactive power in the railway electrification is handled. The practical implementation of the strategy proposed in this work must also consider the management of the reactive power, for improved operation of the PTD system.

7 Conclusion

This paper addresses the effects of the inclusion of a Power Transfer Device between two sections of a railway line based on a Monte Carlo statistical analysis. The main analyzed parameters are the active power balancing in the two substations and the system power losses difference with/without the installation of the PTD.

The proposed methodology considers a generic electrification where several scenarios can be simulated and where each scenario has several different parameters.

From a reduced analysis scenario, the adopted strategy is capable to balance the power in both TPS with a small maximum error (5.5 % in the presented analysis). However, this TPS power balance is achieved with non-negligible system power losses. Specifically, if it is considered an ideal TPS and an ideal PTD (\(R_{\text{tps}}=0\) and \(R_{\text{ptd}}=0\), resulting in \(\alpha _0 = 0.5\)), if the train is in the half of the branch closer to the TPS, then the adoption of a PTD results in higher losses than if no PTD is installed.

To contemplate a generic scenario coverage, then an extensive statistical analysis was made in this paper, using data and measurements obtained from a 250 km railway line with 7 TPS. It was concluded that some of the parameters follow specific probability distribution functions. Specifically, the number of trains follows a Poisson PDF, the TPS power consumption follows a Log-Normal PDF and the train power factor is statistically dependent on the train active power.

The final results of the Monte Carlo analysis show that, depending on the density of trains and on the branch lengths, the scenarios where these factors are different for the two TPS can result in considerable higher losses if not carefully considered. However, the PTD system can always balance the two substations, thus implementing double-side feeding and can also reduce the system total losses in some specific scenarios.