1 Introduction

Electric power plants, especially nuclear ones, are spread worldwide as a leading source of electricity for subscribers. Recently, underground power distribution networks have played a crucial role by integrating different power plants into distributed networks, changing the system's behavior and affecting the system [1, 2]. Typically, MVCs are buried underground rather than overhead to feed electric plants, preserving them against adversary weather conditions [3]. However, those cables might still malfunction with asymmetric and symmetric faults due to insulation breakdown or external damage [4]. Moreover, they are spread in urban areas, industries, nuclear, and densely populated areas. Since traditional fault diagnosis methods consume time and cost, the reliance on automated intelligent solutions increases. Hence, lately, real-time monitoring solutions for systems have recently received significant interest. Among cable fault tracing, in particular, has become crucial in industrial applications, owing to its primary role in interconnecting each electrical element [2].

Without precise fault prediction and tracing solutions for those MVCs research nuclear reactor, it can overload the electric network, resulting in hazardous transients, excessive heating, i.e., fire and explosion, equipment failure, power outages, and losing system reliability [5,6,7]. Over the years, there have been many different ways to measure the location of faults in transmission networks [8]. However, due to structural differences between underground cables distribution networks and the transmission network, some of the current solutions are not proper for both power network types [8]. Efficient fault location necessitates the integration of knowledge and methods. It is required to complete the task rapidly in order to limit nuclear reactor outages and prevent future damage to the cable system. This comprises understanding of cable system design, construction, and proper fault locating processes, which should include a high resistance fault. Hence, this paper provides a precise fault tracing solution for MVCs in distributed networks feeding research nuclear reactor.

A well-known fault diagnosis methods including Machine Learning (ML) techniques [9], impedance-based methods [10], traveling wave-based algorithms [11], and state estimation-based approaches [12]. The authors of [6] surveyed different fault detection methods, including their pros and cons. The transient patterns of voltage and current are used to determine the fault distance in transient-based fault location methods [13]. Moreover, in [14], in comparison to prior impedance-based approaches, a fault location methodology was proposed that uses both the peak time and amplitude of the fault current at the local place with reasonable accuracy and speed. The authors of [15] discussed applying wavelets for locating faults in distribution systems, resulting in several milliseconds of fault detection speed. Still, such a method cannot trace faults.

Manual outage mapping utilizing information from customers is now the most prevalent method for predicting the fault location [16]. However, new fault location methods in distribution systems have been proposed as a result of recent improvements in monitoring systems. The location of the fault is identified using deep graph deep learning approach in [17]. The graph network model is utilized to locate the fault in this method, which uses input from various sensors on separate buses. [18] suggests installing phasor measurement units (PMU) in the distribution system to develop fault location identification. This method locates the defect using real-time data from PMUs and state estimation. However, it necessitates a high-bandwidth communication connection as well as additional costs for equipment installation. In actual systems, however, just the current and voltage at the upstream power substation are monitored. The fault location can be estimated by calculating the impedance between the problematic point and the substation [19]. In general, existing approaches estimate the fault location in these instances, where only the power substation is monitored, by ignoring fault resistance and load impact during the fault. The pattern recognition method was used to define the fault localization problem in [20] and [21], but it requires extensive databases, which include fault cases for the entire distribution system. As a result, in addition to reliable system modeling, there is always the chance of differing findings when testing in realistic distribution systems. In [22], a traveling wave-based method for fault location in a radial distribution system is proposed. However, because of the large number of laterals in distribution systems, this method necessitates the use of additional equipment and has low accuracy in experimental conditions. [23] Proposes a method for locating faults in a medium voltage (MV) distribution system's underground cable using current and voltage sensors at the primary substation 10 kV feeder. The current and voltage waveforms measured by an overcurrent relay at the head end of the MV distribution system are used instead of installing additional sensor devices. The key transient features for estimating the fault are the peak time and current value. Previous related work concentrated only on a few forms of fault detection without tracing or fastly fixing the problem. Moreover, previous methods are inadequate and unreliable due to the over cost of sensors. The precision of the location in the traveling-wave-based approach, for example, is heavily dependent on the performance of the expensive high-speed data collecting system, and the fault location is identified by traveling wave timing analysis. The voltage and current during pre-fault and post-fault are acquired and analyzed in the impedance-measurement-based technique. The transmission line model can then be used to determine the line parameters, and the fault can be located. The earlier work was less accurate and effective in synchronous supervisor and decentralized control over the whole power cable. With this backdrop, we developed an exact approach based on MCSVM in fraction Fourier domain for fault identification and location of underground power cables feeding research nuclear reactors. Only delivering end transient current for fault detection and classification, this technology was considered a low-cost, high-precision solution.

Lately, SVM has witnessed remarkable achievements in many research areas, such as face recognition, signal processing, and fault diagnosis. Moreover, SVM-based classifiers have better generalization properties than artificial neural networks (ANN)-based classifiers as their efficiency do not depend on the number of features. Hence, such merit is beneficial in fault diagnostics due to the unlimited number of features in our case, making it possible to compute directly using original data without pre-processing them to extract their features. Therefore, SVM is an excellent choice for fault detection and localization applications. Moreover, it is applicable to any distribution system. However, we focus here on research nuclear reactor faults due to its severe importance and safety as it may cause national disaster.

The following are the main contributions of this paper: -

  • A novel supervised machine learning-based fault diagnosis approach for MVC feeding research nuclear reactor using Multiclass Support-Vector Machine (MCSVM) in fractional Fourier domain.

  • The Alternating Transient Program/Electromagnetic Transient Program (ATP/EMTP) is used to simulate an actual 22 kV MVCs that are fed to our proposed method.

  • First, Discrete Fractional Fourier Transform (DFRFT) extracts features from ATP/EMTP current signal. The DFRFT processes both spatial and frequency domains of the processed current signal. Therefore, it is appropriate to handle different fractional power parameters of the current signal throughout the spatial-frequency domain to enhance the classification performance.

  • Singular Vector Decomposition (SVD) extracts the essential features by removing each phase's most significant singular value. This feature reduction helps in reducing the classifier’s processing time, which is proper for real time.

  • Afterward, MCSVM approach is used to detect, classify, and trace faults using the output current training patterns obtained from previous stages.

  • The performance of our proposed method is studied against different fractional power parameters and various kernels of SVM to find out the near-optimal pair.

  • In addition, the MCSVM performance is investigated under various fault conditions (fault resistances (R) and inception angles (Ө)).

Paper organization is as follows: Sect. 2 highlights the concepts of DFRFT, SVD, and MCSVM schemes. Section 3 describes the system model, namely, 22kv MVC, research nuclear reactor load, and faults. The proposed fault diagnosis algorithm for the system under study is discussed in Sect. 4. Section 5 shows ATP/EMTP simulated fault results followed by simulation results. Finally, Sect. 6 summarizes the paper.

2 FRFT, SVD, and SVM overview

This section will briefly discuss FRFT, SVD, and SVM schemes since they are utilized in our proposed fault diagnosis algorithm.

2.1 Fractional Fourier transform (FRFT)

The FRFT, proposed by N.Victor [24], generalizes the Fourier Transform (FT) used in various disciplines, including signal processing, quantum mechanics, quantum optics, optical systems, and communication systems. FT determines the spectral content of a signal, not the spectral components' time location [24]. The FRFT algorithm rotates signals in the time–frequency domain. Time–frequency patterns are critical when analyzing time-varying or non-stationary signals. FRFT forms the signals as rotations in the time–frequency plane as a powerful tool for signal analysis. In this case, an order parameter is utilized to introduce revolve in the time–frequency plane. As a result, the FRFT can convert a signal x (t) to X α (u), which is represented in an intermediate domain between time and frequency.

$$ X_{\alpha } \left( u \right) = FRFT_{\alpha } \left( {x\left( t \right)} \right) = \mathop \int \limits_{ - \infty }^{\infty } x\left( t \right) \cdot K_{\alpha } \left( {t,u} \right)dt $$
(1)

where α is the transform order and Kα is the transform kernel that is calculated as follows:

$$ K_{\alpha } = \left\{ {\begin{array}{*{20}l} {\sqrt {\frac{{1 - j {\text{cot}}\left( \alpha \right)}}{2\pi }e^{{j\left( {\frac{{u^{2} }}{2}} \right){\text{cot}}\left( \alpha \right)}} - j u t {\text{csc}}\left( \alpha \right)} } \hfill & {\alpha \ne n \pi } \hfill \\ {\delta \left( {t - u} \right)} \hfill & {\alpha = 2 n \pi } \hfill \\ {\delta \left( {t + u} \right)} \hfill & {\alpha = \left( {2 n \pm 1} \right) \pi } \hfill \\ \end{array} } \right. $$
(2)

The Dirac function is represented as δ(t). When α is equal to π/2, the FRFT transforms to the standard FT, i.e., the transformed signal is completely in the frequency domain. The converted signal, on the other hand, can be in the time domain if α is equal to 0. As a result, the FRFT has an extra degree of flexibility throughout the order parameter (α) compared to the FT.

Let x (t) be a sampled periodic signal with a period ∆0. The αth order DFRFT of x(t) can be mathematically expressed as follows [18]

$$ f_{\alpha } = \sum\limits_{{K = - \frac{N}{2}}}^{{\frac{N}{2} - 1}} {f\left( {\frac{{\Delta_{0} }}{N}} \right)} \sum\limits_{n = - \infty }^{\infty } {K_{\alpha } } \left( {x\left( {n + \frac{K}{N}} \right)\Delta_{0} } \right) $$
(3)

where N is the number of points and k = 0,1,2,…..N of DFRFT matrix as diagonal elements. The coefficients of the DFRFT are determined by a fractional factor (α) ranging from 0 to 1 [25]. Thus, due to its superior benefits, we employ eigenvector DFRFT as a feature extractor in the proposed method (with a factor between 0 and 1).

2.2 Singular value decomposition (SVD)

SVD is an orthogonal transformation that simplifies the matrix into a diagonal form, whose eigenvalues reflect several of the original matrix's primary characteristics. Specifically, it is a matrix factorization technique that is extremely useful for various tasks such as pattern recognition, data dimension reduction, matrix approximation, pseudo inverse calculation, and solving linear equations. SVD has been successfully applied to signal processing as a data processing method and approved its effectiveness at avoiding modal aliasing. It decomposes any singular matrix into upper, diagonal, and lower matrices as follows:

$$ A = U \cdot S \cdot V^{\prime } $$
(4)

where U and V are unitary matrices, i.e., UU’ = 1 and VV’ = 1, called left and right singular vectors, respectively. The S matrix is a diagonal matrix representing the singular values of A, which are evaluated by calculating the eigenvalues of AA’. It can be described as follows:

$$ S = \left( {\begin{array}{*{20}l} {s_{1} } \hfill & {} \hfill & {} \hfill & {} \hfill & 0 \hfill & {} \hfill \\ {} \hfill & \ddots \hfill & {} \hfill & {} \hfill & {} \hfill & {} \hfill \\ {} \hfill & {} \hfill & {s_{\rho } } \hfill & {} \hfill & {} \hfill & {} \hfill \\ {} \hfill & {} \hfill & {} \hfill & 0 \hfill & {} \hfill & {} \hfill \\ {} \hfill & 0 \hfill & {} \hfill & {} \hfill & \ddots \hfill & {} \hfill \\ {} \hfill & {} \hfill & {} \hfill & {} \hfill & {} \hfill & 0 \hfill \\ \end{array} } \right) $$
(5)

where ρ is the rank of the matrix A. Note that s1 > s2 > … > sρ, i.e., s1 is the largest singular value. SVD extracts the unique features matrix in several values (singular values), so it is endowed with a dimension reduction strategy with good stability performance. In other words, when the feature matrix element changes, the single values do not vary significantly.

2.3 Support-vector machine (SVM)

SVM is a ML-based approach mainly utilized for classification and regression prediction [26], where it is used to maximize forecast accuracy while avoiding overfitting the data automatically. A system that uses the hypothesis space of a linear function in a high-dimensional feature space can be defined as this technique. This system is trained using an optimization-based learning algorithm that incorporates a learning bias derived from statistical learning theory. The SVM's primary objective is to find the optimal separating hyperplane that maximizes the training data's margin [27].

The optimal separating hyperplane is obtained by maximizing the margin between two or more training data set classes. The separable case's optimization problem (no misclassified training example)

$$ \begin{aligned} & {\text{Minimze}}: \frac{1}{2}\|W\|^{2} \\ & {\text{subject}}\, {\text{to}}\, {\text{the}}\, {\text{constrains }}:y_{i} \left( {\left( {W.C_{i} + b} \right) \ge 1} \right) \\ \end{aligned} $$
(6)

where n denotes the number of training data points. Then, the solution to this problem is achieved by optimizing the following equation:

$$ E = \frac{1}{2}\|W\|^{2} - \mathop \sum \limits_{i} \alpha_{i} \left( {y_{i} \left( {\vec{W}.\overrightarrow {{V_{i} }} + b} \right) - 1} \right) $$
(7)

where W is a collection of weights, one for each input feature, y is the output result that shows the class label (y ∈ {1, − 1} for binary classifier), and C is the input feature vectors V’s (also known as support vectors) are picked locations from the input training features’ data (C) that satisfy the maximum margin above and below the hyperplane. The parameters b and α are used to determine a unique maximum margin solution. The chosen support vectors V's are represented by the y's and α’s. The decision function d (Q), which is positive for class 1 and negative for class 2, predicts the classification of an unknown vector Q and is defined for kernel (K) as follows:

$$ d\left( Q \right) = {\text{sign}}\left( {\sum\limits_{i} {{\varvec{K}}\left( {{\varvec{Q}},{\varvec{V}}_{i} } \right)} } \right)y_{i} \alpha_{i} + b $$
(8)

To classify the DFRFT transform's output, different SVM kernels such as the linear-SVM and nonlinear formulations such as the quadratic and Gaussian radial basis function (RBF) were investigated.

2.3.1 Multiclass SVM

SVM was developed to address the problem of binary classification. Numerous extensions to multiclass classification are proposed [28], including One-Versus-All (OVA), one-versus-one (OVO), and the Crammer–Singer multiclass extension. OVO-SVM generates all possible pairwise SVMs with training examples drawn from a set of N classes. For each pair of classes I and j, the decision function is defined as follows:

$$ f^{ij} (x) = \left\langle {\varphi ({\mathbf{x}}) \cdot {\mathbf{w}}^{ij} } \right\rangle + b^{ij} $$
(9)

Different decision functions are used for N (N-1)/2-class problems. Various strategies can be used to obtain a shared decision between the generated classifiers. The most common is majority voting, also referred to as "max-wins." Thus, the decision function can be defined as follows:

$$ \arg \max_{i} \sum\limits_{j \ne i,j - 1}^{k} {sign(f^{ij} ({\mathbf{x}}))} $$
(10)

3 System model

3.1 MVC research nuclear reactor description

Figure 1 shows the layout of 22 kV MVC feeding research nuclear reactor have length 20 km, transmitted power is 1500 MVA, 0.024 + j 0.23 positive sequence and 0.412 + j 1.13 zero sequence impedance per phase in Ω [3, 23, 29]. A balanced direct-sequence three-phase set of 50 Hz sinusoidal currents specified by Eq. (11), with 53-A rms and 18 kV rms, defines the phase conductor currents and voltages.

$$ I_{P} = 53 \, \left[ {1, \, e^{ - j2\pi /3} , \, e^{j2\pi /3} } \right] \, A $$
(11)
Fig. 1
figure 1

Power network comprised of a research nuclear reactor load, a 20-km three-phase single-core MVC

Due to significant influence on the nuclear underground cable behavior under different faults must be either accurately measured or reliably predicted by simulations. This performance requires accurate modeling of system components. As a result, high-frequency model components using ATP/EMTP are used to implement 22 kV MVC feeding research nuclear reactor load.

3.2 Power station modeling

The power station's synchronous generator (SG = 1500 MVA/22 kV) is represented by the ATP/EMTP model SM59/58 have 14Ω resistance and 0.35 mH inductance.

3.3 MVC modeling

Figure 2 shows the underground cable type and geometry used in this study. Under identification of dimension of 22 kV underground cable as in Table 1, the MVC was simulated using the LCC JMarti model (Frequency-dependent model with constant transformation matrix), which was likewise based on traveling wave theory [3, 29, 30].

Fig. 2
figure 2

Structure and installation of the three-phase single core cable

Table 1 Specification of 22 kV cross-linked polyethylene (XLPE) underground cable material

Where: ρc: Resistivity of the conductor material. ρs: Resistivity of the sheath material. µc: Relative permeability of the conductor material. µs: Relative permeability of the sheath material. µ r: Relative permeability of the insulating material outside the conductor. εr: Relative permittivity of the insulating material outside the conductor.

3.4 Load (research nuclear reactor) modeling

A standard component RLCY3 can easily model the load as a Three-Phase Grounded-Wye load with parallel R, L elements R = 300 Ω and L = 1.5mH.

3.5 Asymmetric, symmetric faults modeling

Faults are classified as asymmetric and symmetric faults based on the types shown in Fig. 3 [3, 6]. This type of asymmetrical fault is called a single phase to ground fault (LG), a two-phase fault (LL), and a two-phase fault to ground (LL-G). However, three-phase faults like three-phase short circuit (LLL) and three-phase short circuit to ground (LLL-G) are also symmetric faults. Accordingly, a short circuit is modeled as a time-controlled switch with resistance in ATP/EMTP to study the effects of different fault inception angels and fault resistances.

Fig. 3
figure 3

Fault types

4 Proposed fault diagnosis technique

Since DFRFT eliminates the discrimination's reliance on signal energy and noise effects and SVD is an efficient feature dimension reduction scheme, our proposed technique makes use of them as a feature extractor and a feature reducer, respectively. Besides, MCSVM is utilized as our classifier because SVM achieves high discrimination efficiency. Among, we used the OVO-SVM to generate all possible paired SVMs with low computational complexity. Additionally, the linear function is chosen as the kernel function to maximize classification performance while minimizing execution time. Figure 4 presents a detailed flowchart for the steps of our proposed fault diagnosis approach, which is divided into three stages. The first stage processes the current signals obtained from the MVC’s sending end for various fault types and locations under different conditions. In the second stage, the signals are transformed using DFRFT and SVD to remove the discrimination's dependence on the signal's energy and noise effects. In the third stage, identification, categorization, and tracing are performed using MCSVMs, composed of several binary SVMs. Thus, the most efficient parameters that produce the best results can be identified. The final stage involves the prediction of fault locations using MCSVMs, which are composed of several binary SVMs.

Fig. 4
figure 4

Flowchart of the proposed method

4.1 ATP/EMTP simulated results

Figure 5 shows the simulated design of a practical 22 kV MVC feeding research nuclear reactor. The ATP/EMTP simulation uses a 20 km MVC model, part of the nuclear power plant system published in [31]. Each 1 km in length, twenty identical blocks are connected in a cascade to develop the 20 km MVC model. Faults have been conducted at the intermediate junctions of each consecutive block, and the fault current waveforms are recorded at the sending end only. Faults depend on four main fault parameters: type, distance r, resistance R, and inception angle Ө.

Fig. 5
figure 5

Simulated MVC model

In this regard, five fault types (LG, LL, LL-G, LLL, LLL-G), ten fault resistances in the case of ground fault (R = 0 Ω, 10 Ω, 20 Ω, 30 Ω, 40 Ω, 50 Ω, 75 Ω, 100 Ω, 150 Ω, 200 Ω), five inception angles (including Ө = 0°, 45°, 90°, 135°, 180°), and nineteen distances of fault from recording point (including r = 1 km, 2 km, 3 km, 4 km, 5 km, 6 km, 7 km, 8 km, 9 km, 10 km (Midpoint), 11 km, 12 km, 13 km, 14 km, 15 km, 16 km, 17 km, 18 km, 19 km) are simulated. Figure 6 depicts the sending end three-phase current waveforms after applying various fault types to phases at a distance of 10 km (cable midpoint) from the source. These faults are implemented at Ө = 0° for the system phases and R = 20 Ω.

Fig. 6
figure 6

Three-phase currents waveforms at different faults type (with R = 20Ω, Ө = 0° and r = 10 km)

Figure 7 shows the phase (a) current waveform at the sending end under different locations (with fault resistance R = 20, inception angle Ө = 0°, and fault location r = (1 km, 5 km, 10 km, 15 km, and 19 km) for the one-phase (a) to ground fault. The faulted phase current amplitude was reduced from 950A at 1 km to 799A at 19 km by about 16%.

Fig. 7
figure 7

Phase a current waveform at sending end under different location (with fault resistance R = 20Ω, Inception Angle Ө = 0° and fault location r = (1 km, 5 km,10 km, 15 km, 19 km)

Figure 8 shows the phase (a) current waveform at the sending end under different fault resistances (with fault resistance R = (10, 20, 50, 75, 100), inception angle Ө = 0° and fault location r = 10 km for the one-phase (a) to ground fault. It is noticed that the faulted phase current amplitude is reduced at high fault resistance from 1509A at 10 Ω to 238A at 100 Ω by about 84%.

Fig. 8
figure 8

Phase a current waveform at sending end under different fault resistance (with R = (10 Ω, 20 Ω, 50 Ω, 75 Ω, 100Ω), Ө = 0° and fault location r = 10 km

Figure 9 shows the phase (a) current waveform at the sending end under different inception angles (with fault resistance R = 20, inception angle Ө = (0°, 45°, 90°, 135°, 180°) and fault location r = 10 km for the one-phase (a) to ground fault. It is concluded that the initial current amplitude in the case of fault inception angles of 45°, 90°, and 135° is much more than 0° and 180°.

Fig. 9
figure 9

Phase (a) current waveform at sending end under different inception angle (with fault resistance R = 20 Ω, Inception Angle Ө =  = (0°, 45°, 90°, 135°,180°) and fault location r = 10 km

Fig. 10
figure 10

The output of DFRFT of different Faults

5 Results and discussion

To illustrate the efficiency of the proposed method, five fault types (LG, LL, LL-G, LLL, LLL-G), ten fault resistances in the case of ground fault R (0 Ω, 10 Ω, 20 Ω, 30 Ω, 40 Ω, 50 Ω, 75 Ω, 100 Ω, 150 Ω, 200 Ω), five inception angles Ө (including 0°, 45°, 90°, 135°, 180°), and nineteen distances of fault from recording point (including r = 1 km, 2 km, 3 km, 4 km, 5 km, 6 km, 7 km, 8 km, 9 km, 10 km (Midpoint), 11 km, 12 km, 13 km, 14 km, 15 km, 16 km, 17 km, 18 km, 19 km) are simulated This section describes the related results of Feature extraction, and reduction using DFRFT, SVD fault detection, classification and location via the MCSVM approach.

5.1 Feature extraction, and reduction using DFRFT, SVD

The proposed technique utilizes DFRFT and SVD to extract features. The DFRFT is used because of its fractional power parameter, which allows for a spatial-frequency representation of the signal. As a result, both the spatial and frequency domains can be utilized. By extracting the most significant singular value for each phase, SVD is used to reduce the number of features.

Figure 10 illustrates the output of the DFRFT on various faults’ three-phase current signal. This approach eliminates the effect of noise on classification accuracy. In addition, the DFRFT coefficients are used in conjunction with SVD to reduce the number of features, which are then classified using the MCSVM (Fig. 10).

5.2 Fault detection and classification (discrimination) using MCSVM

The main goal is to determine exactly which underground cable is faulty or healthy. To accomplish this, the SVM classifier is fed training current patterns derived from DFRFT and SVD. An SVM will build a hyperplane that separates in an n-dimensional space that forms the boundary between the two data sets if the input data is processed as two pairs of vectors in that space. To compute the margin, we construct two parallel hyperplanes that are superimposed over the two data sets. The hyperplane with the greatest distance to the neighboring data points of both classes, intuitively, gives a suitable separation. The goal is to reduce the classifier's generalization error by increasing the margin or gap between these parallel hyperplanes. The decision function, by utilizing a kernel function, returns the faulty or healthy (No fault).

As soon as the faulty phases have been detected, our next goal is to determine the type of fault, at which point the MCSVM is activated. The proposed fault classification approach is divided into two steps. The first is DFRFT feature extraction, followed by SVD reduction, and the second is MCSVM classification. In the first step, the maximum singular value of DFRFT for each fault state is used to produce one feature for each phase. One feature are obtained from the maximum singular value of DFRFT for one phase only. The fault currents were gained from ATP/EMTP simulation. The simulation time is 0.12 s with 5 microsecond time step, 10 ms fault clearing. Fault type, fault location and fault inception time are changed to obtain training patterns covering a wide range of different power system conditions. In ATP simulation, the sample frequency is about 120 kHz and the transients of only one-terminal phase currents is processed. For each fault state of a three-phase 22 kV underground cable, three characteristics are retrieved. As a result, the size of the feature vector, which comprises of three-phase transient characteristics, is 3 × 950 for each failure type. The MCSVM classifier will be trained and tested using these criteria. We selected the characteristics corresponding to 3 × 750 faults occurring at 19 distinct locations along the 20 km cable for training. The remaining 3 × 200 fault cases are used to evaluate the MCSVM classifier. Table 2 lists the fault types and their associated class labels. The performance and training time using different SVM kernels are shown in figs. 11, 12 and 13. The linear kernel, with the hyperplane discriminating the coefficients properly and the DFRFT fractional factor (α) equal to 0.5, is the most often used kernel.

Table 2 Training label for MCSVM fault type classification
Fig. 11
figure 11

The proposed method's performance based on different fractional factor (α) values utilizing Linear, Quadratic, and RBF kernels

Fig. 12
figure 12

The proposed method's execution time is based on fractional factor (α) values utilizing Linear, Quadratic, and RBF kernels

Fig13
figure 13

The proposed method's execution time for fractional factor (α = 0.5) utilizing Linear, Quadratic, and RBF kernels

Linear MCSVM's performance is compared to that of the k-nearest neighbor method (kNN), probabilistic neural networks (PNN), multilayer perceptron (MLP) and radial basis function networks (RBF). Table 3 presents the fault classification results obtained utilizing various network architectures.

Table 3 Results of several network tests

The DFRFT spectrum characteristics for a real signal are conjugated symmetric, which means that the first half of the DFRFT spectrum is a mirrored conjugate to the second half. As a result, the features may be identified using only one half of the DFRFT spectrum, which reduces complexity and saves time. DFRFT and SVD provides optimal representation of signal by packing most of the information in few coefficients for a given signal. The MLP takes a lot of times because the computations are difficult and time consuming and the proper functioning of the model depends on the quality of the training.

As shown in Table 3, the best results were obtained using MCSVM, which achieved 99.8% performance levels and required less execution time than the others since MCSVM is trained using just support vectors rather than the entire training data set. As a result, MCSVM is the optimal option for classification problems. Although the MCSVM has the best classification performance, these results might have happened by chance. To evaluate the MCSVM classifier's results, a second validation test is required. Table 4 summarizes the results of the fivefold cross-validation test. The dataset is randomly partitioned into five exclusive subsets of nearly identical size for fivefold cross-validation, and the suggested approach is used five times. Each time, one of the five subsets is used as the test set, while the other four are combined to create the training set. The average percentage error for all five trials is then calculated. In this strategy, it makes no difference how the data is split. Each data point appears exactly once in a test set and four times in a training set. Table 4 shows that all fault categories are appropriately categorized.

Table 4 The results of the fivefold cross-validation test

5.3 MCSVM-based fault tracking

Our second goal, estimating the fault distance of the underground cable, is triggered after the MCSVM classifier identifies the MVC faulty phases. MCSVM attempts to locate the actual location of the fault using inputs containing three-phase current data. There are several fault types to consider, including single phase-to-ground (LG), phase-to-phase (LL), two-phase-to-ground (LL-G), three-phase (LLL), and three-phase-to-ground (LL-G) (LLL-G). SVM was trained and evaluated using data from simulated faults that occurred at different locations along the 20-km MVC (approximately 19 data points for each case). Furthermore, the resistance and inception angles of faults are changed to illustrate the SVM's performance under varied operating situations. After defining the fault classes, five distinct SVMs (SVM-1–SVM-5) were employed to forecast the locations of LG, LL, LL-G, LLL, and LLL-G problems, respectively. The specific high-frequency features of each type of fault were determined using DFRFT and SVD, and were then employed in the second stage of the method to derive the revised location of the fault.

The results demonstrate that the cable's nearest points have the highest estimate errors. In terms of fault localization accuracy, similar findings have been found for various fault kinds and system situations. After defining the fault classes, five distinct SVMs (SVM-1–SVM-5) were used to forecast the locations of LG, LL, LL-G, and LLL-G problems. Tables 5, 6, 7, 8 and 9 provide the SVM-1–SVM-5 testing results. In the worst-case situation, the maximum percentage inaccuracy in identifying the fault is restricted to (0.47% of total line length) kilometers, as indicated in the tables. Each test case's SVM results were examined to determine the estimation error associated with that test instance. The difference between the estimated fault distance (A) and the actual fault distance (r) for the test instance was used to quantify the inaccuracy. The accuracy of the method is equal to this estimation error. The algorithm's accuracy decreases as the distance deviation increases. The overall accuracy, E, is defined as the highest estimate error for the line's whole length range, as well as for all conceivable fault types, expressed as a percentage of the total line length L.

$$ {\text{Error}} \left( E \right) = \left| {\frac{{{\text{estimated}}\, {\text{location}}\, \left( A \right) - {\text{exact}} \,{\text{location}}\, \left( r \right)}}{{{\text{total}}\, {\text{cable}}\, {\text{length}} \,\left( L \right)}}} \right| \times 100 $$
(12)
Table 5 Results of SVM-1 for LG faults
Table 6 Results of SVM-2 for LL faults
Table 7 Results of SVM-3 for LL-G faults
Table 8 Results of SVM-4 for LLL faults
Table 9 Results of SVM-5 for LLL-G faults

Table 10 shows the actual and estimated locations of faults at 19 kms with various fault resistance and inception angles. The location of the fault, the resistance of the fault, and the angle of inception all affect the accuracy of fault location. The maximum error (0.525789) in fault location occur when the fault resistance is equal to 200 Ω and the inception angle is 90 degrees at the cable receiving end (19 km), as shown in Table 10.

Table 10 Testing results of MCSVM for different fault resistances and inception angle

The results of this research demonstrate that using MCSVM in conjunction with DFRFT and SVD is an excellent strategy for identifying faults on subterranean cable feeding research nuclear reactor. The suggested approach may conveniently, rapidly, and precisely locate the position of the fault.

6 Conclusion

This paper presented a novel fault classification technique and location in a MVC research nuclear reactor that uses MCSVM. First, ATP/EMTP simulates several faults with varying fault locations under varying conditions to prepare input data. Additionally, optimize classification and localization performance and execution time by combining DFRFT with SVD and MCSVM. The influence of the rotation angle of the DFRFT on classification efficiency was also investigated, and it was observed that classification efficiency varies with the rotation angle value. SVM linear, quadratic, and RBF kernels were used for classification. The results indicate that a quadratic kernel is the most efficient due to the quadratic plane's efficient separation of classes. The most obvious conclusion from examining various DFRFT factors is that 0.5 factors perform the best in linear and quadratic kernels.

On the other hand, the complexity of the kernel is the critical determinant of execution time. Thus, the linear kernel executes at the fastest rate, attaining 99.8% performance and a time of 0.01 s for various faults at multiple locations, resistances, and angles of inception. The maximum error is equal to 0.525789% in fault location occur the fault resistance is equivalent to 200Ω and the inception angle is 900 at the MVC receiving end. This shows that the proposed technique can offer acceptable accuracy in fault classification and fault location estimation. Moreover, MCSVM could be used as a part of a new generation of high-speed advanced fault locators.