Sparse Component Analysis Using Time-Frequency Representations for Operational Modal Analysis

Sparse component analysis (SCA) has been widely used for blind source separation(BSS) for many years. Recently, SCA has been applied to operational modal analysis (OMA), which is also known as output-only modal identification. This paper considers the sparsity of sources' time-frequency (TF) representation and proposes a new TF-domain SCA under the OMA framework. First, the measurements from the sensors are transformed to the TF domain to get a sparse representation. Then, single-source-points (SSPs) are detected to better reveal the hyperlines which correspond to the columns of the mixing matrix. The K-hyperline clustering algorithm is used to identify the direction vectors of the hyperlines and then the mixing matrix is calculated. Finally, basis pursuit de-noising technique is used to recover the modal responses, from which the modal parameters are computed. The proposed method is valid even if the number of active modes exceed the number of sensors. Numerical simulation and experimental verification demonstrate the good performance of the proposed method.


Introduction
During the last few decades, modal analysis has been a useful analytical tool in linear dynamical systems. Modal information, including natural frequency, damping ratio and mode shape, can be extracted through modal identification to completely characterize the dynamics of a linear system. Output-only modal identification or operational modal analysis (OMA) has attracted the interest of many researchers in structural dynamics because of its capacity to extract modal information from output measurements alone [1][2][3].
Modal identification coincides with the concept of blind source separation(BSS), which is the process of extracting original sources and mixing matrix from output signals without prior knowledge of the mixing procedures or sources. The linear instantaneous-mixing BSS model can be formulated as where x(t) is a column vector of m, zero mean, and output observations that represent an instantaneous linear mixture of source signals in column vector s(t); A ∈ R m×n is an m × n matrix, which is referred to as the mixing matrix. The observation vector x(t) may be construed as a superposition of the columns of matrix A loaded by individual sources s i (t), i = 1, . . . , n. The objectivity of BSS is to recover sources s(t) from observations x(t) only without any knowledge of A. From the perspective of modal expansion, the BSS problem has a similar form to OMA, in which the output responses are a linear combination of decoupling monotone modes viewed as source signals.
Many studies have attempted to use BSS for OMA [4][5][6][7][8]. [4] was the first to recognize the correlation between the vibration modes of mechanical systems and the sources computed through independent component analysis(ICA); ICA is effective in weakly damped modes(with less than 1% damping). Classical second-order BSS(SOBSS) algorithms such as multiple unknown signal extraction(AMUSE) and second-order blind identification(SOBI) were successfully applied to modal identification and enhanced damped mode separation [5,6]. [7,9] highlighted the limitations of SOBI and proposed a framework using Hilbert transform to estimate complex mode shapes. [10] proposed a method that combines SOBI and state-space realization on the basis of the non-Hermitian joint approximate diagonalization of the correlation matrices at several time lags; the method established a theoretical link between SOBSS and stochastic subspace identification(SSI) [2,11], which is a popular modal identification algorithm.
Unlike traditional methods, BSS is straightforward, computationally efficient and can function unsupervised; however, it can estimate only as many active modes as the number of output responses [12]. The BSS problem that occurs when the number of sensors is less than the number of the sources is referred to as underdetermined BSS(UBSS), which has received much attention in recent years [10,13]. Sparse component analysis(SCA) [14][15][16] was recommended to solve UBSS. Unlike traditional BSS methods that rely on source independence, SCA requires sources to have sparsity in the time domain or some transformed domain. Many studies proposed algorithms based on the sparsity in the TF domain [17][18][19][20][21][22]. In [17], singular value decomposition was used on wavelet coefficients to identify frequencies and damping ratios; this method is faster than other ridge detection algorithms. In [19], principal component analysis was performed on the wavelet coefficients to produce the partial mixing matrix of UBSS.
Speech signals are highly transient, consisting of voiced segments exhibiting a well-defined frequency structure, which are separated by noise bursts. In each short period of time, only a small number of frequency bands hold most of the energy, while other frequency bands contain little energy. Therefore, if the speech signals are transformed to time-frequency domain, magnitude of most of the time-frequency coefficients in the time window are approximate to zero, which meets the sparsity condition of SCA; hence SCA has been widely studied in speech processing [23][24][25][26][27][28][29][30]. [23,24] propsed the degenerate types of the unmixing estimation techniques under the assumption that the W-disjoint orthogonal condition occurs across the whole TF region. [25,26] proposed the types of time-frequency ratio of mixtures(TIFROM) methods to relax the restrictive orthogonal assumption. The TF coefficients of sources were allowed to overlap, but adjacent TF regions where only one source occurs must exist. In [27], the restriction of the TIFROM algorithm was further relaxed because only two adjacent points with a single source contribution in the same frequency bin were demanded. This relaxation was based on the fact that the magnitude of the mixture vector at each single-source-point(SSP) is proportional to that of one of the mixing column vectors. [28] reported a simple algorithm of identifying SSPs through hierarchical clustering. This algorithm achieved high estimation performance which could be further improved by using highly suitable clustering algorithms.
In OMA, the sources are exponentially-decaying sinusoids with different instinct frequencies, the sparse property of the sources in frequency or TF domain makes it suitable to execute modal identification using SCA. [31] studied SCA by using the sparsity of modal responses in the frequency domain, and estimated the mixing matrix and modal parameters. Instead of exploring frequency-domain sparsity, the present study investigates sparse representations in the TF domain. The proposed SCA consists of two steps: estimation of the mixing matrix and reconstruction of sources. The mixed signals are first transformed into the TF domain. The scatter plot of the TF coefficients reveals the hyperlines that correspond to the column vectors of the mixing matrix because of sparsity. The mixing matrix can be estimated through cluster analysis. SSP techniques [28] and K-hyperline clustering are combined to improve the performance of SCA because the estimation of the mixing matrix plays a key role in the latter reconstruction of sources. Finally, the theory of linear equations for determined BSS or basis pursuit de-noising (BPDN) for UBSS is used to recover the modal responses. Modal parameters are then estimated by using the single-degree-of-freedom(SDOF) parameter fitting method. The main contribution of this paper is to introduce the TF-SSP technique to OMA. The procedure of extracting SSPs can eliminate the points which don't lie in hyperlines and make estimation of mixing matrix more accurate, especially in the presence of noise. Numerical and experimental results show the accurate and robust identification performance of the method.
The paper is structured as follows. Section 2 introduces the theory of modal expansion, and Section 3 presents the derivation of SCA. Sections 4 and 5 provide the numerical simulations and experimental verification, respectively. The conclusion is drawn in Section 6.

Modal Expansion and BSS
For an n-degree-of-freedom linear system, the governing equations can be written as where M ∈ R n×n is the mass matrix, C ∈ R n×n is the damping matrix, and K ∈ R n×n is the stiffness matrices; f (t) is the external force, x is the n-dimensional displacement response vector, and dot denotes derivatives with respect to time.
On the basis of modal expansion theory, the system responses x(t) = [x 1 (t), . . . , x n (t)] T with diagonalizable damping that can be expressed as where Φ ∈ R n×n is the modal matrix that consists of the mode shape vector ϕ ϕ ϕ i ∈ R n , and q(t) = [q 1 (t), . . . , q n (t)] T is a matrix of the corresponding modal coordinates. In particular, in free vibration, that is f (t) = 0, the single-mode signals can be expressed as monotone exponentially-decaying sinusoids where ω i and ζ i are natural frequencies and damping ratios, respectively; ω di is the damped natural frequency, ω di = ω i 1 − ζ 2 i ; and u i and θ i are constants under corresponding initial conditions. To identify Φ and q(t) on the sole basis of x(t), a parametric model must be established by using traditional identification methods. The common method of SSI [2,11] is based on the state space model and uses the theory of subspace identification to identify the parameters of the model structure. However, this method has some drawbacks in terms of model order determination and spurious mode problems, which require the standard stabilization diagram to obtain exact result. Note that Equations (1) and (3) share the same format. The modal coordinate q(t) is a special case of the general sources s(t) with time structure, whereas Φ corresponds to the mixing matrix A. Furthermore, distinct modal coordinates are uncorrelated and thus automatically satisfy the requirement of sources in BSS. The coordinates are also spectrally monotone in the frequency domain and thus meet the sparsity assumption of SCA . Therefore, Φ and q(t) can be obtained from the output x(t) by applying BSS and SCA, both of which provide a straightforward and efficient alternative for output-only modal identification.

Sparse Component Analysis (SCA)
In this section, we discuss all techniques used in the proposed SCA. First, the sparsity of signals in different transformed domains is illustrated, and the reason for the use of TF domain is explained. Then, SSP detection and K-hyperline clustering are presented to improve the accuracy of the mixing matrix estimation. Finally, the algorithm for sources reconstruction is discussed.

Sparsity in Transformed Domain
A signal is considered sparse in some ξ-domains if only a small number of coefficients in the ξ-domain significantly differ from zero. The linear transformation that changes the time-domain BSS model to the sparse ξ-domain can be represented as follows: If s(ξ) is sufficiently sparse and disjoint at some ξ k , that is, s j (ξ k ) = 0 and s i (ξ k ) = 0 for i = j, then At these points (i.e., SSPs), only one source is active and the mixture vector x(ξ k ) is proportional in magnitude to one of the mixing matrix columns. Therefore plotting the magnitude of the mixture vectors in ξ-domain as a scatter diagram (e.g., a plot of x 1 (ξ) versus x 2 (ξ)) will form clusters and show a clear orientation toward the directions of the column vectors of the mixing matrix A. The number of active sources can be counted by recognizing significant directions. The clustering algorithm can be applied to estimate A.
Two mixtures are generated as The two mixtures are respectively transformed into the frequency and TF domain for comparison. Instead of Fourier transform, discrete cosine transform (DCT) is used in the frequency domain on account of the reason reported in [31]. In the TF domain, short-time Fourier transform (STFT), which has been successfully applied in speech signals, is used because of its simplicity and intuitiveness. Mirror operation and window function with long overlap are applied to solve the problems of edge effect and low accuracy. In this study, STFT is conducted with a Hanning window(length is 256 and overlap is 192). In order to better compare the result between DCT and STFT, the coefficient matrix Figure 1.
The scatter points in the time domain are disordered and reveal minimal information about the mixing matrix. However, the points in the frequency and TF domains form clusters around three significant directions, which correspond to the three columns of the mixing matrix A. Furthermore, the STFT result is superior to that of the DCT because only a small fraction of points remarkably reveal the directions in the DCT scatter diagram. DCT performance is thus sensitive to noise and outliers. In this study, we choose to perform SCA in the TF domain by using STFT.

Mixing Matrix Estimation
Estimation of the mixing matrix is important in SCA. Estimation accuracy can be improved in two ways: SSP detection and K-hyperline clustering.

SSP Detection
The SSP is the point where only one source is active, and the direction of the magnitude of the mixture vectors is proportional to that of the columns of the mixing matrix. SSP detection can thus reduce the effect of noise and outliers as well as improve clustering performance.
Equation (1) can be expressed in the TF domain using STFT as where . . , S n (t, k)] T represent the STFT coefficients of the mixtures and sources,respectively, in the kth frequency bin at time frame t; a i = [a 1i , . . . , a mi ] T is the ith column of the mixing matrix A. We assume a case with two sources and two sensors. If the (t 1 , k 1 ) in the TF domain is an SSP, that is, s 1 (t 1 , k 1 ) = 0 and s 2 (t 1 , k 1 ) = 0, then the following can be obtained from Equation (7): The real and imaginary parts can be equated through plural unfolding as In this case, the absolute directions of the real and imaginary parts are the same as that of a 1 . This condition is derived from the fact that only one source is active at an SSP. The real and imaginary parts of a non-SSP (t 2 , k 2 ) at which both sources are active can be written as If the absolute directions of the real and imaginary parts need to be the same, then the following condition must be satisfied: When extending to the general case of multiple mixtures and multiple sources, the condition that the absolute directions of Re{X(t, k)} and Im{X(t, k)} at any point (t, k) are the same requires that the ratios between the real and imaginary parts of the STFT coefficients of all source signals at those points must also be the same. However, the probability of satisfying this requirement is low and decreases as the number of sources increases [28]. Therefore, we can conclude that a point is an SSP if the absolute directions of Re{X(t, k)} and Im{X(t, k)} at a point (t, k) are the same; otherwise, the point is a multiple-source-point. However, the points that accurately satisfy this condition are limited. Therefore, [28] developed an algorithm that relaxes this condition for SSP detection. In the TF plane, the points where the difference between the absolute directions of Re{X(t, k)} and Im{X(t, k)} is less than ∆θ are taken as SSPs. This condition can be expressed as where | · | represents the absolute value, and y = y T y. The SSPs that satisfy Equation (14) can be preserved for the estimation of the mixing matrix. Figure 2 illustrates an example of speech utterances with five sources and two mixtures. All STFT coefficients are shown in Figure 2a, whereas the SSPs detected by using Equation (14) are shown in Figure 2b. The directions in Figure 2b are much clearer than those before SSP detection, and the scale of points for clustering is also reduced. Therefore, SSP detection makes SCA simple, effective, and robust to noise and outliers.

K-Hyperline Clustering
After SSP detection, estimating the mixing matrix A can be cast into a hyperline clustering problem. Several standard clustering methods have been used to extract directions; examples of these methods include K-means, fuzzy-C clustering [32] and the linear geometric ICA-based method [33]. However, these methods require the sources to be sufficiently sparse, and their performance in practice may be affected by many factors such as noise, outliers, and insufficient sparseness of sources, as mentioned in [16].  Considering these problems, [34,35] proposed and studied K-hyperline clustering method. In contrast to the K-means clustering method, K-hyperline allows each data sample to have a coefficient value that corresponds to the cluster centroid, and the cluster centroids are normalized to unit 2 −norm. Clustering can be rapidly realized by solving an optimization problem using singular value decomposition or an iterative linear scheme. Interestingly, this method is robust to the outliers and performs well even in noisy cases. Further discussion on this clustering method is not presented in this paper due to space limitation.

Reconstruction of Sources
Once the mixing matrix is estimated and denoted asÃ, the sources can be directly recovered by solvings(t) =Ã −1 x(t) for m = n or by using the least squares method for m > n. However, for the case of UBSS(i.e., m < n), an infinite amount of feasible solutions may be obtained in accordance with the theory of linear equations. From these solutions, we want to find the one that has the least error with our real sources. Given the sparsity of the sources in ξ-domain, the solution is required to have the fewest non-zero elements, which can be formulated as: (P 0 ) :s(ξ) = argmin||s(ξ)|| 0 , where ||x|| 0 denotes the 0-norm of x. However, (P 0 ) is subject to its non-convexity and high-discreteness and is therefore difficult to executed in practice. Replacing the subscript 0 with 1, we obtain the problem (P 1 ) :s(ξ) = argmin||s(ξ)|| 1 , which is also called the basis pursuit(BP) problem. [36] proved that (P 1 ) is a good approximation to (P 0 ). (P 1 ) can be realized by a well-defined convex optimization problem, whose solution can be guaranteed as global optimum and can be solved efficiently by using standard linear programming techniques. For the noiseless case, BP is a effective solution to recover the sparse sources. However, BP can not find a good solution in the presence of noise. Thus, we use BPDN technique to recover the sources in this manuscript. The optimization problem of BPDN is formulated as follows: The case σ = 0 corresponds to a solution of BP. The MATLAB solver SPGL1 is used to recover the sources from the time-frequency coefficients. For the noiseless case in Sections 4.1 and 4.2, the BP method is used, and the BPDN method is used in the experimental part and the noise case of the simulation part. Additional description of the 1 -minimization technique can be found in [36][37][38][39][40].
When the source signalss(ξ) in ξ-domain are recovered, the time-domain sourcess(t) in Equation (1) can be calculated by the corresponding inverse transform as Then the modal parameters can be extracted from the recovered sources by using the SDOF parameter fitting method or logarithmic decrement method.

Numerical Simulations
A 3-dof mass-spring-damper system ( Figure 3) is considered in this section to validate the effect of the proposed TF-domain SCA method. Different system parameters are selected for various situations, such as well-separated modes, highly-damped modes, and closely-spaced modes. In each situation, determined and underdetermined identification are both considered. The robustness to noise is studied in Section 4.3. The proposed algorithm is performed under the following settings. The simulation outputs are first transformed into the TF plane using STFT with a Hanning window length of 512 and an overlap of 384. Then, the SSPs in the TF plane are found by the value ∆θ of 10°. a time-frequency trade-off occurs when using STFT. The width of the windowing function relates to how the signal is represented-it determines whether there is good frequency resolution or good time resolution. a wide window gives better frequency resolution but poor time resolution. a narrower window gives good time resolution but poor resolution. The methods for obtaining high resolution STFT have been studied in [41,42]. [41] proposed a method to separate nonstationary and stationary signals overlapping in time-frequency domain; time and frequency varying windows and overlapped STFT was used to improve time and frequency resolution. [42] developed a high-resolution time-frequency representation to achieve robust estimation of multiple frequency hopping signals based on sparse Bayesian method. The source signals in OMA are exponentially-decaying sinusoids. Thus, the time resolution is not so important and we can select a wide window. Meanwhile, the overlap is set at 75% of the window length to obtain adequate single source points. In practice, the minimal natural frequency is another factor that should be considered in deciding the window length. As the model in Section 4.1 shows, the minimal natural frequency is 0.0895 (the period is 112 points at the sampling frequency 10 Hz); thus, the window length should contain at least several periods. On the basis of the reasons above, the window length and overlap are set as the ones above. ∆θ is set to 10 • to make sure that we can obtain adequate clean SSPs. The mixing matrix is obtained through K-hyperline clustering. With the estimated mixing matrix, the modal responses are directly recovered by solving Equation (1) in the time domain for the determined case. For the underdetermined case, the frequency-domain modal responses are first recovered by BPDN. Then the overlap add (OLA) method is used to execute inverse STFT to get the time sequence, details of OLA method can be found in [43]. After the inverse STFT is carried out, the time-domain modal responses are obtained. Finally, the natural frequency and damping ratio of each estimated modal response can be calculated by using either the simple SDOF parameter fitting or logarithmic decrement method.
The modal assurance criterion (MAC) is used to determine the errors of the estimated mode shape as whereφ ϕ ϕ i and ϕ ϕ ϕ i are the ith estimated and theoretical mode vectors, respectively. The MAC value ranges from 0 to 1, in which 0 means no correlation and 1 indicates complete correlation. For the underdetermined case, the estimated mode vectors have lower dimensions than the theoretical ones. The corresponding rows of the theoretical mode vectors are selected to constitute the partial mode shapes. Thus the MAC indicates the estimating accuracy of the partial mode shapes.

Well-Separated Modes
First, the case of well-separated modes is considered. The system parameters, identical to those in [31], are set as follows:

Determined Modal Identification
In this section, all sensor measurements are used for identification. The responses of the free vibration are shown in Figure 4, and the scatter diagram of the identified SSPs is shown in Figure 5, from which we can see that the SSPs form three clear directions. The mixing matrix, that is, the matrix of mode shape vectors, can then be accurately estimated through K-hyperline clustering. The performance can be indicated by the high MAC values presented in Table 1. The recovered time-domain modal responses, as shown in Figure 6, have waveforms that are similar to those of monotone exponentially decaying sinusoids. The frequencies and damping ratios are estimated from these modal responses, as shown in Table 1. Compared with the theoretical values, the proposed algorithm obtains excellent results regardless of damping levels.

Underdetermined Modal Identification
In the underdetermined case, the first two measurements are utilized. Figure 7 shows the 2D scatter of the SSPs. Furthermore, three clear directions represent the mode shape vectors. The 1 -optimization algorithm is used to recover the time-domain modal responses, and the results of the case where α = 0.08 are shown in Figure 8. The high MAC values and the accurately identified parameters are presented in Table 1. These results reveal that the proposed algorithm is also suitable for underdetermined modal identification with limited sensors.
The TF-domain SCA in this study achieves a better performance compared with the frquency-domain SCA in [31]. On the one hand, more points remarkably reveal the orientation, which make the directions more obvious and recognizable. This feature is helpful in enhancing estimation accuracy of the mixing matrix. On the other hand, the system parameters and mode shapes are identified with high accuracy.

Closely-Spaced Modes
Closely-spaced modes are studied in this section to further evaluate the capacity of the proposed SCA. The model is set up with the following parameters:  Figure 9 shows the free vibration of the model under the same initial conditions as those in Section 4.1. As indicated in the spectrum, the second and third modes cannot be clearly separated. However, the scatter plot of the identified SSPs in Figure 10 reveals three significant directions. Although the angle between the two of them is slightly small, high MAC values can be obtained as presented in Table 2, where the estimated natural frequencies and damping ratios in the determined and underdetermined cases are also displayed. The identified parameters match the theoretical ones well. Figure 11 shows the recovered modal responses in the underdetermined case.
Similar to the frequency-domain SCA, the TF-domain SCA is effective for closely-spaced modes.   Frequency (Hz) Figure 11.
Recovered modal responses using two sensors in the case of closely-separated modes.

Robustness to Noise
To investigate the robustness to noise, the simulation outputs are corrupted by Gaussian white noise, the root mean square(RMS) amplitude of which is set to a certain percentage of the signal RMS value. The model is the same as that used in Section 4.1. The identified SSPs and the recovered modal responses under 10% noise are shown in Figures 12 and 13. Although the addition of noise results in a confusing scatter, many points are still distributed around the direction lines, and their average-value are used to reduce the effect of noise. BP and BPDN are used to recover the sources. Comparison shows that BPDN performs better than BP in the presence of noise. To evaluate the performance of the proposed algorithm with increasing noise percentage, 5%, 10%, 15% and 20% RMS noise are added to the original source. Fifty groups of sources with different white noise under each noise level are processed with the proposed algorithm because of the non-deterministic characteristic of the noise. The mean values of the MAC of the successful identifications are shown in Table 3. The identification is considered successful when the MAC of each identified mode is greater than 0.95. The high MAC values shown in Table 3 indicate that the proposed algorithm can also achieve good results in the presence of noise. The proposed algorithm may fail when the noise level increases to a certain extent (20% for this example).  The scatter diagram of one failed case under 20% noise leve is shown in Figure 14a. The valuable single source points that correspond to the shortest hyperline are flooded in the noise, and the direction vector cannot be found in this case. There is another condition in which the method may fail due to noise exists. We have examined the performance for the close-separated modes in Section 4.2. The scatter plot of the mixture when the sources are polluted by noise (10%) is shown in Figure 14b. This figure shows that the single source points deviate from their original position(due to noise) and become mixed.

Experimental Setup
A steel cantilever beam that measures 0.31 m × 0.0012 m × 0.0004 m is used to validate the effectiveness of the proposed modal identification algorithm. The steel beam is made of carbon tool steel whose Young's modulus and density are 2.06 × 10 11 N · m −2 and 7.85 × 10 3 kg · m −3 , respectively. One end of the steel beam is fixed on a table, and the length from the fulcrum to the end is 0.28 m.
When excitation is provided by human finger tapping, the steel beam begins to vibrate. To capture the displacement responses, a high-speed camera is used instead of traditional accelerometer. Figure 15a shows the experimental setup, and Figure 15b shows the high-speed camera head. Twenty-one locations with a spacing of approximately 0.01 m are marked for displacement measurement. Gray 8-bit images can be transferred at 500 fps (in a 150 × 1200 pixel area, one pixel corresponds to 0.0002 m) by using the device. The camera head is connected to a PC. After image acquisition, the displacement responses at marked locations are extracted by image processing. The displacement responses extracted from some of the locations are shown in Figure 16.  Figure 16. System responses of the steel beam extracted by image processing.

Cantilever Beam Theory
The vibration of the cantilever beam satisfies Euler-Bernoulli beam theory, by which the governing differential equation without damping is formulated as where a = EJ/ρA, E is Young's modulus, J is the second moment area of the beam cross-section, ρ is the material density, and a is the cross-sectional area of the beam. Under the boundary conditions of the cantilever beam, the natural frequency function and corresponding mode shape function can be calculated as where β i l is determined by cos(βl)ch(βl) = −1, and l is the length of the cantilever beam. The natural frequencies of the former three orders based on the substitution of the actual parameters of the steel beam are shown in Table 4. The corresponding mode shapes are shown in Figure 17.

Results Analysis
To accomplish the modal analysis of the steel cantilever beam, the proposed SCA algorithm is applied to the displacement responses. Figure 18 depicts the scatter diagram of the identified SSPs using two of the measurements, which reveals three significant directions. This result indicates that three modes of the cantilever beam are excited. The modal responses recovered in the determined case using all the displacement responses as well as that using only two measurements (1st and 10th markers) with the proposed SCA are shown in Figures 19 and 20. The recovered responses are similar to the monotone exponentially decaying sinusoids. Modal responses recovered by the proposed SCA using all of the 21 measurements.  Figure 20.
The modal responses recovered by the proposed SCA using only two measurements.
As a common and effective method for modal identification, SSI is also used in this study for comparison with the proposed SCA. To avoid the drawbacks of the SSI, all displacement responses are used as the input in subspace identification. The detailed results of the corresponding natural frequencies and damping ratios are listed in Table 4. The identified mode shapes by both SSI and the proposed SCA in the determined case are shown in Figure 17 for contrast with the theoretical ones. In Table 4, the identified natural frequencies and damping ratios obtained by the proposed SCA in both the determined and underdetermined cases agree very well with those obtained by SSI. The identified frequencies also match the theoretical results. In addition, the identified mode shapes by the two methods show a high correlation with the theoretical waveforms in Figure 17, as also indicated by the MAC values. a comparison between the underdetermined case and the determined one shows that the former uses fewer measurements but attains a similarly good performance.
This experiment successfully validates the effectiveness of the proposed SCA for OMA in practice. Especially for the underdetermined case, similar performance can be obtained despite the use of limited measurements, whereas the SSI method needs more measurements to ensure good results. The proposed method provides a new way for the output-only modal identification.

Conclusions
This paper presents a new SCA method for OMA. Compared with existing methods, the proposed method explores the sparse representations in the TF domain to perform output-only modal identification. Some clusters are formed when the mixed signals are transformed into the TF plane. SSP detection and K-hyperline clustering are combined to improve the estimation accuracy of the mixing matrix. The SSP is the point in which only one source is active, and the direction of the magnitude of the mixture vectors are proportional to those of the columns of the mixing matrix. Therefore, SSP detection can reduce the effect of noise and outliers as well as improve the clustering performance. After SSP detection, the estimation of the mixing matrix is cast into a hyperline clustering problem. Therefore, K-hyperline clustering is introduced to fulfill the clustering task. After the mixing matrix is estimated, the theory of linear equations for determined BSS or the 1 -minimization technique for UBSS is used to recover the modal responses, from which the modal information (natural frequency, damping ratio, and mode shapes) can be extracted. The numerical simulations and experimental results demonstrate the effectiveness of the proposed SCA method in various cases. a comparison of the proposed method with SSI reveals that the two methods show equal performance in the experimental section. Further investigation is required to explore the applications of the proposed SCA in complex modes cases, which are not covered in this study.