Bayesian Compressive Sensing Approaches for Direction of Arrival Estimation With Mutual Coupling Effects

The problem of estimating the dynamic direction of arrival (DOA) of far-field signals impinging on a uniform linear array, with mutual coupling effects, is addressed. This paper proposes two novel approaches able to provide accurate solutions, including at the endfire regions of the array. First, a Bayesian compressive sensing Kalman filter is developed, which accounts for the predicted estimated signals rather than using the traditional sparse prior. The posterior probability density function of the received source signals and the expression for the related marginal likelihood function are derived theoretically. Next, a Gibbs sampling-based approach with indicator variables in the sparsity prior is developed. This allows sparsity to be explicitly enforced in different ways, including when an angle is too far from the previous estimate. The proposed approaches are validated and evaluated over different test scenarios and compared to the traditional relevance vector machine (RVM)-based method. An improved accuracy in terms of average root-mean-square error values is achieved (up to 73.39% for the modified RVM-based approach and 86.36% for the Gibbs sampling-based approach). The proposed approaches prove to be particularly useful for DOA estimation when the angle of arrival moves into the endfire region of the array.


I. INTRODUCTION
Direction of arrival (DOA) estimation is the process of determining which direction a signal impinging on an array has arrived from.Commonly used methods of solving this problem are: MUSIC [1], [2], ESPRIT [3]- [6] and the maximum likelihood DOA estimator [7]- [9].However, these methods have some disadvantages, in particular they require knowledge of the number of signals present beforehand and evaluation of a covariance matrix of the array output (adding computational complexity).
Compressive Sensing (CS) theory says that when certain conditions are met it is possible to recover signals from fewer measurements than used by traditional methods [10], [11].Hence, CS can be applied to the problem of DOA estimation [12]- [15] by splitting the angular region into N potential DOAs, where only L << N of the DOAs have an impinging signal (alternatively N − L of the angular directions have a zero-valued impinging signal present).These DOAs are then estimated by finding the minimum number of DOAs with a non-zero valued impinging signal that still give an acceptable estimate of the array output.
The problem can also be converted into a probabilistic form and solved via Bayesian compressive sensing (BCS) [16], implemented with a relevance vector machine (RVM) [17]- [19].Such a method has been used to solve the problem of static DOA estimation [20], [21], where a belief of having a sparse received signal is made and the most likely values found.
The Kalman filter (KF) can be used to track dynamic DOAs, with the angular range narrowed to focus in more closely on the DOA estimate from the previous iteration [22].However, this prevents directly working with the measured array signals and introduces an additional stage of having to reevaluate the steering vector of the array at each iteration of the KF.Hierarchical KFs have been used to track dynamic sparse signals [23], [24], where the predicted mean of the signals at each iteration is taken as the estimate from the previous iteration and the hyperparameters are estimated using BCS, hence the term Bayesian compressive sensing Kalman Filter (BCSKF).
However, a problem remains when a BCSKF is applied to dynamic DOA estimation with a uniform linear array (ULA).The estimation accuracy can be reduced when the DOA approaches the endfire region of the array, i.e. when the impinging signal arrives parallel to or almost parallel to the array.This can be particularly problematic when there is a lot of noise present.
An additional challenge to address when considering the DOA estimation problem is that of mutual coupling.One way of modeling the mutual coupling effects is to use a mutual coupling matrix [25], [26].In [25] the mutual coupling matrix is found using two methods: minimum mean-square matching and the mutual impedance method.The method in [26] applies a symmetric Toeplitz matrix, where only antennas within a set separation of each other can cause mutual coupling effects.In this work the method in [26] is used to ensure mutual coupling effects are included in the signal model.The contributions of this paper are: i) A BCSKF with a modified RVM, where the traditional sparsity prior is replaced with a belief that the estimated signals will instead match predicted signal values, is proposed.The result of this new prior is that a new posterior distribution and marginal likelihood have been derived.Initial results for this method using a signal model without mutual coupling have been reported in [27].ii) A Gibbs sampling approach is proposed.In this approach zero valued signals can be explicitly enforced when there is too large a change in DOA in order to alleviate the estimation accuracy problem for the endfire region of the array.iii) A comprehensive performance evaluation is provided, with the proposed methods being compared to a BCSKF using the traditional RVM approach.Significant improvements in terms of the average root mean square error (RM SE) values are observed (up to 73.39% for the BCSKF with modified RVM and up to 86.36% for the Gibbs sampling approach).
The remainder of this paper is structured in the following manner: Section II gives details of the proposed estimation methods, including the array model with mutual coupling effects (II-A), the modified RVM framework for BCS (II-B), the BCSKF (II-C) and the Gibbs sampling implementation (II-D).In Section III an evaluation of the effectiveness of the proposed approaches is presented and conclusions are drawn in Section IV.

A. Array Model
A narrowband ULA structure consisting of M omnidirectional antennas, with identical responses is shown in Figure 1.Here, a plane-wave signal mode is assumed, i.e. the signal impinges upon the array from the far field and the angle of arrival is limited to 0 • ≤ θ ≤ 180 • .The distance from the first antenna to subsequent antennas is denoted as d m for m = 1, 2, . . ., M , with d 1 = 0, i.e. the distance from the first antenna to itself.Note, these values are multiples of a uniform adjacent antenna separation of ∆d.
The steering vector of the array is given by where Ω = ωT s is the normalised frequency with T s being the sampling period, µ m = dm cTs for m = 1, 2, . . ., M , c gives the wave propagation speed and {•} T denotes the transpose operation.
The array output, y k , at time snapshot k is then given by where given by a zero mean multivariate Gaussian random variable and A st = [a(Ω, θ 1 ), a(Ω, θ 2 ), ..., a(Ω, θ N )] ∈ C M×N is the matrix containing the steering vectors for each angle of interest.Note, N is the number of points in the grid of potential DOAs the angular region has been split into.However, only L << N of these angular directions will have an impinging signal present.
In practice there will also be mutual coupling effects present, which alter the pattern of an individual antenna as compared to if it was being used on its' own.As a result (2) has to be altered to account for this fact.A mutual coupling matrix is used to achieve this [26], by giving the true steering vector matrix as Here M MC ∈ C M×M is the mutual coupling matrix given by In (4) the mutual coupling coefficients are given by m i = ρ i exp{jφ i } for i = 2, ..., D − 1, D, ..., M , where ρ i and φ i give the amplitude and phase, respectively.The variable D places a limit on the separation between antennas above which there will be no mutual coupling effects.In other words when i > D, then ρ i = 0.This then gives the following: Equation ( 5) can then be split into real and imaginary components (given by R(•) and I(•), respectively) as follows The difference between y k and ỹk is that y k has been split into its real and imaginary components in ỹk .As a result the dimensions of ỹk are increased.A similar relationship exists between A and Ã, x k and xk and n k and ñk .

B. Modified Relevance Vector machine for DOA Estimation
The aim is to now find a solution for xk which gives the closest possible match to a predicted set of signal values.To achieve this one can follow a modified RVM framework [27], by evaluating the following where σ 2 k is the variance of the Gaussian noise n k , p k = [p k,1 , p k,2 , ..., p k,2N ] T contains the hyperparameters that are to be estimated and xp = [R(x p ) T , I(x p ) T ] T = [R(x p,1 ), ..., R(x p,N ), I(x p,1 ), ..., I(x p,N )] T holds the predicted values of xk .
From (6) it is possible to find: The traditional RVM would now apply a belief that xk is sparse.However, here this is changed to a belief that xk will match the predicted signals xp : Note, when xp = [0, 0, ..., 0] T then (9) reverts to the hierarchical prior used in the traditional RVM [16], [17] and |P k | indicates the determinant of P k , where It is also necessary to define the hyperparameters over p k and σ 2 k .There are various possibilities for the structuring of the priors on p k , which represent mixing parameters in a scale mixture of normals representation of the marginal distribution of x k , which will here be in the Student-t family, see e.g.[28].One possibility would be to treat the complex components of x k as complex Student-t distributed, as detailed in [29], [30].However, this work treats the real and imaginary components of x k as independent Student-t distributed random variables, and hence there are independent Gamma priors for the mixing variables p k,n over all real and imaginary components of x k : A Gamma prior can also be used for σ where β 1 , β 2 , β 3 and β 4 are scale and shape priors.
It is known that and where the covariance matrix and the mean are given by respectively.Note, the maximum of ( 13) is the posterior mean µ.For a derivation of (13) please see Appendix A.
Similarly to [17], the probability P(σ 2 k , p k |ỹ k , xp ) can be represented in the following form: where P(x p ) is constant as fixed values are used and the second two terms on the right of are constant if as in [17].Therefore, maximising This can be achieved by a type 2 maximisation of its logarithm, which is given by (please see Appendix B): where This is now differentiated with respect to p k,n and σ −2 k to obtain the update expressions where For the derivation of ( 18) and (19) please see Appendix C. The maximisation is then achieved by iteratively finding Σ and µ, followed by p new k,n for n = 1, ..., N and σ 2 k,new until a convergence criterion is met [16], [17].In other words, the new estimates for the noise variance and precision hyperparameters found from (19) and ( 18) are then used in ( 14) and ( 15) to find new estimates of the covariance matrix and mean of the distribution in (13).Note that when xp = [0, 0, ..., 0] T the update expressions match those used by the traditional RVM.
The final estimate of the received signals is then given by where σ 2 k,opt and P k,opt = diag(p k,opt,1 , p k,opt,2 , ..., p k,opt,2N ) are the result of optimising the noise estimate and hyperparameters, respectively.Now xk,opt can be used to reconstruct the estimated signals as where n = 1, 2, ..., N .
The thresholding scheme in [20] can then be applied to keep the L most significant signals.To do this find the total energy content of the estimated received signals and then sort them.A threshold value, η, is then defined as a percentage of the energy content that is to be retained.Starting with the most significant estimated signal, the estimated signals are summed until the threshold is reached and the remaining signals are then set to be equal to 0. The remaining non-zero valued signals then give the DOA estimates and L is an estimate of the number of far field signals impinging on the array.

C. Bayesian Compressive Sensing Kalman Filter
In order to track the changes in the DOA estimates at each time snapshot the modified RVM based DOA estimation procedure detailed above is combined with a Bayesian KF, giving a BCSKF for DOA estimation [27].The signal model described above is again used along with the prediction and update steps of the BCSKF.Here, k|k − 1 indicates prediction at time instance k given the previous measurements and ∆x is determined by the assumed DOA change.Note, ∆x is fixed by the predetermined constant motion rather than being a random noise term.For example, if the angular range is sampled every 1 • and the DOA is assumed to increase by 2 • then ∆x will be selected to increase the index of the non-zero valued entries in xk−1|k−1 by two to give the index of the non-zero valued entries in xk|k−1 .
At each time snapshot it is necessary to estimate the noise variance and hyperparameters in order to evaluate the prediction and update steps of the BCSKF.This is done by considering the log likelihood function given by which can be optimised by following the procedure described in Section II-B.In other words we apply the modified RVM framework to ỹe,k , using the KF prediction xk|k−1 as the expected estimate values xp .It is worth noting that the continued accuracy of the proposed BCSKF relies on the accuracy of the initial estimate and the parameter values selected.If the initial estimate (made using the framework described in Section II-B and xp = [0, 0, ..., 0] T ) of the received signals is accurate and sparse, then the priors that are enforced will ensure this continues to be the case.However, an inaccurate initial DOA estimate or poorly matched expected DOA change can lead to the introduction of inaccuracies in subsequent estimates.Similarly, if the initial estimate of the received signals is not sparse then subsequent estimates are likely to not be sparse.As a result, care should be taken when choosing the initial parameter values and determining the likely DOA change.

D. Gibbs Sampling for DOA Estimation
The method described in the previous sections based on a BCSKF with a modified RVM required the use of prior knowledge of the predicted change in DOA.However, in practice this may not always be known, making it important to have an alternative method that can still give improved accuracy for the endfire region.
This work proposes using a sparsity prior which is given as a combination of a point mass concentrated at zero (Dirac delta function) and a zero mean Gaussian distribution, [31]- [33], giving T .Note, zk,n is the indicator variable for xk,n and determines which of the two components in (25) is selected.When zk,n = 0, the value of xk,n is determined solely by the point mass concentrated at zero.As a result, xk,n = 0 and sparsity is explicitly introduced.Alternatively, when zk,n = 1 the value of xk,n is determined by the Gaussian distribution allowing a non-zero valued estimate.The repetition of z k in zk means that the same indicator variable is used for both the real and imaginary parts of each entry in x k .
This indicator value can also be used to address the endfire accuracy problem by selecting the value of z k,n = 0 if |n−i| > j.Here i is the index of the closet non-zero valued estimate from the previous time snapshot and j defines a maximum allowed change in the DOA estimate.Only n = 1, 2, ..., N is considered to get the entries for z k , with zk then being found as previously stated.
This leaves the following where z 1 k,n and z 2 n are defined by the following Beta distributions In order to enforce zero-valued estimates when |n − i| > j, it is necessary to select β 2 5 and β 2 6 to ensure a zero-valued z k,n is preferred.However, when |n − i| ≤ j it is necessary to choose β 1  5 and β 1 6 so that the chances of z k,n = 0 and z k,n = 1 are equal.This can be achieved by where The posterior distribution of xk can be written as [33] P Now also define Ãn as being the entries in Ã relating to the index n and Ã−n are the entries of Ã excluding the entries relating to index n (and similarly for xk ).Then as per Appendix D this gives where p 0 = 1/σ 2 k and ỹk,n = ỹk − Ã−n xk,−n .There are two further posterior distributions that have to be considered.That is the distributions for p k,n and p 0 which are given by 2 ) (34) and respectively.Note, in (34) x k−1,nj gives the entries within x k−1 that have an index within the distance j of index n.By using x rather than x to find x k−1,nj it guarantees the same value of ||x k−1,nj || 2 2 and ||x k−1,nj || 0 for both the real and imaginary components.
As a result the Gibbs sampling steps are as detailed below: These steps are done for each of the T iterations of the Gibbs sampler, where the first T BI iterations are the burnin iterations.The final estimate of the received array signals is then given by the mean values of the final T − T BI iterations [32].The DOA estimate can then be found using the previously described thresholding scheme (see II-B), with the remaining non-zero valued estimates corresponding to the DOA estimates.
Note, the performance of this method will again heavily depend on the accuracy of the first estimate.As a result, it is possible to use the traditional BCS DOA estimation method (Section II-B with xp = [0, 0, ..., 0] T ) to ensure an as accurate as possible intial estimate at the first time snapshot.The proposed Gibbs sampling based method can then be used at the subsequent time snapshots to get the next DOA estimate.

III. PERFORMANCE EVALUATION
In this section a comparison in performance of the proposed methods and the traditional RVM based BCSKF method will be made over five example scenarios, under the same test conditions.Firstly, an example is considered where the initial DOA starts outside of the endfire region and then moves into it.Secondly, an example is given where the DOA remains out of the endfire region.In the third scenario the initial DOAs and the signal values are randomly generated.Then the evaluation will also consider the scenario where there is a mismatch between the actual and assumed change in DOA.Finally, the evaluation will consider a random change in DOA at each time snapshot.This means that ∆x which is selected for the modified RVM based BCSKF will not be a true reflection of how the DOA actually changes for the last two examples.
Note, the term traditional RVM based BCSKF method means the entries of P k in the prediction step of the BCSKF are found using the RVM optimisation method as detailed in [16], [17].In other words this is the method detailed in Section II-B with xp = 0.All of the examples are implemented in Matlab on a computer with an Intel Xeon CPU E3-1271 (3.60GHz) and 16GB of RAM.
The performance of each method will be measured using the RM SE in the DOA estimate.This is given by where θ l is the actual DOA, θl is the estimated DOA and Q is the number of Monte Carlo simulations carried out, with Q = 100 being used in each case.This gives a measure of the estimation accuracy and the computation time will be used as a measure of the complexity of each method.
For the Gibbs sampling method a burn-in period of 250 iterations is used and then 50 further iterations used to find the final estimate of the received array signals.When a distance of j = 5 • is exceeded a zero-valued estimate of the received signals is enforced in order to alleviate the endfire accuracy problem.
For all the design examples considered the selection of σ 2 k = 0.4 as the noise variance is used, with an initial estimate of the noise variance given by σ 2 k,0 = 0.1.The array geometry being used is that of a ULA with M = 20 antennas and an adjacent antenna separation of λ 2 , where λ is the wavelength of the signal of interest.This gives an array aperture of 9.5λ.For the mutual coupling matrix a value of D = 3 is selected, meaning that a separation of 1.5λ or greater gives negligible mutual coupling effects.The values ρ 1 = 0.65, ρ 2 = 0.25, φ 1 = π/7 and φ 2 = π/10 are then also used.Finally, in each example a single narrowband signal impinging on the array is considered, meaning L = 1.

A. Endfire Region
For this example the initial DOA of the signal is θ = 20 • , which then decreases by 1 • at each time snapshot.The signal value at each snapshot is set to be 1.Table I summarises the performance of the three methods for this example, with the RM SE values at each time snapshot being shown in Figure 2.
Here it can be seen that there has been a significant decrease in the average RM SE values for both the modified RVM method (67.51% improvement) and the Gibbs sampling based method (80.78% improvement).Overall this suggests that a more accurate estimate of the DOA is possible.It is worth noting that there has still been an increase in the RMSE for the modified RVM based approach in the endfire region of the angular range.However, this has come much later on the than for the traditional RVM based approach (indicating a degradation in performance for a smaller angular range) and the maximum RM SE value reached is lower (indicating the degradation is less severe).
These improvements have come at the cost of an increased computation time in both instances.For the modified RVM method this increase is insignificant as the average computation time is still less than one second.The increase for the Gibbs sampling based method is larger, illustrating an increase in computational complexity.However, it is worth remembering that this increase has resulted in a more accurate DOA estimate being achieved without prior knowledge about what the change in DOA will be.
In this instance the results suggest that one of the two proposed methods should be used when the estimated DOA approaches the endfire region of the array.If the change in DOA is known in advance and computational complexity is a primary concern then the modified RVM based method is the most suitable (a more accurate estimate can be achieved  without a large increase in computation time).However, when this information is not available, or computational complexity is not a concern, it is possible to get a significant improvement in accuracy (at the cost of computation time) using the Gibbs sampling based method.
In the previous simulation an adjacent antenna separation of λ/2 is used as it is known that this is the largest separation that can be used while still avoiding a degraded performance due to the introduction of grating globes [34].However, an example of what the relative performance of the methods is when a smaller adjacent antenna separation will now be considered.
In this instance an adjacent antenna separation of λ/4 is selected.As the array aperture is kept constant (to allow a fair comparison between adjacent antenna separation sizes) this means the number of antennas is given by M = 39.This also means a value of D = 9 is required to keep the same distance limits on mutual coupling occurring.The values of ρ i and φ i are then selected to be uniformly spread over the range of 0.65 to 0.25 and π/7 to π/10, respectively.The remaining parameters are kept constant and the same test scenario as for the previous example is used.
The performances of each of the methods in this instance are summarised in Table II and Figure 3, respectively.Here it can be seen that the larger number of antennas used has resulted in a lower average RM SE values for all three of the methods.In this instance only the modified RVM method has performed better that the traditional RVM based method when comparing average RM SE values (decrease in average RM SE of 61.09%).However, by looking at the maximum RM SE values it can be seen that the largest estimation error possible with the traditional RVM based method is larger than that for the Gibbs sampling based method (16.42 • as compared  It is worth noting that such an array configuration is unlikely to be used in practice.This is due to the costs associated with the number of antennas required.As a result, the remaining examples will stick to the adjacent antenna separation of λ/2 and associated parameters previously defined.

B. Non-Endfire Region
For this example the initial DOA is θ = 100 • with the DOA increasing by 1 • at each time snapshot, with the signal value remaining constant at -1.The performance of the three methods is summarised in Table III, with the RM SE values illustrated in Figure 4.
In this instance it can be seen that there has not been as large an increase in RM SE for the traditional RVM method, as the DOA does not enter the endfire region.However, both the modified RVM and Gibbs sampling based methods have managed to achieve improvements in average RM SE values of 36.42% and 50.00%, respectively.For the Gibbs sampling based method this comes at the expenses of an increase in computation time, whereas the time for the modified RVM based method is comparable to the traditional RVM based method.As with the previous test scenario this would suggest that the modified RVM based method should be used when the expected DOA change information is available and the Gibbs sampling based method when this is not the case, or when computational complexity is not a major concern.

C. Random Initial DOA
Next consider the case where the initial DOA is randomly chosen from the entire angular range and increased by 1 • at each time snapshot.The signal value is randomly selected  as ±1 for each simulation and remains constant as the DOA changes.
Table IV and Figure 5 summarise the performance of the various methods in this instance.Again, it can be seen that the modified RVM based method has offered improvements in terms of RM SE(73.39%),without a significant increase in computation time.The Gibbs sampling based method has also given an estimation accuracy improvement (76.12%) and has even outperformed the modified RVM based method, without prior knowledge of how the DOA was going to change.However, this has come at the expense of an increased computation time.

D. Mismatched Actual and Assumed DOA Change
This subsection compares the performances of the estimation methods for two situations where the actual change in DOA is not known.First, consider the case where there is an initial DOA of θ = 100 • which increases by 1 • for 9 time snapshots before decreasing by 1 • for the remaining time snapshots.In this instance, assume a constant signal value of 1 throughout.
The performance comparison is now made between the traditional RVM based method, the modified RVM based method with the assumed DOA change set to a constant increase of 1 • , the modified RVM based method with the assumed DOA change set to a constant decrease of 1 • and the Gibbs sampling based method.The performances for each are summarised in Figure 6 and Table V, respectively.
In this instance the average RM SE values suggest a comparable performance in terms of estimation accuracy between the traditional RVM based methods and the two modified RVM based examples.This can be explained by the fact that for both of the modified RVM based examples, the assumed  DOA change does not match the actual DOA changes for the entire time range which means the same improvements as for the previous scenario can no longer be guaranteed.Figure 6 highlights this in the results for the two modified RVM examples.It demonstrates that with an assumed increasing DOA the modified RVM offers some initial improvements, while the performance is significantly degraded when the DOA starts to decrease again.On the other hand, the example with an assumed decreasing DOA performs worse than the traditional RVM based method initially and then offers significant improvements when the actual DOA also starts to decrease.It can also be seen that for the Gibbs sampling based method there has been an improvement in DOA estimation accuracy.In terms of average RM SE values this is a decrease of 68.60%, which has been achieved without any knowledge of how the DOA was going to change.However, there is again an increase in the computation time.
To illustrate how a larger mismatch between actual and assumed DOA changes effects the performance of the modified RVM based method now consider an example where the actual DOA is increasing by 1 • in each snapshot, while the assumed change is a decrease of 3 • .Here, the initial DOA is 100 • , with a constant signal value of -1.The RM SE values for the methods are shown in Figure 7 and summarised in Table VI along with the computation times.
Here it can be seen that the Gibbs sampling based method has offered an 86.36% improvement in average RM SE as compared to the traditional RVM based method.There has again been a significant increase in the computational com-  For the modified RVM based method the average RM SE values suggests that there has been an improvement in estimation accuracy.However, this is smaller than when the actual and assumed DOA changes match.It is also unlikely that this improvement would be obtained in every scenario the the method could be applied to.From looking at Figure 7 we can see that the traditional and modified RVM based methods are showing comparable performance for the just over half of the time frame considered.This is the relative performance that would be expected in the majority of cases.

E. Random Changes in Direction of Arrival
Finally, consider the example where the initial signal value is assumed to be equal to 1 and the initial DOA is chosen to be 100 • .The actual DOA is then allowed to randomly change by up to ±3 • for each time snapshot.For the modified RVM method assume that the actual DOA change is an increase of 3 • .This gives the results as summarised in Table VII and Figure 8.
It can be seen that the Gibbs sampling based method has again outperformed the modified RVM based method in terms of estimation accuracy, due to the fact that no prior knowledge of how the DOA will change is required.As compared to the traditional RVM based method there has been an improvement in RM SE of 78.81%.However, as is expected this is at the cost of computation time.
It is also worth noting that the average RM SE values suggest that the modified RVM and traditional RVM have offered a comparable performance.This is due to the fact that the assumption of how the DOA will change is not valid,  meaning the modified RVM no longer offers any improvements.Therefore, in this situation the Gibbs sampling based method would be the best to use, assuming computational complexity is not the main motivating factor.

IV. CONCLUSIONS
This paper has proposed two novel approaches for the estimation of a dynamic direction of arrival using uniform linear arrays with mutual coupling.The first approach is a Bayesian compressed Kalman filter with a modified relevance vector machine, where the traditional sparsity assumption is replaced by an assumption that the estimated signals will instead match predicted signal values.This results in the derivation of a new posterior probability density function of the received signals and the expression for the related marginal likelihood function.The second proposed approach is a Gibbs sampling approach, where sparsity is explicitly enforced if there is a large difference between the previous direction of arrival estimate and the angle currently being considered.The proposed approaches will be particularly useful when applied to the problem of dynamic direction of arrival estimation in the endfire region of antenna arrays.Such problems can arise in numerous application areas such as in communications and surveillance.
An extensive performance evaluation is provided and shows that both of the proposed approaches outperform the traditional relevance vector machine based Bayesian compressive sensing Kalman filter in terms of mean root mean square error values, by up to 73.39% for the modified relevance vector machine based method and 86.36% for the Gibbs sampling based method.

A. Derivation of Posterior Distribution
Bayes' rule gives where P(ỹ k |x k , σ 2 k ) and P(x k |p k , xp ) are known from ( 8) and ( 9), respectively.Now following the method suggested in [17] carry out the multiplication on the right hand side of (37), collect terms in xk in the exponential and complete the square.
where Σ and µ are given by ( 14) and (15), respectively.This then gives the posterior distribution as (13), with the remaining exponential terms

B. Derivation of Marginal Likelihood
From (37) the following is known: meaning the term in the exponential will be (39) where Therefore the exponential term is given by The term outside of the exponential is given by Using the Woodbury matrix inversion identity gives which means where tr(•) indicates the trace.As tr(Σ ÃT Ã) can be written which in turn gives (19).
Thus giving us (33), allowing us to write the posterior distribution for x k,n in the form given in (30).

Fig. 1 .
Fig. 1.Linear array structure being considered, consisting of M antennas with a uniform adjacent antenna separation of ∆d.

Fig. 3 .
Fig. 3. RM SE values for the endfire region example with reduced antenna separation.

Fig. 5 .
Fig. 5. RM SE values for the random initial DOA example.

Fig. 6 .
Fig.6.RM SE values for the increasing DOA followed by decreasing DOA example.

Fig. 7 .
Fig. 7. RM SE values for the increasing DOA with an assumed decrease in DOA of 3 • example.

Fig. 8 .
Fig. 8. RM SE values for the random DOA change example.

TABLE I PERFORMANCE
SUMMARY FOR THE ENDFIRE REGION EXAMPLE.

TABLE III PERFORMANCE
SUMMARY FOR THE NON-ENDFIRE REGION EXAMPLE.

TABLE IV PERFORMANCE
SUMMARY FOR THE RANDOM INITIAL DOA EXAMPLE.

TABLE V PERFORMANCE
SUMMARY FOR THE INCREASING DOA FOLLOWED BY DECREASING DOA EXAMPLE.

TABLE VI PERFORMANCE
SUMMARY FOR THE INCREASING DOA WITH AN ASSUMED DECREASE IN DOA OF 3 • EXAMPLE.

TABLE VII PERFORMANCE
SUMMARY FOR THE RANDOM CHANGES IN DOA WITH AN ASSUMED INCREASE IN DOA OF 3 • EXAMPLE.