Abstract

In order to obtain the measurement parameters of the sea bottom geomorphology or underwater objects, the first step in side-scan sonar (SSS) image processing is bottom detection. Due to the complexity of the marine environment, the acoustic signals received by SSS are usually polluted by noises, which affect its image quality and make the extraction of image features difficult. To address this problem, this study proposes an automatic detection method for the sea bottom line based on the actual experimental acquisition of SSS images, which is supposed to support the autonomous underwater vehicle (AUV) for intelligent target detection and classification. The proposed method comprises four main steps. First, the raw SSS data is analyzed to obtain a grayscale image, and the blind zone boundary of the image is obtained using the threshold method. Then, the noise characteristics of the image are analyzed and the denoising algorithm is optimized to effectively remove high-frequency noise. Next, spatial-temporal matching calculations are performed on each ping port and starboard data, and the accurate coordinates of first bottom returns are obtained through extreme value detection. Finally, automatic and accurate detection of the bottom line is realized according to the smooth processing of the coordinate sequence of first bottom returns. The experiments have demonstrated the effectiveness of the proposed method. As the method does not require human intervention in adjusting parameters during operation, the proposed method with a certain time window imposed during image acquisition will be suitable for AUV missions when the SSS is determined.

1. Introduction

Side-scan sonar (SSS) is an active sonar system, which mainly consists of a transducer array, peripheral auxiliary sensors, and a real-time data acquisition processing module. The transducer array is the core component of the SSS, which is usually designed to be receive-transmit combined line array [1]. SSS uses the principle of echo bathymetry to record scattered echo intensity from the sea bottom and generate sonar images according to echo intensities [13]. The SSS images can be utilized to observe changes in parameters such as sea bottom geomorphology, presence of obstructions, and types of sea bottom substrate, and its features have important applications in fields such as marine scientific research, marine engineering, and marine military. In a SSS image, there is a clear junction line between the water column region and the seabed image region—called the sea bottom line, which is the distance from SSS to the sea bottom and an important parameter for measurements of the sea bottom topography or underwater objects, slant range correction, and SSS image grayscale equalization [1, 2, 4, 5].

Al-Rawi et al. [2] proposed two bottom detection methods: one to take the logarithm of the SSS echo data and use cubic spline regression algorithm for fitting and the other to filter the SSS data by sliding average filtering within a certain time window. Based on the data processed using these two methods, the center track line of the water column region is detected, and then, the first bottom return of each Ping data is obtained using the threshold method. Zhang et al. [6] solved the problem of the weak echoes of the sea bottom line in the actual SSS data by introducing the Laplacian of the Gaussian function, which is comparable to the bottom detection by commercial SSS data processing software. However, when the water noises are rather strong, the noises will be mistakenly extracted as the sea bottom line and it is difficult to obtain accurate bottom detection results, as the Gaussian smoothing filter can only eliminate the scales of intensity changes less than the smoothing factor. In view of the interferences of the SSS image in the water column region by the transmitting pulse, the sea surface echo, the wake, and the large area suspended matter, Zhao et al. [7, 8] proposed a last-peak detection method of the bottom line. Further, to tackle with ping data loss and ping observation anomaly that often exist in SSS measurements, Zhao proposed a Kalman filtering based sea bottom line repair method, which uses the asymptotic nature of the sea bottom variation and sea bottom line symmetry on SSS both sides. Ku et al. [9] measured single beam bathymetry and real-time depth of towed body using GPS time synchronization, and then, the bottom line was obtained by combination with these two kinds of data through interpolation and smoothing. Using actual measured data, the advantages of this method compared with the method proposed by Zhao et al. [7, 8] were verified. Moreover, the bathymetry measurement on this method was relatively less influenced by the environment and its accuracy was higher. According to the spatial characteristics of the continuous and dense distribution of the sea bottom line, Wang et al. [10] constructed a point set including the bottom points along the roughly parallel direction of the track and then clustered on the chain densely distributed along the track by the “density-based spatial clustering of applications with noise (DBSCAN)” algorithm for chain searching to extract sea bottom line. Compared with the traditional bottom detection method, this method also has good stability and anti-interference ability in relatively complex environment. Yan et al. [11] used a one-dimensional convolution neural network to traverse and identify the backscatter intensity sequence in the SSS data and established a complete processing method for real-time sea bottom line detection and tracking, which has strong robustness in the field test environment.

An autonomous underwater vehicle (AUV) is capable of carrying detection equipment to accomplish automatic tasks such as marine surveys and seabed mapping [1214]. Due to its advantages of high autonomy and low risk, AUVs are gradually being widely used, and their combinations with the SSS for underwater operations will become more frequent so that the superiority of SSS in target positioning and identification can be fully exploited [2, 12, 1416]. However, few of the published research studies have mature engineering applications for the automatic bottom detection in SSS images suitable for AUV missions.

One of the main objectives of this study is to develop an automatic bottom detection method in SSS images suitable for AUV missions. The remainder of this paper is arranged as follows. Section 1 briefly introduces the characteristics of an SSS image and detailedly analyzes the published research studies and results of bottom detection. In Section 2, the current problems of bottom detection in SSS image for AUV missions are stated. Section 3 elaborates the proposed bottom detection method and verifies its effectiveness using the SSS actual test data on an AUV. Section 4 summarizes this study.

2. Preliminaries and Problem Statements

For AUV missions, there are two main deficiencies in the current research on SSS bottom detection.(1)Under the hypothesis that there is either small and negligible or no blind zone between the port and starboard sonar transducer arrays, the installation of SSS is always taken as an ideal geometry. However, in actual working environments, the sonar transducers transmit sound waves to both sides, so there is a blind zone in the sonar image, and the space occupied by the blind zone in the sonar image should not be ignored. To avoid destroying the true geometry of the seabed image, it is necessary to consider eliminating the effects of the blind zone in the water column area.(2)The published research results were obtained through complex postprocessing processes [79] or high online computing capability for AUV [10, 11]. Moreover, due to the complex influence factors of SSS images, manual intervention is often required to set the initial value according to the situation, and most mature SSS image processing software, such as Triton and Prism, can only achieve reliable semiautomatic bottom detection [7, 10, 12, 17].

3. Automatic SSS Bottom Detection

3.1. Automatic Bottom Detection Procedure

Figure 1 shows the designed automatic bottom detection procedure for the SSS suitable for AUV missions. The procedure is described in four main steps.

3.1.1. Step 1. Obtaining the Blind Zone Boundary according to the SSS Parameters

In view of the geometry problem described in Section 2, first, it is necessary to calculate the parameters of the blind zone.

Since the blind zone is located directly below the SSS, the echo intensities of each ping are mutated from the blind zone to the outside, and the numerical values drop precipitously (see Figure 2). According to this feature, blind zone boundary detection can be realized. In general, the echo directly below the SSS is a strong positive emission wave, so a threshold can be used to detect the blind zone boundary detection.

Given a SSS, the ping data format is fixed (i.e., to 8, 16, or 32 bits). If the ping data bits of the SSS is N, then the threshold value for the blind zone boundary detection can be determined according to the measurement environment as follows:

From the blind zone to the outside, the first echo intensities of each ping on the port and starboard sides that are less than the threshold value are denoted as Kleft and Kright, respectively. As the transducer arrays are mounted symmetrically on the port and starboard sides of the AUV with respect to its center axis, the width of the blind zone of each ping, Km, according to the principle of bilateral symmetry, is determined as follows:

For each ping, the maximum sampling numbers of AUV left and right echo data are Kleftmax and Krightmax, respectively. Thus, the left echo data [Km, Kleftmax] and right echo data [Km, Krightmax] of all pings, which imply the seabed image region, can be obtained.

3.1.2. Step 2. Elimination of Anomalies in the SSS Port and Starboard Seabed Image Region

The median filtering algorithm can be used to remove the anomalies in SSS echo data, that is, the high-frequency noise and outliers [6, 9]. Herein, anomalies in both port and starboard SSS echo data are eliminated with an improved median filtering algorithm, and its calculation flow is as follows:(1)Calculate the intensity median of every sampling echo in each ping of the port and starboard SSS data.(2)Calculate the absolute intensity deviation of every sampling echo in each ping and determine the intensity value for the corresponding sampling echo based on its absolute intensity deviation. When the absolute intensity deviation is greater than a set value, the intensity median of the said sampling echo is taken as its new intensity value. Otherwise, no anomaly removal is performed and the intensity value for the said sampling echo is its sampling measured value. The above set value is a calculated value based on the echo intensity sampling values in each ping.

For example, for the SSS port echo data [Km, Kleftmax], let [i, j] indicate the index of the jth echo sampling measure of the ith ping, and let Iij and Pmax denote the intensity value of the [i, j]th port echo data and the total number of SSS pings, respectively. Then, for all [i, j] combinations satisfying 1 < i < Pmax and Km ≤ j < Kleftmax, the following processing is performed:(1) Calculate the intensity median Zij using the [i − 1, j − 1]th, [i − 1, j]th, [i − 1, j + 1]th, [i, j − 1]th, [i, j]th, [i, j + 1]th, [i + 1, j − 1]th, [i + 1, j]th, and [i + 1, j + 1]th sampling measured intensity values in port echo data, as well as the absolute intensity deviation Eij = abs (Iij − Zij).It is obvious from the above example that the number of adjacent pings and the number of adjacent echo sampling measures in each ping are both determined according to the actual needs. The same can be said for the numerical calculation of the intensity value of every sampling echo in each ping of the starboard SSS data, which is described subsequently.(2)Deciding absolute intensity deviation Eij [18]: if Eij > 9.78 + 1.44 × Iij − 8.86 × 10−4 ×  + 2.95 × 10−7 ×  − 3.26 × 10−11 × , then Zij is used as the new intensity value for the [i, j]th echo sampling measure in SSS port data; otherwise, no anomaly removal is performed.

For the SSS starboard echo data [Km, Krightmax], let [i, l] indicate the index of the lth echo sampling measure of the ith ping, and let Iil denote the intensity value of the [i, j]th starboard echo data. Then, for all [i, l] combinations satisfying 1 < i < Pmax and Km ≤ l < Krightmax, the following processing is performed:(1)Calculate the intensity median Zil using the [i − 1, l − 1]th, [i − 1, l]th, [i − 1, l + 1]th, [i, l − 1]th, [i, l]th, [i, l + 1]th, [i + 1, l − 1]th, [i + 1, l]th, and [i + 1, l + 1]th sampling measured intensity values in starboard echo data, as well as the absolute intensity deviation Eil = abs (Iil − Zil).(2)Deciding absolute intensity deviation Eil: if Eil > 9.78 + 1.44 × Iil − 8.86 × 10−4 ×  + 2.95 × 10−7 ×  − 3.26 × 10−11 × , then Zil is used as the new intensity value for the [i, l]th echo sampling measure in SSS starboard data; otherwise, no anomaly removal is performed.

3.1.3. Step 3: Computing Coordinates of First Bottom Returns

According to the principle of bilateral symmetry of the sea bottom line, the first bottom returns received by the transducer arrays on the port and starboard sides of the SSS come from directly below AUV, the distances of the first bottom return echoes are equal, and the detected sea bottom lines on both sides are symmetrical to the AUV track [7, 911]. The above principle allows for better sea bottom line detection.

Herein, calculation of the coordinates of first bottom returns involves the following flow:(1)The two spatial-temporal echo sampling measure sequences in each ping of SSS port and starboard data are matched and aligned.The issue of unequal maximum numbers, Kleftmax and Krightmax, of the port and starboard echo sampling measures in one ping results in different temporal and spatial information implied by Iij and Iil when l = j. To maximize the usage of the principle of bilateral symmetry of the sea bottom line and to achieve a combined comparison between the port and starboard measures, the above problem must first be solved for removing as many influences of the suspended matter as possible.In this study, bottom detection focuses on the water column region data and the mutated part of the seabed region data from the water column region. Therefore, to maintain the robustness of the bottom detection method, the SSS port and starboard echo data in each ping can be regarded as two spatial-temporal sequences for matching and alignment. When performing matching the alignment, consistency in the spatial-temporal information of mutations from the water column region to the seabed region should be maintained. Thus, the dynamic time warping (DTW) algorithm is used to achieve this goal [19], which includes the following processes:(a)For the ith ping, there are two spatial-temporal sequences, the port echo sampling measures {Iij, Km ≤ j ≤ Kleftmax}, and the starboard echo sampling measures {Iil, Km ≤ l ≤ Krightmax}. Take one echo from each of the two sequences and calculate the Euclidean distance D (Iij, Iil) between them, where Km ≤ j ≤ Kleftmax, Km ≤ l ≤ Krightmax. By calculating the Euclidean distances for all echoes in the two sequences, the Euclidean distance table can be constructed, as shown in Table 1.(b)Search for the shortest path in the above Euclidean distance table. In the above table, dynamic planning is used to search for the shortest path from node D (IiKm, IiKm) to node D (IiKleftmax, IiKrightmax). The shortest path must satisfy the condition that if the current node is D (Iij, Iil), then the next node must be selected among D (Iij+1, Iil), D (Iij, Iil+1), and D (Iij+1, Iil+1), and the chosen path must be the shortest.(c)According to the nodes on the shortest path through the Euclidean distance table, two discrete sequences can be obtained as aligned echo sampling measure pairs with the same sequence length recorded as Kmax.(2)Perform logarithm transformation on the matched and aligned port and starboard echo sampling measures in each ping; that is, take the base 10 logarithm.(3)Calculate the standard deviation of the logarithmic sequence of the port and starboard echo sampling measures in each ping, and use them to construct a composite sequence of this ping.Set a window to W, with a value as small as possible, to avoid flooding the mutant features of the first bottom return. Empirically, W can be calculated as follows:By calculating the standard deviation of the logarithmic sequence of the port and starboard echo sampling measures in each ping, two standard deviation sequences, denoted as {Liq, 1 ≤ q ≤ Kmax − W +1} and {Lis, 1 ≤ s ≤ Kmax − W +1}, are obtained, where the sequence length is Kmax − W + 1.In SSS echo data, there may be echo anomalies in the water column region, which will cause large fluctuations during the calculation of its standard deviation sequence and, in turn, will affect the accuracy of subsequent extreme value detection. Therefore, to maximize the usage of the principle of bilateral symmetry of the sea bottom lines, we further take the minimum value of the port and starboard logarithmic sequence standard deviation of each ping. Consequently, for any s = q, 1 ≤ q ≤ Kmax − W + 1, and 1 ≤ s ≤ Kmax − W +1, take Lig = min (Liq, Lis), and a composite sequence of ith ping {Lig, 1 ≤  ≤ Kmax − W +1} is constructed.(4)Perform extreme value detection on the composite sequence of each ping to obtain the extreme point coordinates.Based on the characteristics of the SSS data, the sea bottom line comprises the first strong sea bottom echoes on the port and starboard side. Thus, the mutation of the echo intensity from the water column region to the seabed region will result in the first maximum value in the composite sequence {Lig, 1 ≤  ≤ Kmax − W + 1}. Therefore, the extreme point coordinates, denoted as , can be obtained by performing extreme value detection.(5)The coordinates of the port and starboard first bottom returns are obtained using the above extreme point coordinates.

The coordinates qb and sb of the standard deviation sequences corresponding to the logarithm of the obtained port and starboard echo sampling measures can be obtained using the extreme point coordinate . In turn, the echo intensity Ibleft corresponding to the coordinate qb + W − 1 and Ibright corresponding to the coordinate sb + W − 1 of the matched and aligned port and starboard sampling measure sequences are obtained by searching algorithms:

{Iij, Km ≤ j ≤ Kleftmax}, and the starboard echo sampling measures {Iil, Km ≤ l ≤ Krightmax}.

For each ping, by searching the port sequence {Iij, Km ≤ j ≤ Kleftmax} and finding the first echo intensity value that equals Ibleft, the coordinate of the first bottom return is derived as jb. Similarly, by searching the port sequence {Iil, Km ≤ l ≤ Krightmax} and finding the first echo intensity value that equals Ibright, the coordinate of the first bottom return is derived as lb.

3.1.4. Step 4: Extraction of Sea Bottom Line

Considering the asymptotic nature of the sea bottom variation, precise sea bottom lines can be detected by sliding average filtering of the coordinates of the port first bottom returns {jb} and the coordinates of the starboard first bottom returns {lb} of all pings obtained through the above calculation [2].

3.2. Experimental Results and Discussion

In this experiment, the Sea Scan® ARC SCOUT Mk-II from the Marine Sonic Technology mounted on an AUV is used. The main operating parameters of the SSS are set to a ping frequency of 900 kHz and a slant range of 10 meters. Raw data of all pings are collected through actual measurements in the offshore test.

The waterfall plot of the SSS echo data in reverse of the image displayed at 256 grayscale levels is shown in Figure 3. The corresponding separate SSS images for the port and starboard sides with blind zone are shown in Figures 4(a) and 4(b).

A waterfall plot of considerable echo data intercepted after the ping starting in Figure 3 is shown in Figure 5, where A1, A2, A3, A4, and A5 indicate the sea bottom line, sea surface line, blind zone, water column region, and seabed image region, respectively. It can be seen obviously that the darker the color, the stronger the echo intensity, and, conversely, the whiter (brighter) the color, the weaker the echo intensity.

Echo sampling measures of a single ping are shown in Figure 2. After removing the blind zone according to Step 1 in Section 3.1, separate SSS images of the port and starboard sides without showing the blind zone can be obtained, as shown in Figure 6.

In Figure 6, it can be seen that there are many anomalies in the echo data which may interfere with the detection of the sea bottom line and even cause false detection in complex environments, such as suspended matter present in the water column region. These echo anomalies are essentially high-frequency noise and outliers that can be effectively removed from the image during data processing according to an improved median filtering algorithm in Step 2 in Section 3.1, as shown in Figure 7.

The separate SSS images of the port and starboard sides without the blind zone after high-frequency noise and outliers filtering are shown in Figure 7. Compared to Figure 6, the echo anomalies in the water column region can be eliminated to some extent.

Processed with Step 3 and Step 4 in Section 3.1, the detected sea bottom line marked separately on the port and starboard sides without showing the blind zone is shown in Figure 8, while the sea bottom line marked in the waterfall of the SSS image without showing the blind zone is shown in Figure 9.

4. Conclusions

In this study, the automatic bottom detection method of SSS image for AUV missions is presented. The existence of blind zone in the SSS image is taken into account, and the blind zone boundary of the SSS image is calculated based on the SSS parameters conveniently. The anomalies in SSS image are eliminated effectively using an improved median filtering algorithm. Further, to perform extreme value detection according to the principle of bilateral symmetry of the sea bottom lines, the SSS port and starboard echo data sequences in each ping are retained by using the logarithm of the space-time align-matched data. Therefore, an accurate sea bottom line is obtained through a sliding average filter with the coordinate sequences of the first bottom returns. The experimental results show that, without manual intervention to set the initial values online, the proposed method can overcome the strong noise interferences and achieve high detection accuracy and good detailed continuity in the process of sea bottom line detection.

Data Availability

The underlying data supporting the results of our study can be found at https://www.researchgate.net/profile/Huapeng_Yu.

Conflicts of Interest

The authors declare no conflicts of interest.

Acknowledgments

This research was supported by the project of the National Natural Science Foundation of China (Grant no. 61803381).