Mapping Post-Earthquake Landslide Susceptibility: A U-Net Like Approach

Chen, Yu; Wei, Yongming; Wang, Qinjun; Chen, Fang; Lu, Chunyan; Lei, Shaohua

doi:10.3390/rs12172767

Open AccessArticle

Mapping Post-Earthquake Landslide Susceptibility: A U-Net Like Approach

¹

CAS Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China

²

Key Laboratory of the Earth Observation of Hainan Province, Aerospace Information Research Institute, CAS, Hainan Research Institute, Sanya 572029, China

³

College of Computer and Information Sciences, Fujian Agriculture and Forestry University, Fuzhou 350002, China

⁴

School of Geography, Nanjing Normal University, Nanjing 210023, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2020, 12(17), 2767; https://doi.org/10.3390/rs12172767

Submission received: 22 July 2020 / Revised: 18 August 2020 / Accepted: 24 August 2020 / Published: 26 August 2020

(This article belongs to the Special Issue Slope Stability Monitoring and Investigation Using Remote Sensing Techniques)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

A serious earthquake could trigger thousands of landslides and produce some slopes more sensitive to slide in future. Landslides could threaten human’s lives and properties, and thus mapping the post-earthquake landslide susceptibility is very valuable for a rapid response to landslide disasters in terms of relief resource allocation and posterior earthquake reconstruction. Previous researchers have proposed many methods to map landslide susceptibility but seldom considered the spatial structure information of the factors that influence a slide. In this study, we first developed a U-net like model suitable for mapping post-earthquake landslide susceptibility. The post-earthquake high spatial airborne images were used for producing a landslide inventory. Pre-earthquake Landsat TM (Thematic Mapper) images and the influencing factors such as digital elevation model (DEM), slope, aspect, multi-scale topographic position index (mTPI), lithology, fault, road network, streams network, and macroseismic intensity (MI) were prepared as the input layers of the model. Application of the model to the heavy-hit area of the destructive 2008 Wenchuan earthquake resulted in a high validation accuracy (precision 0.77, recall 0.90, F1 score 0.83, and AUC 0.90). The performance of this U-net like model was also compared with those of traditional logistic regression (LR) and support vector machine (SVM) models on both the model area and independent testing area with the former being stronger than the two traditional models. The U-net like model introduced in this paper provides us the inspiration that balancing the environmental influence of a pixel itself and its surrounding pixels to perform a better landslide susceptibility mapping (LSM) task is useful and feasible when using remote sensing and GIS technology.

Keywords:

landslide susceptibility mapping (LSM); convolutional neural network (CNN); U-net; post-earthquake; support vector machine (SVM); logistic regression (LR)

1. Introduction

Serious earthquakes, particularly those that occurred in mountainous regions can trigger thousands of landslides because of unfavorable geomorphic environments [1]. These landslides could damage roads, crush buildings, block rivers, etc. [2,3]. Remote sensing images acquired by satellite, airborne, or UAV (unmanned airborne vehicle) have been widely used to investigate landslides, especially high spatial resolution imagery [4]. However, some previous stable slopes without showing an obvious displacement during an earthquake may slide latter when influenced by subsequent rainfall, human activities, or other factors, which could threaten human’s lives and properties for many years. For example, after the 2005 Kashmir earthquake [5], the 1999 Chi-Chi earthquake [6], and the 2008 Wenchuan earthquake [7], they are all observed that the landslide area and number showed a surge tendency. Therefore, mapping the post-earthquake landslide susceptibility is very valuable for a rapid response to earthquake in terms of relief resource allocation and posterior earthquake reconstruction.

Many methodologies have been developed for landslide susceptibility mapping (LSM) based on remote sensing and GIS [8], including heuristic, general statistical and machine learning methods. Heuristic methods usually conducted by experts mainly include the analytical hierarchy process (AHP) [9], the expert knowledge system [10], and the gray relational method [8,11]. The experience of experts and their cognition to study site could greatly influence the LSM result. The general statistical method uses simple statistical relationships between landslides and the influencing factors, which can be logistic regression (LR), a frequently chosen method [12,13], or other statistical methods such as the information value method [14,15], weights-of-evidence method [16,17,18], or frequency ratio method [19,20,21], etc. Machine learning methods such as support vector machine (SVM) [22], decision tree (DT) [23], random forest (RF) [24,25], and artificial neural networks (ANN) [26] have been more popular in the recent years because of their ability of modeling the complex nonlinear relationship between landslides and the influencing factors. Generally, these machine learning methods perform better in LSM than the heuristic and statistical methods. In addition, some methods hybrid two or more above methods have also been developed. For example, an adaptive neuro-fuzzy inference system (ANFIS) method, which combines fuzzy and neural network methods performance well on the LSM task [22,27,28,29]. Chen [30] proposed a method which combines expert knowledge, neuro-fuzzy inference systems, and evolutionary algorithms for spatial modeling of landslide susceptibility. Nonetheless almost all the methods mentioned above are pixel- based without considering the spatial structure information of possible influencing factors [31].

Due to the interaction of adjacent pixels, whether a pixel slides or not in the field is closely related to itself and the surrounding pixels, which means the adjacent pixels’ spatial structure information of possible influencing factors are important for LSM task. Convolution neural networks (CNN) have great ability to mine the spatial structure information, which has been used in many image processing tasks. Recent literatures have demonstrated the strong performance of CNN for LSM. For example, Sameen [32] developed an one-dimensional convolutional network (1D-CNN) for an LSM task in Southern Yangyang Province, South Korea, which showed better performance than ANN and SVM methods; Wang [33] observed that a CNN showed performance equivalent to or better than an optimized SVM for LSM tasks in Yanshan County, China. As one of the famous CNN models, U-net has been very successful in many semantic segmentation tasks [34,35,36,37,38] but no literatures have reported the application of U-net for LSM. A U-net model consists of an encoding path and a decoding path and connects the two paths to compensate for the information eliminated by decoding because fine information may be lost by pooling and deconvolution [35,39]. This characteristic is important for LSM tasks as the original influencing factors of each pixel may be weakened during the convolution and pooling operations.

The main goal of this paper is to develop a U-net like model for assessing post-earthquake landslide susceptibility. To achieve this, an U-net like model architecture for post-earthquake LSM was built and discussed to perform better results; the model was trained, validated, and tested on the data collected in a heavy-hit area of the destructive 2008 Wenchuan earthquake; results from the U-net like model were compared with those from traditional LR and SVM models to evaluate their performance.

2. Materials and Methods

2.1. Study Area

The study site (Figure 1) is a mountainous area prone to earthquakes. In the history, 44 earthquakes with the U.S. Geological Survey (USGS) magnitude value above 4.5 have happened during the date1 January 1900 to 12 May 2008 (the location were shown in Figure 1). The heaviest historical earthquake of the study area was happened on 28 August 1933 with the Mw7.3 in Mao county. On 12 May 2008, Wenchuan earthquake occurred with the Mw 7.9 (Ms 8.0) and 19 km depth. This earthquake was characterized mainly by thrust motion with right-lateral strike slip [40], the max peak ground acceleration (PGA) value 1.15 g according to USGS. It was the most destructive event since the 1976 Tangshan earthquake, leaving 69,200 dead, 18,195 missing, 374,216 injured, 5,362,500 houses collapsed, 21,426,600 houses badly damaged, and more than five million people made homeless [41]. This earthquake originated from the collision between the Indian and Eurasian plates [42], and directly caused a huge number of landslides due to the high and steep slopes in mountainous areas of 13 counties [2,43].

2.2. Remote Sensing Data

2.2.1. Pre-Earthquake Data

For the purpose of post-earthquake landslide susceptibility mapping we should use the pre-earthquake data to train the model so that we could predict the susceptibility of the whole study area. The original pre-earthquake Landsat remote sensing data was used to represent the land use/land cover and vegetation cover conditions which often used as influencing factors in LSM task.

Google Earth Engine (GEE) provides free and publicly accessible historical remote sensing images at a global scale including those collected Landsat, MODIS, and Sentinel [44] sensors. In addition, Google provides an efficient web-based interactive development environment (IDE) that enables rapid and easy accessing these data according to different requirements [45].

Given the date for the main earthquake is 12 May 2008, remote sensing data before this date are pre-earthquake data. With the help of Google Earth IDE, we conveniently created a pre-earthquake Landsat dataset for the study area based on the U.S. Geological Survey (USGS) Landsat 5 Surface Reflectance Tier 1 dataset. The dataset had 278 scenes acquired from 12 May 2005 to 11 May 2008. We composited all the images to a single scene for which a median method was used to minimize the effects of cloud and cloud shadows [45]. The final dataset contained 7 bands including 4 visible and near-infrared (VNIR) bands, 2 short-wave infrared (SWIR) bands and one thermal infrared (TIR) band. The VNIR and SWIR bands have a resolution of 30 m/pixel. The original TIR band at 120 m/pixel has been resampled to 30 m/pixel (https://developers.google.com/earth-engine/datasets/catalog/LANDSAT_LT05_C01_T1_SR#description). Figure 2 shows a color composite of the pre-earthquake Thematic Mapper (TM) images.

2.2.2. Post-Earthquake Data

Airborne images were acquired using Leica ADS40/80 digital cameras after the Wenchuan earthquake. The advanced airborne ADS40/80 camera has a large field of view (FOV) of 64 degrees and an instantaneous field of view (IFOV) of 0.1 mrad. The camera has a panchromatic band and four spectral bands including blue, green, red, and near infrared. The spatial resolution of these images ranges from 0.3 to 0.5 m, depending on the flight altitude involved. Eight image strips obtained from 15 May 2008 to 28 May 2008 were used for identifying the post-earthquake landslides. No. 1 to 7 images (listed on Table 1 and shown on Figure 2) were used as the model area for the development of the model while No. 8 image for independent testing.

2.3. Landslide Inventory

The serious earthquake induced large number of landslides. The good vegetation coverage in the study area and the high spatial resolution (0.3–0.5m) of post-earthquake airborne images make it easy to identify these landslides. More than 5 experienced researchers from the Chinese Academy of Sciences (CAS) performed visual interpretations of landslides for about one week. The interpretation result was verified in the field showing a validation accuracy more than 98% [46,47]. The total number of interpreted landslides was 1148, with the type of which including slide (392), rock fall (535), and debris flow (221), respectively. Most of the interpreted landslides only contain depletion zones, but sometimes they comprise track or accumulation zones, because some of them are mixed and difficult to distinguish. The average area of these landslides was 58,500 m² with the largest area being 153,000 m² and the smallest area being 715 m². The landslide distribution and photographs of typical landslides in the field were shown in Figure 3.

2.4. Landslide Influencing Factors

Spatial distribution of the post-earthquake landslides is significantly controlled by the surrounding topography, geology, human activity conditions. These conditions combined with the earthquake activity in different layers were taken as influencing factors of the post-earthquake landslides. All these influencing factors were prepared in GIS using ArcGIS 10.6.

2.4.1. Topography

Topography affects the post-earthquake landslides in many aspects. It is widely accepted that a high angle slope is prone to landslides because a larger inclination increases the shear stress on soil [48]. Elevation and aspect impact vegetation cover, moisture retention and soil strength which can influence landslide initiation [49,50]. In addition, because of the direction of the seismic wave propagation, different altitudes and aspects may suffer different movements [48,51]. In this paper, the slope and aspect were both computed from the SRTM digital elevation model (DEM) using ArcGIS10.6. The slope value ranges from 0 to 87° instead of the true degree of the slope. The aspect is measured clockwise in degrees from 0 (due north) to 360 (again due north), coming full circle. Flat areas are given a value of −1.

Global ALOS multi-scale topographic position index (mTPI) was also used. The mTPI can be used to distinguishes ridges from valleys which contribute to the landslide occurrence. It is calculated using elevation data for each location subtracted by the mean elevation within a neighborhood. The neighborhood radius (km) are 115.8, 89.9, 35.5, 13.1, 5.6, 2.8, and 1.2, respectively [52]. The dataset is available from GEE (https://developers.google.com/earth-engine/datasets/catalog/CSP_ERGo_1_0_Global_ALOS_mTPI#description). The topography information of post-earthquake landslides is shown in Figure 4.

2.4.2. Lithology and Fault

Geology conditions such as lithology can affect the strength and permeability of the surface and sub-surface material which add the occurrence probability of landslide [53]. Lithology and faults can affect the rock fragmentation which can be an important influencing factor of the post-earthquake landslides [53,54]. The lithology and fault were extracted from the geological map with the scale 1:200,000 (Figure 5 and Table 2).

2.4.3. Human Activity

In mountainous areas, road construction often results in excavation of slope toe while stream can erode the toe of mountain and saturate the slide due to increase in water infiltration. All of these can lead to the destabilization of slope and eventually sliding [53]. The road and stream network were obtained from the National Geomatics Center of China (NGCC) with the scale 1:50,000 (Figure 6).

2.4.4. Seismic Parameters

In very short time after a serious earthquake, the USGS can produce a series of earthquake products such as macroseismic intensity (MI), peak ground acceleration (PGA), and peak ground velocity (PGV) maps [55,56]. The map MI is calculated by PGA, PGV and internet users’ shaking and damage reports which could more effectively reflect the damage of the ground surface by the earthquake. So, we used the map MI (Figure 7) as an influencing factor to evaluate post-earthquake landslides susceptibility (https://earthquake.usgs.gov/earthquakes/eventpage/usp000g650/shakemap/intensity).

2.5. U-Net Like Model for Post-Earthquake LSM

2.5.1. Traditional CNN and U-Net Model

Traditional CNN models consist of one or more convolution, pooling, or fully connected operation layers. The convolution operation is to extract different features from the input layer. The pooling operation is to reduce the dimensionality of feature maps and make the model more concern about the existence of certain features rather than precise location of the features. The fully connected layer reorganizes extracted features to map to the final outputs. CNN model was first time trained by Lecun Y. using backpropagation method for the task of classifying images of handwritten digits in 1989 [57,58]. Then many scholars successfully developed different CNN models such as LeNet [59], AlexNet [60], GoogleNet [61], VGGNet [62], and ResNet [63] et al. These traditional CNN models have had great success on image classification tasks, where the typical output to an image is a single class label. However, on image segmentation tasks, the class label is supposed to be assigned to each pixel. For this purpose, sliding-window to each pixel [64] or adding decoding path [65] to CNN architecture could be used. The later method has developed rapidly in recent years due to higher efficiency, and U-net model is one of them. The U-net model (Figure 8) consists of an encoding path and a decoding path and connects (copy and crop) the two paths. The encoding path usually includes many convolutional and max pooling layers while the decoding path includes convolution and up-convolution layers. U-net has been very successful in many semantic segmentation tasks [34,35,36,37,38], but no literatures have reported the application of U-net for LSM.

2.5.2. Model Architecture

Unlike traditional semantic segmentation tasks, the input layers for LSM can be regarded as an image with several landslide influencing factors. Each pixel of the input layers is represented by a set of features which are defined by the landslide influencing factors. Due to the interaction of adjacent pixels, whether a pixel slides or not in the field is closely related to itself and the surrounding pixels, which means the adjacent pixels’ spatial structure information of possible influencing factors are important for LSM task. Traditional CNN model can mine the spatial structure information for LSM task, but the original influencing factors of each pixel may be weakened during the convolution and pooling operations. So, in this paper, we develop a U-net like model for mapping post-earthquake landslide susceptibility because U-net model connects encoding path and decoding path to compensate for the information eliminated by decoding [35,39]. What we are concerned about is how to balance the influence of a pixel itself, and its surrounding pixels, to perform an optimal LSM task.

First, the model cannot have too many convolution and pooling layers. For each pixel in the image, a 3×3 size kernel convolution operation means this pixel would be affected by 8 pixels around it. So, the more convolution and pooling layers, the more affected by the surrounding pixels. However, for the LSM task, too many convolution and pooling layers may cause a lot of noise and make the model difficult to train.

Second, the original input layers and the last convolution layers should be connected (copy and crop) because the original influencing factors of each pixel are the most important factors for landslides occurrence.

In this study, we finally constructed a U-net like model for post-earthquake LSM shown in Figure 9. The number of input landslide influencing factors in this study was 16 as described in Section 2.5.3. We set the number of first convolution layers to the same value and double it after pooling operation as traditional U-net model does. Totally three convolution layers and one max pooling layer were induced in this model. After each convolution operation, the Batch Normalization function was used to normalize the feature map and dropout at a rate of 0.2 to avoid overfitting. All the convolution layers use the ReLU activation function while the final output layer uses the Sigmoid function to ensure an output range between 0 and 1, indicating the landslide susceptibility.

2.5.3. Input and Output

The input layers detailed in Table 3 include pre-earthquake Landsat TM images with 7 bands and the influencing factors listed in Section 2.4 such as DEM, slope, aspect, mTPI, lithology, fault, road network, stream network, and MI. Among these influencing factors, faults, road, and stream network were represented by polylines, the Euclidean distance to the closest source was used to calculate their potential influence on landslides. The lithology is a categorical variable, and was assigned to be a dummy variable as described in the literature [66]. In this way, 14 dummy variables were used to represent 14 lithologies listed in Section 2.4.2. In total, there were 29 input layers, their original data values were shown in Table 3 and all were normalized to the range 0–1 when training in the model.

For a fully convolutional network, the sizes of input and output layers have no influence on the result of the model. In our U-net like model the output label size was set to 2×2-pixels meaning the input layers size was 10×10-pixels. The use of such a small size layer could conveniently control the number of landslide samples when training the model.

2.5.4. Training, Validation, and Independent Testing

In the model area, the landslide pixels and non-landslide pixels were extremely unbalanced (the ratio was about 1:33) because of some very small size landslides. To accommodate both landslide and non-landslide pixels, two sampling approaches were used. First, 5000 label images with 2 × 2-pixels in the modelled area were selected randomly. Second, another 5000 images with at least 2 landslide pixels in the label area were randomly selected. By this way, approximately one-third of the landslide pixels and 1/110 of the non-landslide pixels were included so that for modeling the ratio of landslide and non-landslide pixels was about 1:2.26. These model data were split to create the training (75%) and validation (25%) datasets for the U-net like model. Precision, Recall, F1 score, and relative operative characteristics (ROC) curve were used to evaluate the accuracy of the model. There is also another method to use all the landslide pixels in the model area and the corresponding number of non-landslide pixels as samples and split them to train and validate the model. However, the model has brought in the surrounding pixels to convolution operating, it might bring redundancy if the adjacent landslide pixels were both conducted as independent samples for training.

Precision, recall and F1 score

Precision p is the number of true positive (TP) divided by the number of all test positive results (true positive (TP) + false positive (FP)) as shown by Equation (1), and recall r is the number of TP divided by the number of all positive samples (true positive (TP) + false negative (FN)) as defined in Equation (2). F1 score is a measure of a test’s accuracy taking into account both the precision p and the recall r of a test. The F1 score Equation (3) is the harmonic mean of the precision and recall with 1 meaning a perfect precision and recall.

p = \frac{TP}{TP + FP}

(1)

r = \frac{TP}{TP + FN}

(2)

F 1 = 2 * \frac{p * r}{p + r} = \frac{2 TP}{2 TP + FP + FN}

(3)

Relative operative characteristics (ROC)

The receiver operation characteristic (ROC) curve is a graphical representation of the relationship between the sensitivity and specificity of a laboratory test over all possible diagnostic cutoff values. It reflects the corrections between the “1-Specificity” Equation (4) and “Sensitivity” (equivalent to recall) [8]. We generally use the area under the ROC curve (AUC) to reflect the total accuracy of these models. A greater AUC value means a higher prediction performance [33,67].

1 - S p e c i f i c i t y = \frac{FP}{FP + TN}

(4)

To compare the performance of the model with other models, another 2 test datasets were prepared in the similar way. Test dataset 1 contained 10,000 landslide pixels and 10,000 non-landslide pixels. Test dataset 2 came from the airborne imagery shown as the yellow polygon in Figure 10 and contained 3000 pixels with equal landslide and non-landslide pixels. The ROC curve and confusion matrix (CM) were used to compare the performance of the U-net model and traditional logistic regression (LR) and support vector machine (SVM). LR is widely used for predicting the presence or absence of an outcome based on values of a set of predictor variables [68]. The probability ρ of a positive outcome is transformed from the interval (0,1) to its logit ln(ρ/(1-ρ)) [12]. The SVM algorithm is built upon a hyperplane or a set of hyperplanes in a high-or infinite-dimensional space, and can be used for separating landslide and non-landslide cases. The two models have been successful used in many LSM tasks [68,69,70,71,72,73].

3. Results

3.1. Spatial Analysis of Landslides

The spatial analysis between different influencing factors and post-earthquake landslides was performed. Figure 11 shows the relationship between landslide and pre-earthquake Landsat TM data. Each Landsat TM band was reclassed to six sub-categories based on the quantile method in the ArcGIS 10.6 toolbox. The radar maps in Figure 11 were calculated using the ratio of landslide area and each sub-category area. Almost all the seven Landsat TM bands have higher landslide susceptibility on the area of higher surface reflectance value sub-category, which mostly represent barren land. Figure 12 shows the relationship between landslide and other influencing factors. Similarly, the higher slope angle, MI value, and the distance closer to fault, road, stream all show higher landslide susceptibility. East and southeast direction of slope aspect have higher landslide susceptibility than the other slope aspects. The lower mTPI value, which represent the bottom of the valley has higher landslide susceptibility. Lithology from Cambrian with Metamorphic grit and Limestone also have a high susceptibility to landslide.

3.2. LSM Result of U-Net Like Model

Our U-net model was developed using the TensorFlow 2.0 software with python. TensorFlow is a flexible and scalable software library which enables users to efficiently program and train neural network and deploy them to production [74]. The hardware environment of this study was NVDIA Quadro P2000 graphics card, Intel i7-8850H processor, and the memory was 32 G. The Adam optimizer with default learning rate 0.001 and binary cross entropy loss were used for training the model. We set the batch size to 100 and it took 3 h 25 min to train our U-net like model with an early stop (patience = 10) at 23rd epoch (the max epoch value was set to 100). The training and validation results are shown in Table 4. The validation precision 0.77 means that for the areas having the susceptibility value above 0.5, 77% of them were truly landslide pixels. The validation recall value 0.90, means 90% of the total existed landslides pixels were predicted to have a susceptibility above 0.5. Both F1 score and AUC are higher than 0.83 meaning a strong performance with 1 being a perfect performance.

All the training and validation indexes listed in Table 4 were based on the susceptibility value threshold 0.5 which was commonly used in a binary classification method. In order to examine the LSM result in more detail, we divided the result into six levels based on the landslide susceptibility value (marked as ls) i.e., extremely low (ls < 0.1), very low (0.1 ≤ ls < 0.3), low (0.3 ≤ ls < 0.5), high (0.5 ≤ ls < 0.7), very high (0.7 ≤ ls < 0.9), and extremely high (ls ≥ 0.9) as shown in Figure 13. The percentages of total area and landslide area of each level in the model area were shown in Figure 14 where the three low levels (ls < 0.5) and three high levels (ls ≥ 0.5) represent the non-landslide and landslide area, respectively. The most important application of a landslide susceptibility map is to guide users to find the place prone to landslide. Hence, the area of high susceptibility level should not be too large so that it could efficiently arrange engineers to investigate these prone-landslide areas. In this result, the “extremely high” and “very high” level areas were 4.56% and 6.74% of total model area, respectively, but accounted for 42.31% and 31.58%, respectively, of exist total landslides. Therefore, we have strong confidence that the areas at the “extremely high” and “very high” levels are prone to landslide in future and should be paid more attention.

3.3. Compare with LR and SVM Models

In this study, the traditional LR and SVM were compared with the U-net model for their performance. The LR model was implemented in the TensorFlow software based on the Sigmoid function while the SVM model was programmed using the regression learner app of MATLAB 2019b software. The cubic kernel function and box constraint value 2.17 were finally confirmed to be the best based on Bayesian optimizer for SVM model. The input dataset for LR and SVM models was the same as the U-net model but without convolution and pooling. The time consumption of LR and SVM model were less than 10 min as they based on different algorithm implementation.

The results for the LR and SVM models to predict the landslide susceptibility of the whole study area are shown in Figure 15. Figure 16 shows the ROC curves for the three models tested on test dataset 1 (Figure 16a) and test dataset 2 (Figure 16b). The AUC was calculated showing that the U-net model performed better than LR and SVM models on the two datasets. Figure 17 shows the airborne imagery and landslides distribution (Figure 17a) in test area 2 as well as the LSM results by U-net (Figure 17b, LR (Figure 17c) and SVM (Figure 17d), respectively. Compared with LR and SVM models, the U-net model resulted in an LSM map showing a high accuracy and fine detail which should be more reliable to use. For example, the southeast of the study area is the Chengdu Plain in which the stronger performance of the U-net model is evident. Figure 18 shows the comparation of the LSM results by the three models in the Chengdu Plain. Both LR and SVM models predicted that the landslides in this area should have “high” or even “extremely high” susceptibility level (Figure 18c,d). However, the area is very flat with some cities and farmlands, and thus should not be favorable to the occurrence of landslides.

In order to further compare different model results, we used confusion matrix (CM) based on six levels on the landslide susceptibility value (marked as ls) i.e., ls < 0.1, 0.1 ≤ ls < 0.3, 0.3 ≤ ls < 0.5, 0.5 ≤ ls < 0.7, 0.7 ≤ ls < 0.9, and ls ≥ 0.9 as shown in Table 5 where the three low levels (ls < 0.5) and three high levels ( ls ≥ 0.5) represent the negative (non-landslide) and positive (landslide area), respectively. The total accuracy (acc) was calculated as defined in Equation (5), where the TP is the sum number of pixels which predict value ls ≥ 0.5 and true value equal 1, the TN is the sum number of pixels which predict value ls < 0.5 and true value equals 0, FP is the sum number of pixels which predict value ls ≥ 0.5 and true value equal 0, and FN is the sum number of pixels which predict value ls < 0.5 and true value equals 1. Although U-net model did not show a significant better performance in the test area 2 based on total accuracy (79.70% vs 78.90 and 79.20), it has a higher TP and lower FP on predict “extremely high” and “very high” levels which could make the engineers efficient to investigate these prone-landslide areas. The predict value of LR model was more distributed around 0.5 in test area 1 and 2, which make the engineers difficult to arrange work according to urgency.

a c c = \frac{TP + TN}{TP + TN + FN + FP}

(5)

4. Discussion

For a deep learning model, there are many factors may affect the performance. In this section, we will discuss several aspects of them which could be better for further study of the U-net like model on LSM task.

4.1. Sample Balance for Model Input

The area considered by model involved was about 2265 km² covering about 2.5 million pixels at the spatial resolution 30 m. The number of the non-landslide pixels was more than 2.4 million while the number of the landslide pixels was less than 0.1 million. The unbalanced samples resulted in the regression result bias toward the non-landslide samples. The extreme case is to output all pixels to non-landslide with the value 0 in our study. Obviously, this would not make any sense. Currently, two strategies are often used to deal with this problem: balance the sample or adjust the penalty weights of two types of error [75]. In this paper, we simply reduced the number of non-landslide pixels. Finally, 10,000 2 × 2-pixels (totally 40,000 pixels) with the ratio of landslide and non-landslide being about 1:1.26 were used to train the U-net model and gave rise to a good result. The method of adjusting the penalty weights could also be considered to improve the performance. A simple way is to use the weighted cross entropy with logits function to compute a weighted cross entropy. The TensorFlow software has such a function to allow one to trade off recall and precision by up- or down-weighting the cost of a positive error relative to a negative error.

4.2. Total Convolutional Size of Model Architecture

A traditional U-net model usually use many convolution and pooling layers to present the high-level semantic features of picture for efficiently identifying objects showing a well-defined shape. However, for the purpose of landslide susceptibility mapping, too many convolutional and pooling layers may introduce a lot of noise and make the model difficult to train. In this study, our model used convolution operation (3 × 3 size kernel) three times and pooling operation one time as shown in Figure 9. Hence, each output pixel would be affected by four pixels farthest from it. We considered this size as the total convolution size (TCS) of the model, which is represented by the TCS4 red polygon in Figure 19a. It was hard to determine the best TCS for our study area as the performance of the model could be affected by any change to the model. In this study, we kept all parameters of the model fixed except the convolution kernel size of the first convolution layer (marked as red dotted polygon in Figure 9). If we change the kernel to 5 × 5 size, the TCS becomes five, and the kernel size is 1 × 1, the TCS is equal to three. Meanwhile, to use the same 2 × 2 size pixels as the output samples, the input image size turned to be 12 × 12 pixels for TCS5 and 8 × 8 pixels for TCS3, respectively (Figure 19a). Figure 19b shows the binary cross entropy loss curve of different TCS of the model area from which we could find that the use of TCS4 resulted in a better performance than those using TCS3 and TCS5. The reason for TCS4 to perform better for this study area could be complicated. An interesting finding is that TCS4 was closer to the average landslide area of the model area, which needs more investigation in the future.

4.3. Pixel Itself or Surrounding Pixels for LSM Task

Traditional heuristic, general statistical, and machine learning methods used for LSM tasks are mostly pixel-based without considering surrounding pixels. A few other methods considering slopes as a single unit instead of a single pixel usually lead to a mapping result less smooth than that by pixel-based methods [76,77,78] because it is not always possible to divide different slope units. CNN models make it possible to mine the spatial structure information from surrounding pixels by use of convolution or pooling. However, these operations may introduce noise and make it difficult to train a model. Then the question is how to balance the influence of a pixel itself and its surrounding pixels to perform an optimal LSM task. In this paper, an improved U-net model has been developed to balance this influence by connecting (copy and crop) the original input layer and the last convolution layer. To demonstrate the advantage of the proposed U-net model, two changes to the model architecture were tested. First, the copy or crop operation was eliminated and replaced with the corresponding convolution channels. For example, the black layers in the blue dotted box (Figure 9) could be removed and the number of white color convolution layer channels changed from 16 to 45. This would result in a model architecture similar to a traditional CNN model described in literature [33] considering more the influence of surrounding pixels. The other change to the U-net model was only considering the pixel itself by setting all the convolution and pooling kernel size to 1×1, and this would lead to a model similar to traditional ANN model as described in literature [26]. We call these two models CNN and ANN, respectively. Figure 20 shows the curve of binary cross entropy loss of the ANN, CNN, and U-net like models vary with the training epochs. It is evident that the validation loss line of the ANN model drops very slowly indicating more features need to be added, while the validation loss line of the CNN mode shows significant fluctuation indicating that the model was difficult to train and commit overfitting. In contrast, the U-net model performed better on loss value and time consumption than both the ANN and CNN. Although the model performance can be affected by other factors such as the sample numbers/distribution or other hyperparameter parameters of the model depending on different study areas, there is no doubt that more attention should be paid to balancing the influence of pixel itself and surrounding pixels in future studies on LSM with remote sensing.

5. Conclusions

This is the first study developing a U-net like model for mapping post-earthquake landslide susceptibility. The model architecture was built similar to a traditional U-net model but used less convolutional and pooling layers and connected the original input layers with the last convolution layers. Pre-earthquake Landsat TM images and the influencing factors including DEM, slope, aspect, multi-scale topographic position index (mTPI), lithology, fault, road network, stream network, and macroseismic intensity (MI) were prepared and used as the model input layers. The post-earthquake high spatial airborne images were used for conducting landslide inventory which was the output for training the model. Relative distribution of different influencing factors and post-earthquake landslide occurrence were analyzed. We trained the model for the heavy-hit area of the destructive 2008 Wenchuan earthquake and obtained a good validation accuracy with precision 0.77, recall 0.90, F1 score 0.83, and AUC 0.90. Further detailed analysis of the result indicated that the “extremely high” and “very high” level areas were only 4.56% and 6.74% of total model area, respectively, but accounted for 42.31% and 31.58% (totally 73.89%) of the total landslides, respectively, which could provide reliable reference for engineers to investigate landslides in the field. Comparison of the proposed U-net like model with traditional LR and SVM models showed the strong performance of the former than LR and SVM for both the model area and independent testing area. Through discuss the detail of the model architecture, we found that balancing the environmental influence of a pixel itself and its surrounding pixels to perform a better LSM task is useful and feasible when using remote sensing and GIS technology.

Author Contributions

Conceptualization, Y.C.; data curation, Y.W.; funding acquisition, Y.W. and F.C.; investigation, Q.W.; methodology, Y.C.; validation, Q.W.; writing—original draft, Y.C.; and writing—review and editing, C.L. and S.L.; All authors have read and approved the final manuscript and contributed substantially to the study.

Funding

This research was funded by the National Key Research and Development Program of China (Grant No. 2017YFC1500902), the Bingtuan Science and Technology Project (Grant No. 2017DB005-01), the Second Tibetan Plateau Scientific Expedition and Research (STEP) (Grant No. 2019QZKK0806), and the Special Program for 100 people in Hainan Province.

Acknowledgments

We express our gratitude to Lin Li, Department of Earth Sciences, Indiana University-Purdue University Indianapolis (IUPUI) for his detailed comments and helpful suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

Dadson, S.J.; Hovius, N.; Chen, H.; Dade, W.B.; Lin, J.C.; Hsu, M.L.; Lin, C.W.; Horng, M.J.; Chen, T.C.; Milliman, J.; et al. Earthquake-triggered increase in sediment delivery from an active mountain belt. Geology 2004, 32, 733–736. [Google Scholar] [CrossRef]
Yin, Y.P.; Wang, F.W.; Sun, P. Landslide hazards triggered by the 2008 Wenchuan earthquake, Sichuan, China. Landslides 2009, 6, 139–152. [Google Scholar] [CrossRef]
Shafique, M.; van der Meijde, M.; Khan, M.A. A review of the 2005 Kashmir earthquake-induced landslides; from a remote sensing prospective. J. Asian Earth Sci. 2016, 118, 68–80. [Google Scholar] [CrossRef]
Bianchini, S.; Raspini, F.; Solari, L.; Del Soldato, M.; Ciampalini, A.; Rosi, A.; Casagli, N. From Picture to Movie: Twenty Years of Ground Deformation Recording Over Tuscany Region (Italy) With Satellite InSAR. Front. Earth Sci. 2018, 6. [Google Scholar] [CrossRef] [Green Version]
Shafique, M. Spatial and temporal evolution of co-seismic landslides after the 2005 Kashmir earthquake. Geomorphology 2020, 362. [Google Scholar] [CrossRef]
Khazai, B.; Sitar, N. Evaluation of factors controlling earthquake-induced landslides caused by Chi-Chi earthquake and comparison with the Northridge and Loma Prieta events. Eng. Geol. 2004, 71, 79–95. [Google Scholar] [CrossRef]
Tang, C.; Zhu, J.; Liang, J.T. Emergency assessment of seismic landslide susceptibility: A case study of the 2008 Wenchuan earthquake affected area. Earthq. Eng. Eng. Vib. 2009, 8, 207–217. [Google Scholar] [CrossRef]
Huang, F.; Cao, Z.; Guo, J.; Jiang, S.-H.; Li, S.; Guo, Z. Comparisons of heuristic, general statistical and machine learning models for landslide susceptibility prediction and mapping. CATENA 2020, 191, 104580. [Google Scholar] [CrossRef]
Yalcin, A. GIS-based landslide susceptibility mapping using analytical hierarchy process and bivariate statistics in Ardesen (Turkey): Comparisons of results and confirmations. Catena 2008, 72, 1–12. [Google Scholar] [CrossRef]
Ghosh, J.K.; Bhattacharya, D. Knowledge-Based Landslide Susceptibility Zonation System. J. Comput. Civ. Eng. 2010, 24, 325–334. [Google Scholar] [CrossRef]
Xie, Y. Application of Grey Relational Analysis to the Optimal Selection of Landslide Treatment Scheme. In Proceedings of the ETP/ IITA World Congress in Applied Computing, Computer Science and Computer Engineering, Sanya, China, 8–9 August 2009; pp. 241–243. [Google Scholar]
Brenning, A. Spatial prediction models for landslide hazards: Review, comparison and evaluation. Nat. Hazards Earth Syst. Sci. 2005, 5, 853–862. [Google Scholar] [CrossRef]
Guzzetti, F.; Reichenbach, P.; Cardinali, M.; Galli, M.; Ardizzone, F. Probabilistic landslide hazard assessment at the basin scale. Geomorphology 2005, 72, 272–299. [Google Scholar] [CrossRef]
Sharma, L.P.; Patel, N.; Ghose, M.K.; Debnath, P. Development and application of Shannon’s entropy integrated information value model for landslide susceptibility assessment and zonation in Sikkim Himalayas in India. Nat. Hazards 2015, 75, 1555–1576. [Google Scholar] [CrossRef]
Ba, Q.Q.; Chen, Y.M.; Deng, S.S.; Wu, Q.J.; Yang, J.X.; Zhang, J.Y. An Improved Information Value Model Based on Gray Clustering for Landslide Susceptibility Mapping. ISPRS Int. J. Geo-Inf. 2017, 6, 18. [Google Scholar] [CrossRef]
Pourghasemi, H.R.; Pradhan, B.; Gokceoglu, C.; Mohammadi, M.; Moradi, H.R. Application of weights-of-evidence and certainty factor models and their comparison in landslide susceptibility mapping at Haraz watershed, Iran. Arab. J. Geosci. 2013, 6, 2351–2365. [Google Scholar] [CrossRef]
Zhu, C.H.; Wang, X.P.; Soc, I.C. Landslide Susceptibility Mapping: A Comparison of Information and Weights-of-Evidence Methods in Three Gorges Area. In Proceedings of the International Conference on Environmental Science and Information Application Technology, Wuhan, China, 4–5 July 2009; pp. 342–346. [Google Scholar] [CrossRef]
Dahal, R.K.; Hasegawa, S.; Nonomura, A.; Yamanaka, M.; Masuda, T.; Nishino, K. GIS-based weights-of-evidence modelling of rainfall-induced landslides in small catchments for landslide susceptibility mapping. Environ. Geol. 2008, 54, 311–324. [Google Scholar] [CrossRef]
Poudyal, C.P.; Chang, C.; Oh, H.J.; Lee, S. Landslide susceptibility maps comparing frequency ratio and artificial neural networks: A case study from the Nepal Himalaya. Environ. Earth Sci. 2010, 61, 1049–1064. [Google Scholar] [CrossRef]
Lee, S.; Sambath, T. Landslide susceptibility mapping in the Damrei Romel area, Cambodia using frequency ratio and logistic regression models. Environ. Geol. 2006, 50, 847–855. [Google Scholar] [CrossRef]
Yin, K.L.; Yan, T.Z. Statistical Prediction Models for Slope Instability of Metamorphosed Rocks. In Proceedings of the 5th International Symposium on Landslides, Lausanne, Switzerland, 10–15 July 1988; pp. 1269–1272. [Google Scholar]
Pradhan, B. A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS. Comput. Geosci. 2013, 51, 350–365. [Google Scholar] [CrossRef]
Wu, Y.; Ke, Y.; Chen, Z.; Liang, S.; Zhao, H.; Hong, H. Application of alternating decision tree with AdaBoost and bagging ensembles for landslide susceptibility mapping. CATENA 2020, 187, 104396. [Google Scholar] [CrossRef]
Sun, D.; Wen, H.; Wang, D.; Xu, J. A random forest model of landslide susceptibility mapping based on hyperparameter optimization using Bayes algorithm. Geomorphology 2020, 107201. [Google Scholar] [CrossRef]
Hong, H.; Miao, Y.; Liu, J.; Zhu, A.X. Exploring the effects of the design and quantity of absence data on the performance of random forest-based landslide susceptibility mapping. CATENA 2019, 176, 45–64. [Google Scholar] [CrossRef]
Abbaszadeh Shahri, A.; Spross, J.; Johansson, F.; Larsson, S. Landslide susceptibility hazard map in southwest Sweden using artificial neural network. CATENA 2019, 183, 104225. [Google Scholar] [CrossRef]
Choi, J.; Lee, Y.K.; Lee, M.; Kim, K.; Park, Y.; Kim, S.; Goo, S.; Cho, M.; Sim, J.; Won, J.S.; et al. Landslide Susceptibility Mapping by Using an Adaptive Neuro-fuzzy Inference System (ANFIS). In 2011 IEEE International Geoscience and Remote Sensing Symposium; IEEE: New York, NY, USA, 2011; pp. 1989–1992. [Google Scholar] [CrossRef]
Oh, H.J.; Pradhan, B. Application of a neuro-fuzzy model to landslide-susceptibility mapping for shallow landslides in a tropical hilly area. Comput. Geosci. 2011, 37, 1264–1276. [Google Scholar] [CrossRef]
Bui, D.T.; Pradhan, B.; Lofman, O.; Revhaug, I.; Dick, O.B. Landslide susceptibility mapping at Hoa Binh province (Vietnam) using an adaptive neuro-fuzzy inference system and GIS. Comput. Geosci. 2012, 45, 199–211. [Google Scholar] [CrossRef]
Chen, W.; Panahi, M.; Tsangaratos, P.; Shahabi, H.; Ilia, I.; Panahi, S.; Li, S.; Jaafari, A.; Ahmad, B.B. Applying population-based evolutionary algorithms and a neuro-fuzzy system for modeling landslide susceptibility. CATENA 2019, 172, 212–231. [Google Scholar] [CrossRef]
Canavesi, V.; Segoni, S.; Rosi, A.; Ting, X.; Nery, T.; Catani, F.; Casagli, N. Different Approaches to Use Morphometric Attributes in Landslide Susceptibility Mapping Based on Meso-Scale Spatial Units: A Case Study in Rio de Janeiro (Brazil). Remote Sens. 2020, 12, 1826. [Google Scholar] [CrossRef]
Sameen, M.I.; Pradhan, B.; Lee, S. Application of convolutional neural networks featuring Bayesian optimization for landslide susceptibility assessment. CATENA 2020, 186, 104249. [Google Scholar] [CrossRef]
Wang, Y.; Fang, Z.; Hong, H. Comparison of convolutional neural networks for landslide susceptibility mapping in Yanshan County, China. Sci. Total Environ. 2019, 666, 975–993. [Google Scholar] [CrossRef]
Yao, W.; Zeng, Z.G.; Lian, C.; Tang, H.M. Pixel-wise regression using U-Net and its application on pansharpening. Neurocomputing 2018, 312, 364–371. [Google Scholar] [CrossRef]
Liu, P.; We, Y.; Wang, Q.; Chen, Y.; Xie, J. Research on Post-Earthquake Landslide Extraction Algorithm Based on Improved U-Net Model. Remote Sens. 2020, 12, 894. [Google Scholar] [CrossRef] [Green Version]
Kassim, Y.M.; Glinskii, O.V.; Glinsky, V.V.; Huxley, V.H.; Guidoboni, G.; Palaniappan, K.; IEEE. Deep Unet Regression and Hand-crafted Feature Fusion for Accurate Blood Vessel Segmentation. In Proceedings of the 2019 IEEE International Conference on Image Processing, Taipei, Taiwan, 22–25 September 2019; pp. 1445–1449. [Google Scholar]
Gui, Y.Y.; Li, X.; Li, W.; Yue, A.Z.; IEEE. Multi-Branch Regression Network For Building Classification Using Remote Sensing Images. In 2018 10th IAPR Workshop on Pattern Recognition in Remote Sensing; IEEE: New York, NY, USA, 2018. [Google Scholar]
Cao, L.; Li, L.; Zheng, J.F.; Fan, X.; Yin, F.; Shen, H.; Zhang, J. Multi-task neural networks for joint hippocampus segmentation and clinical score regression. Multimed. Tools Appl. 2018, 77, 29669–29686. [Google Scholar] [CrossRef]
Kamiya, R.; Hotta, K.; Oda, K.; Kakuta, S. Road Detection from Satellite Images by Improving U-Net with Difference of Features. In Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods (ICPRAM), Funchal, Portugal, 16–18 January 2018; pp. 603–607. [Google Scholar] [CrossRef]
Wang, W.M.; Zhao, L.F.; Li, J.; Yao, Z.X. Rupture process of the Ms 8.0 wenchuan earthquake of Sichuan, China. Chin. J. Geophys. Chin. Ed. 2008, 51, 1403–1410. [Google Scholar]
Tang, C.; Zhu, J.; Qi, X.; Ding, J. Landslides induced by the Wenchuan earthquake and the subsequent strong rainfall event: A case study in the Beichuan area of China. Eng. Geol. 2011, 122, 22–33. [Google Scholar] [CrossRef]
Dai, F.C.; Xu, C.; Yao, X.; Xu, L.; Tu, X.B.; Gong, Q.M. Spatial distribution of landslides triggered by the 2008 Ms 8.0 Wenchuan earthquake, China. J. Asian Earth Sci. 2011, 40, 883–895. [Google Scholar] [CrossRef]
Tang, H.M.; Jia, H.B.; Hu, X.L.; Li, D.W.; Xiong, C.R. Characteristics of Landslides Induced by the Great Wenchuan Earthquake. J. Earth Sci. 2010, 21, 104–113. [Google Scholar] [CrossRef]
Mutanga, O.; Kumar, L. Google Earth Engine Applications. Remote Sens. 2019, 11, 591. [Google Scholar] [CrossRef] [Green Version]
Gorelick, N.; Hancher, M.; Dixon, M.; Ilyushchenko, S.; Thau, D.; Moore, R. Google Earth Engine: Planetary-scale geospatial analysis for everyone. Remote Sens. Environ. 2017, 202, 18–27. [Google Scholar] [CrossRef]
Huadong, G. Atlas of Remote Sensing of the Wenchuan Earthquake; Huadong, G., Ed.; CRC Press: Boca Raton, FL, USA, 2009; p. 259. [Google Scholar]
van Genderen, J.L. Atlas of remote sensing of the Wenchuan earthquake. Int. J. Digit. Earth 2011, 4, 91–92. [Google Scholar] [CrossRef]
Nepal, N.; Chen, J.; Chen, H.; Wang, X.A.; Pangali Sharma, T.P. Assessment of landslide susceptibility along the Araniko Highway in Poiqu/Bhote Koshi/Sun Koshi Watershed, Nepal Himalaya. Prog. Disaster Sci. 2019, 3, 100037. [Google Scholar] [CrossRef]
Xiao, L.M.; Zhang, Y.H.; Peng, G.Z. Landslide Susceptibility Assessment Using Integrated Deep Learning Algorithm along the China-Nepal Highway. Sensors 2018, 18, 4436. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Terzaghi, K. Mechanism of Landslides; Geological Society of America: Boulder, CO, USA, 1950. [Google Scholar] [CrossRef]
Peng, Z.G.; Gomberg, J. An integrated perspective of the continuum between earthquakes and slow-slip phenomena. Nat. Geosci. 2010, 3, 599–607. [Google Scholar] [CrossRef]
Theobald, D.M.; Harrison-Atlas, D.; Monahan, W.B.; Albano, C.M. Ecologically-Relevant Maps of Landforms and Physiographic Diversity for Climate Adaptation Planning. PLoS ONE 2015, 10. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Khan, H.; Shafique, M.; Khan, M.A.; Bacha, M.A.; Shah, S.U.; Calligaris, C. Landslide susceptibility assessment using Frequency Ratio, a case study of northern Pakistan. Egypt. J. Remote Sens. Space Sci. 2019, 22, 11–24. [Google Scholar] [CrossRef]
Kamp, U.; Growley, B.J.; Khattak, G.A.; Owen, L.A. GIS-based landslide susceptibility mapping for the 2005 Kashmir earthquake region. Geomorphology 2008, 101, 631–642. [Google Scholar] [CrossRef]
Wald, L.A.; Wald, D.J.; Schwarz, S.; Presgrave, B.; Earle, P.S.; Martinez, E.; Oppenheimer, D. The USGS Earthquake Notification Service (ENS): Customizable notifications of earthquakes around the globe. Seism. Res. Lett. 2008, 79, 103–110. [Google Scholar] [CrossRef]
Thompson, E.M.; McBride, S.K.; Hayes, G.P.; Allstadt, K.E.; Wald, L.A.; Wald, D.J.; Knudsen, K.L.; Worden, C.B.; Marano, K.D.; Jibson, R.W.; et al. USGS Near-Real-Time Products-and Their Use-for the 2018 Anchorage Earthquake. Seism. Res. Lett. 2020, 91, 94–113. [Google Scholar] [CrossRef]
LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
LeCun, Y.; Boser, B.; Denker, J.S.; Henderson, D.; Howard, R.E.; Hubbard, W.; Jackel, L.D. Handwritten Digit Recognition with a Back-Propagation Network. Adv. Neural Inf. Process. Syst. 1990, 2, 396–404. [Google Scholar]
Lecun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef] [Green Version]
Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet Classification with Deep Convolutional Neural Networks. Commun. Acm 2017, 60, 84–90. [Google Scholar] [CrossRef]
Szegedy, C.; Liu, W.; Jia, Y.Q.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A.; IEEE. Going Deeper with Convolutions. In 2015 IEEE Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2015; pp. 1–9. [Google Scholar] [CrossRef] [Green Version]
Simonyan, K.; Zisserman, A. Very Deep Convolutional Networks for Large-scale Image Recognition. In Proceedings of the International Conference on Learning Representations (ICLR 2015), San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
He, K.M.; Zhang, X.Y.; Ren, S.Q.; Sun, J.; IEEE. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar] [CrossRef] [Green Version]
Ciresan, D.; Giusti, A.; Gambardella, L.M.; Schmidhuber, J. Deep Neural Networks Segment Neuronal Membranes in Electron Microscopy Images. Proceedings of Advances in Neural Information Processing Systems 25 (NIPS 2012), Lake Tahoe, CA, USA, 3–6 December 2012. [Google Scholar]
Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Medical Image Computing and Computer-Assisted Intervention, Pt Iii; Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F., Eds.; 2015; Volume 9351, pp. 234–241. [Google Scholar]
Wang, H.; Zhang, L.; Yin, K.; Luo, H.; Li, J. Landslide identification using machine learning. Geosci. Front. 2020. [Google Scholar] [CrossRef]
Bi, J.; Bennett, K.P. Regression Error Characteristic Curves. Proceedings of 20th International Conference on Machine Learning, Washington, DC, USA, 21–24 August 2003; pp. 43–50. [Google Scholar]
Lee, S. Application of logistic regression model and its validation for landslide susceptibility mapping using GIS and remote sensing data journals. Int. J. Remote Sens. 2005, 26, 1477–1491. [Google Scholar] [CrossRef]
Đurić, U.; Marjanović, M.; Radić, Z.; Abolmasov, B. Machine learning based landslide assessment of the Belgrade metropolitan area: Pixel resolution effects and a cross-scaling concept. Eng. Geol. 2019, 256, 23–38. [Google Scholar] [CrossRef]
Hong, H.; Liu, J.; Zhu, A.X. Modeling landslide susceptibility using LogitBoost alternating decision trees and forest by penalizing attributes with the bagging ensemble. Sci. Total Environ. 2020, 718, 137231. [Google Scholar] [CrossRef] [PubMed]
Marjanović, M.; Kovačević, M.; Bajat, B.; Voženílek, V. Landslide susceptibility assessment using SVM machine learning algorithm. Eng. Geol. 2011, 123, 225–234. [Google Scholar] [CrossRef]
Yilmaz, I. Landslide susceptibility mapping using frequency ratio, logistic regression, artificial neural networks and their comparison: A case study from Kat landslides (Tokat-Turkey). Comput. Geosci. 2009, 35, 1125–1138. [Google Scholar] [CrossRef]
Zhao, Y.; Wang, R.; Jiang, Y.; Liu, H.; Wei, Z. GIS-based logistic regression for rainfall-induced landslide susceptibility mapping under different grid sizes in Yueqing, Southeastern China. Eng. Geol. 2019, 259, 105147. [Google Scholar] [CrossRef]
Pang, B.; Nijkamp, E.; Wu, Y.N. Deep Learning With TensorFlow: A Review. J. Educ. Behav. Stat. 2020, 45, 227–248. [Google Scholar] [CrossRef]
Luo, L.K.; Peng, H.; Zhang, Q.S.; Lin, C.D.; IEEE. Comparison of strategies for unbalance sample distribution in support vector machine. In Iciea 2006: 1st IEEE Conference on Industrial Electronics and Applications, Vols 1–3, Proceedings; IEEE: New York, NY, USA, 2006; pp. 128–132. [Google Scholar]
Tanyas, H.; Rossi, M.; Alvioli, M.; van Westen, C.J.; Marchesini, I. A global slope unit-based method for the near real-time prediction of earthquake-induced landslides. Geomorphology 2019, 327, 126–146. [Google Scholar] [CrossRef]
Jacobs, L.; Kervyn, M.; Reichenbach, P.; Rossi, M.; Marchesini, I.; Alvioli, M.; Dewitte, O. Regional susceptibility assessments with heterogeneous landslide information: Slope unit- vs. pixel-based approach. Geomorphology 2020, 356, 107084. [Google Scholar] [CrossRef]
Amato, G.; Eisank, C.; Castro-Camilo, D.; Lombardo, L. Accounting for covariate distributions in slope-unit-based landslide susceptibility models. A case study in the alpine environment. Eng. Geol. 2019, 260, 105237. [Google Scholar] [CrossRef]

Figure 1. Location of the study area. The gray color area which cover 13 counties is the heavy hit mountain area of the Wenchuan earthquake. The abbreviation of HE represents of historical earthquake.

Figure 2. Pre-earthquake Thematic Mapper (TM) color composite and post-earthquake airborne imagery coverage. The blue color numbers show the number (No) of airborne images as Table 1. No. 1 to 7 airborne images were used as model area for developing the U-net like model while No. 8 imagery was used for independent testing.

Figure 3. Locations and photographs of typical landslides. The abbreviation of HE represents of historical earthquake.

Figure 4. Topography factors of post-earthquake landslides. (a) Digital elevation model (DEM); (b) Terrain slope; (c) Terrain aspect; and (d) Multi-scale topographic position index (mTPI).

Figure 5. Lithology and fault factors of post-earthquake landslides.

Figure 6. Human activity factors of post-earthquake landslides. (a) Road network; (b) Stream network.

Figure 7. Earthquake macroseismic intensity (MI) distribution of study area.

Figure 8. Traditional U-net model architecture [65] (example for 32 × 32 pixels in the lowest resolution and binary classification output).

Figure 9. U-net like model architecture for post-earthquake landslide susceptibility mapping (LSM). Each box corresponds to a multi-layer feature map. The number of layers and the x-y-dimension are shown on the box. The colored arrows denote the different operations. The red dotted polygon shows the first convolution layer and the blue dotted polygon shows the last convolution layer.

Figure 10. A flowchart shows how training, validation, and testing samples were created. The ratio under the arrow shows the ratio of select landslide and non-landslide pixels.

Figure 11. Relative distribution of different influencing factors and post-earthquake landslide occurrence with pre-earthquake Landsat TM data.

Figure 12. Relative distribution of different influencing factors and post-earthquake landslide occurrence with (a) DEM; (b) slope; (c) aspect; (d) mTPI; (e) distance from fault; (f) distance from road; (g) distance from stream; (h) MI; and (i) lithology.

Figure 13. Application U-net model to predict the landslide susceptibility of the whole study area. The white polygons show the airborne imagery coverage area. A greater susceptibility value means a higher probability to landslide.

Figure 14. The percentage of total area and landslide area for each level estimated by the U-net model.

Figure 15. (a) Application of the LR model to predict the landslide susceptibility of the whole study area; (b) Application of the support vector machine (SVM) model to predict the landslide susceptibility of the whole study area. The white polygons show the areas covered by the airborne imagery.

Figure 16. (a) Receiver operation characteristic (ROC) curves for three models using test dataset 1; (b) ROC curves for three models using the test dataset 2.

Figure 17. LSM results on test dataset 2. (a) Airborne imagery and landslides distribution of test area 2; (b) U-net modeled LSM; (c) LR modeled LSM; and (d) SVM modeled LSM.

Figure 18. Comparing LSM results of three models on Chengdu Plain area. (a) TM 7/4/1 imagery of Chengdu Plain; (b) U-net modeled LSM; (c) LR modeled LSM; and (d) SVM modeled LSM.

Figure 19. (a) The input and output sketch map of different total convolution size (TCS) for each pixel; (b) Binary cross entropy loss vs epoch curve of different TCS, the black arrow showed the best model position as we used an early stop with patience = 10 on the TensorFlow software.

Figure 20. Binary cross entropy loss vs epoch curve of different models, the black arrow showed the best model position as we used an early stop with patience = 10 on the TensorFlow software.

Table 1. The information of airborne images used for post-earthquake landslide identification.

No	Date	Area (km²)	Landslides (km²)	Objective
1	15 May 2008	260	4.46	Training/validation
2	15 May 2008	344	13.2	Training/validation
3	16 May 2008	712	6.62	Training/validation
4	16 May 2008	392	6.23	Training/validation
5	16 May 2008	156	9.60	Training/validation
6	16 May 2008	326	16.06	Training/validation
7	18 May 2008	400	21.96	Training/validation
8	28 May 2008	137	8.57	Independent testing

Table 2. The description of lithology in the study area.

Symbol	Description
γ	Granite, diorite
δ	Diorite
ε	Hornblende
Q	Quaternary. Metamorphic sandstone, limestone
K	Cretaceous. Conglomerate, sandstone, mudstone
J	Jurassic. Sandstone, mudstone and their interbeds
T	Triassic. Sandstone, limestone, slate
P	Permian. Thick limestone with slate in the middle
C	Carboniferous. Limestone, marble, sandstone
D	Devonian. Quartz sandstone
S	Silurian. Sandstone, phyllite, limestone interbed
O	Ordovician. Limestone, marble, phyllite
ψ	Cambrian. Metamorphic grit and limestone
Z	Sinian. Metamorphic sandstone, limestone

Table 3. The detail of the input layers for U-net like model.

Layer	Data	Description	Value
1–7	Landsat TM data	7 bands Landsat 5 surface reflectance Tier 1 data obtained from GEE. (https://developers.google.com/earth-engine/datasets/catalog/LANDSAT_LT05_C01_T1_SR#description).	Surface reflectance value of each band with scale 10,000.
8	DEM	SRTM Digital Elevation Data with 30m/pixel resolution obtained from GEE. (https://developers.google.com/earth-engine/datasets/catalog/USGS_SRTMGL1_003#description).	Value in meters.
9	Slope	Computed from DEM data.	Value in degrees.
10	Aspect	Computed from DEM data.	Value was clockwise in degrees from 0 (due north) to 360 (again due north), coming full circle. Flat areas are given a value of −1.
11	mTPI	Obtained from GEE Global ALOS mTPI. (https://developers.google.com/earth-engine/datasets/catalog/CSP_ERGo_1_0_Global_ALOS_mTPI#description).	Value range from −789 to 678 in the study area according to literature [52].
12	Fault	Extracted from the geological map with the scale 1:200,000.	Euclidean distance to the closest fault.
13	Road network	Obtained from the National Geomatics Center of China (NGCC) with the scale 1:50,000.	Euclidean distance to the closest road.
14	Stream network	Obtained from the National Geomatics Center of China (NGCC) with the scale 1:50,000.	Euclidean distance to the closest stream.
15	MI	Obtained from USGS.	MI value.
16–29	Lithology	Extracted from the geological map with the scale 1:200,000, and assigned to14 dummy variables as described in the literature [66].	Dummy value 0 or 1.

Table 4. The evaluation result of training and validation.

Precision		Recall		F1 Score		AUC
Training	Validation	Training	Validation	Training	Validation	Training	Validation
0.83	0.77	0.92	0.90	0.87	0.83	0.95	0.90

Table 5. Confusion matrix between predicted value and true value in test area 1 and 2 based on different models.

Test Area	Models	True Value	Predict Landslide Susceptibility Value (ls)						Acc
			Positive (Landslide)			Negative (Non-Landslide)
			ls ≥ 0.9	0.7 ≤ ls < 0.9	0.5 ≤ ls < 0.7	0.3 ≤ ls < 0.5	0.1 ≤ ls < 0.3	ls < 0.1
Test area 1 (20,000 pixels)	U-net	1 (Positive, Landslide)	4175	3193	1169	532	461	470	85.46%
	U-net	0 (Negative, Non-landslide)	349	584	513	486	790	7278	85.46%
	Logistic	1 (Positive, Landslide)	0	1742	2782	3591	1839	46	69.12%
	Logistic	0 (Negative, Non-landslide)	0	189	512	1924	5199	2176	69.12%
	SVM	1 (Positive, Landslide)	5185	1753	1034	717	714	597	83.30%
	SVM	0 (Negative, Non-landslide)	495	408	410	650	1574	6463	83.30%
Test area 2 (3000 pixels)	U-net	1 (Positive, Landslide)	430	348	380	158	113	71	79.70%
	U-net	0 (Negative, Non-landslide)	10	105	152	120	144	969	79.70%
	Logistic	1 (Positive, Landslide)	0	384	1001	95	9	11	78.90%
	Logistic	0 (Negative, Non-landslide)	0	116	402	99	706	177	78.90%
	SVM	1 (Positive, Landslide)	538	597	212	97	27	29	79.20%
	SVM	0 (Negative, Non-landslide)	157	212	102	65	86	878	79.20%

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, Y.; Wei, Y.; Wang, Q.; Chen, F.; Lu, C.; Lei, S. Mapping Post-Earthquake Landslide Susceptibility: A U-Net Like Approach. Remote Sens. 2020, 12, 2767. https://doi.org/10.3390/rs12172767

AMA Style

Chen Y, Wei Y, Wang Q, Chen F, Lu C, Lei S. Mapping Post-Earthquake Landslide Susceptibility: A U-Net Like Approach. Remote Sensing. 2020; 12(17):2767. https://doi.org/10.3390/rs12172767

Chicago/Turabian Style

Chen, Yu, Yongming Wei, Qinjun Wang, Fang Chen, Chunyan Lu, and Shaohua Lei. 2020. "Mapping Post-Earthquake Landslide Susceptibility: A U-Net Like Approach" Remote Sensing 12, no. 17: 2767. https://doi.org/10.3390/rs12172767

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Mapping Post-Earthquake Landslide Susceptibility: A U-Net Like Approach

Abstract

1. Introduction

2. Materials and Methods

2.1. Study Area

2.2. Remote Sensing Data

2.2.1. Pre-Earthquake Data

2.2.2. Post-Earthquake Data

2.3. Landslide Inventory

2.4. Landslide Influencing Factors

2.4.1. Topography

2.4.2. Lithology and Fault

2.4.3. Human Activity

2.4.4. Seismic Parameters

2.5. U-Net Like Model for Post-Earthquake LSM

2.5.1. Traditional CNN and U-Net Model

2.5.2. Model Architecture

2.5.3. Input and Output

2.5.4. Training, Validation, and Independent Testing

3. Results

3.1. Spatial Analysis of Landslides

3.2. LSM Result of U-Net Like Model

3.3. Compare with LR and SVM Models

4. Discussion

4.1. Sample Balance for Model Input

4.2. Total Convolutional Size of Model Architecture

4.3. Pixel Itself or Surrounding Pixels for LSM Task

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI