Landslide Susceptibility Mapping in Darjeeling Himalayas , India

Landslide susceptibility map aids decision makers and planners for the prevention and mitigation of landslide hazard. )is study presents a methodology for the generation of landslide susceptibility mapping using remote sensing data and Geographic Information System technique for the part of the Darjeeling district, Eastern Himalaya, in India. Topographic, earthquake, and remote sensing data and published geology, soil, and rainfall maps were collected and processed using Geographic Information System. Landslide influencing factors in the study area are drainage, lineament, slope, rainfall, earthquake, lithology, land use/land cover, fault, valley, soil, relief, and aspect. )ese factors were evaluated for the generation of thematic data layers. Numerical weight and rating for each factor was assigned using the overlay analysis method for the generation of landslide susceptibility map in the Geographic Information System environment. )e resulting landslide susceptibility zonation map demarcated the study area into four different susceptibility classes: very high, high, moderate, and low. Particle Swarm Optimization-Support Vector Machine technique was used for the prediction and classification of landslide susceptibility classes, and Genetic Programming method was used to generate models and to predict landslide susceptibility classes in conjunction with Geographic Information System output, respectively. Genetic Programming and Particle Swarm Optimization-Support Vector Machine have performed well with respect to overall prediction accuracy and validated the landslide susceptibility model generated in the Geographic Information System environment. )e efficiency of the landslide susceptibility zonation map was also confirmed by correlating the landslide frequency between different susceptible classes.


Introduction
Landslides are momentary and instantaneously happening vandalize natural hazard in mountains; however, it turns in to a disaster due to immature geology coupled with external temporal triggering factors causing landscape changes and direct and indirect losses.Landslide happens when the slope changes from a steady to an unsteady state.e influence of gravity is the key operating force for the landslide to happen.It is a motion of mass of rock, earth, or debris along the slope [1].More than 15% of total land area in India is considered to be affected by landslides [2].Globally awareness has been drawn on the study of landslides mostly because of growing pressure of urbanization and its socioeconomic effects on the terrain habitats [3]. is trend of landslides is expected to continue in the time to come due to sustained deforestation, increment of haphazard urbanization, and changing climatic patterns in the landslide-prone areas [4,5].Landslide risk cannot be utterly averted; however, the impact of its acuteness and intensity may be reduced by identifying and predicting the problems in advance.Despite advances in science and technology, landslides continue to result in economic, human, and environmental losses worldwide [6].
e external and temporal triggering factors like rainfall and earthquakes, if happening individually or in combination with some considerable time and magnitude, are directly accountable for inducing the landslides [7].Rainfall is an inevitable triggering factor which could contribute to the occurrence of landslides [8].e landslide susceptibility is the probability of spatial phenomenon of slope failures [9].Remote sensing can play a role in the generation of landslide inventory map and thematic maps related to landslide occurrences [10].Remote sensing (RS) data in conjunction with data from other sources in digital form and their analysis in Geographic Information System (GIS) environment have made possible to generate different thematic data layers corresponding to the contributing factors accountable for the occurrence of landslides [11][12][13].Geomatics by taking advantage of modern tools, such as remote sensing and GIS, provides a perfect opportunity for usage, validation, and comparison of different methods to produce a landslide susceptibility map [14].For the creation of the LSZ map in the GIS environment, the integration of various thematic data layers with weights was assigned with respect to their relative significance [15][16][17][18].Production of the landslide susceptibility map describes the prone area where landslides may occur in the future [19].
e landslide susceptibility analysis has been applied for the purpose of assessing the degree of risk in landslide-prone areas [20].Landslide susceptibility mapping is an important step prior to landslide assessment planning, management, and disaster mitigation [21].Landslide susceptibility zonation (LSZ) maps are helpful in identifying the landslide vulnerable zone in advance, planning the future development projects and mitigation programs.Hence, for effective and efficient disaster management, there is an urgent requirement of recognizing unstable slopes and mitigating its effects, which may be attained with the assistance of LSZ mapping.
e objective of Particle Swarm Optimization (PSO)-Support Vector Machine (SVM) approach used in landslide susceptibility mapping is to generate an accurate landslide susceptibility map through classification technique [22].Particle Swarm Optimization is a computational technique that optimizes an issue towards iterative attempt to move forward a particle solution with respect to a provided measure about quality [23].
e advantages of PSO over other soft computing models are as follows: PSO provides more range and exploration for the population and the movements of particles allows the fast convergence of greater diversity in the search space.e principle behind the Support Vector Machine classifier is setting a boundary to an area of points which all are the same types.
Genetic Programming (GP) is an evolutionary computing technique that is used to solve a variety of complex problems.Koza [24] developed the Genetic Programming approach, and since then, it has been widely used in science and engineering applications.GP is a very popular method with diverse applications.In the field of geosciences, GP has been used by Litschert [25] to map landslide hazard zones in California locations.In another study by Nourani et al. [26], landslide susceptibility mapping was performed using GP for Zonouz plain in Iran.
is study presents a methodology for the generation of landslide susceptibility mapping using remote sensing data and Geographic Information System technique for the part of the Darjeeling district, Eastern Himalaya, in India.e performance of this GIS output was evaluated by using PSO-SVM and GP techniques, which aims to predict the accuracy of landslide susceptibility classes.e study also aims to determine the usefulness of remote sensing and GIS, and an attempt was made to generate the landslide susceptibility map for effective and efficient disaster management for the future.

Study Area
In India, about 12.6% of Indian landmass are in the Himalayan terrain region and are landslide susceptible regions [2]. is study focused on a part of Darjeeling hills, West Bengal, Eastern Himalaya, in India, which lies within the latitude 26 °49′ 31.910″ to 26 °56′ 38.366″ N and longitude 88 °13′3.706″ to 88 °22′31.818″E and reports an area of about 201 sq.km (Figure 1). is study belongs to the steep and rugged mountainous terrain, which falls at the alluvial plains of north extreme of West Bengal, India, and is highly prone to landslide.During monsoon, these regions observed regular land sliding incidents activated by rainfall.e study area also belongs to the high damage risk and severe earthquake intensity zone in India, that is, Zone IV [27], and is thus prone to earthquake-induced landslides.As per 2011 census, the study area has an inhabitant's density of 586 inhabitants per square kilometer, and its inhabitant expansion rate over the decade 2001-2011 was 14.77%.e study area is dominated with the factors mostly favorable for occurrence of landslide areas like slope of degree >45 °, highly dissected hill slopes, barren land, high rainfall, high earthquake-prone areas, and relief >2000 m. e study area has a history of land sliding events which resulted in loss of life and infrastructure.In the past, the study area experiences earthquakes of high intensity.From the history of landslide events in the Darjeeling Himalayas, it is concluded that the study area is vulnerable to landslides, and hence, the landslide susceptibility zonation map for the study area is required to be analyzed.

Data
In this study, the statistics utilized were IRS-RESOURCE-SAT-II LISS-IV, CARTOSAT-I PAN satellite data (Table 1) of the National Remote Sensing Centre, India; topographic maps of the Survey of India (1 : 50,000 scale); earthquake data of the National Centre for Seismology, India; and information from the published geology, soil, and rainfall maps.Integration of LISS-IV and PAN data was attempted to have the benefit of both high spatial and high spectral resolution in a sole image.Figure 2 shows the satellite image of the study area.Advances in Civil Engineering in GIS environment, calculating landslide potential index, and classifying and validating landslide susceptibility map.In this study, twelve di erent factors were assessed for the generation of landslide susceptibility mapping in the GIS environment.In this study, overlay analysis technique was adopted in the GIS environment for the generation of thematic data layers.For the study area, on the basis of eld knowledge, experience, and available literature, weight values were allocated to the data layers/factors on a 1 to 10 numerical scale in series of their signi cance towards slope instability, whereas the rating values were allocated to the classes of the layers on a 0 to 9 numerical scale, in which higher rating re ects greater in uence on landslide event compared to lower one.e numerical values adopted for weights and ratings were allotted to the di erent factors.

Methodology
ematic data layers were created by mathematically multiplying the weight of the layer with the ratings of the correlating class of the individual layer.
e outcome of the nal LSZ map was classi ed into various discrete susceptibility classes.Validation of the landslide susceptibility map was attempted with the help of landslide distribution map and data of landslides.Twelve prime parameters and the landslide potential index obtained from the parameters were involved as an input to PSO-SVM and GP model.

Thematic Data Layers
For the preparation of the LSZ map, generation of various thematic data layers is required, and the factors selected were both preparatory and triggering.e layers were produced in the GIS environment.CARTOSAT-1 DEM at a resolution of pixel size 25 × 25 m was utilized to extract information on slope, relief, and aspect layers.

Aspect.
Generally, the south-facing slopes have lesser vegetation density as compared to the north-facing slopes [28].With respect to the landslide distribution, south-and east-facing slopes are further susceptible to landslides [29].e aspect map (Figure 4(b)) is derived from the surface (spatial analysis) tool of DEM using GIS software.Aspect statistics of the study area computed were 0%, 7.76%, 9.66%, 14.17%, 11%, 15.24%, 13.73%, 13.34%, and 15.09% for the flat, north, northeast, east, southeast, south, southwest, west, and northwest directions, respectively.

Valley Buffer.
A valley buffer of 100 meter was considered for the study area along the major streams, that is, 3rd and higher-order drainages, and accordingly, the valley buffer map was generated (Figure 4(c)).Valley buffer statistics of the study area computed were 92.18% and 7.82%, for the valley buffer >100 m and <100 m, respectively.

Land Use/Land
Cover.Land use is an indirect measure for the strength of the slope, as it commands the rate of weathering and erosion.In this study, the LU/LC map (Figure 4(d)) was generated by utilizing the IRS-RESOURCESAT-II LISS-IV image with four bands along with the CARTOSAT-I PAN image.e interpreted land use/land cover has been digitized, and after that it was rasterized on 25 × 25 m pixel size.Land use/land cover statistics of the study area computed were 0.36%, 0.94%, 6.48%, 1.84%, 63.47%, 10.39%, 13.77%, and 2.75%, for the land use class agriculture land, barren land, built-up area, scrub land, sparse forest, tea plantation, thick forest, and waterbody, respectively.5.5.Soil.Top soil cover on a slope has an influence on the occurrence of landslides [17].e soil map (Figure 4(e)) was extracted from a regional soil map published by National Bureau of Soil Survey and Land use planning using the GIS environment.Soil statistics of the study area computed were 45.16%, 16.46%, and 38.38%, for the soil unit coarse loamy, fine loamy, and loamy skeletal, respectively.5.6.Lithology.In contrast to the weaker rocks, the stronger rocks give more resistance to the driving forces and consequently are less susceptible to landslides and vice versa [30].
e lithology is a representation of the physical characteristics of rock or soil.e lithology map (Figure 4(f)) was generated from the geological map [31] of the Darjeeling area in GIS environment.Lithology statistics of the study area computed was 6.34%, 0.94%, 0.93%, 0.93%, 3.27%, 8.24%, 6.57%, 7.46%, 4.44%, 1.22%, 1.12%, 46.20%, 6.12%, 1.05%, and 5.17% for the rock Chunabati formation, composite of Damuda formation and Chunabati formation, composite of Rangit pebble slate and Damuda formation, composite of Rangit pebble slate, Damuda formation and Gorubathan formation, Damuda formation, Darjeeling gneiss, feldspathic greywacke, geabdat sandstone, Gorubathan formation, graphite schist/gneiss, lingtse granite gniess, paro subgroup (paro gneiss), quartzite key beds (paro quartzite), quaternary and recent sediments, and Rangit pebble slate, respectively.5.7.Drainage.Most common cause for landslide in terrain region is soil erosion due to drainage activity.Increase of pore water pressure and decrease in the shear strength due to water infiltration will result in instability to the slope.Methodology for generating the drainage buffer by correlating with the landslide distribution has been suggested by Kanungo et al. [30], to map the landslide hazard zones in the Darjeeling Himalayas.From the topographic sheets of Survey of India in the scale of 1 : 50,000, most of the drainage layers were produced by digitization of the drainages and subsequently they were updated with the aid of the PAN image and LISS-IV image in the GIS environment.For all the drainage orders, a 25 m buffer zone on both sides of the drainages was created.After the spatial association of landslide distribution, it was observed that most of the landslide pixels fall in the 1st-and 2nd-order drainage buffers only.Accordingly, a drainage buffer map (Figure 4(g)) was generated in the GIS environment by taking into account the 1st-and 2nd-order streams only with 25 m buffer zones around these drainages.Drainage statistics of the study area computed were 8.62%, 3.57%, and 87.81% for the drainage buffers 1st order, 2nd order, and for the rest of the area, respectively.5.8.Lineaments.Landslides are more prone in the jointed, fractured, and faulted areas.Methodology for generating the lineament buffer layers with equal interval buffer zones has been suggested by Kanungo et al. [30], to map the landslide hazard zones in the Darjeeling Himalayas.PAN and LISSIV satellite images are used for the interpretation of lineaments.Interpreted lineaments were processed for the generation of lineament layers in the GIS environment.Lineament buffer regions were generated at 250 m distance, and then they were geospatially cross-tallied with the pixels of landslides.84% of the pixels had befallen in the primary two buffer regions only.Hence, it has been evaluated to have five equal distance lineament buffer zones at 125 m distance, and with respect to these zones, the lineament buffer map (Figure 4(h)) was generated.e lineament statistics of the study area computed were 20.63%, 21.13%, 18.39%, 14.72%, and 25.13% for the lineament buffers <125 m, 125-250 m, 250-375 m, 375-500 m, and >500 m, respectively.5.9.Fault Buffer.Huge jointed area displaying offsets of chain of hills are faults.e faults were obtained from the geological map [31] of the Darjeeling area by using GIS software.Hence for the study area, it has been considered to have five equal distance buffer zones at 125 m distance for the generation of the fault buffer map (Figure 4(i)), to study the effect of faults on the landslide occurrence.e fault Advances in Civil Engineering buffer statistics of the study area computed were 0.73%, 0.88%, 0.98%, 1.14%, and 96.27% for the fault buffers <125 m, 125-250 m, 250-375 m, 375-500 m, and >500 m, respectively.5.10.Relief.Relief represents the difference in altitude between two points.e lesser relief value nominates a mature topography, whereas the higher relief value nominates immature topography.
e relief map (Figure 4(j)) was extracted from DEM in the GIS environment.
e relief statistics of the study area computed were 56.10%, 41.07%, and 2.83% for the reliefs 0-1000 m, 1000-2000 m, and >2000 m, respectively.5.11.Rainfall.Rainfall plays a critical part in the unforeseen events like landslides.It is an external temporal triggering factor, and excess of it may make the slope become heavy due to increase in pore water pressure and shall result in slope slips.e rainfall map (Figure 4(k)) of the study area was generated by digitizing the polygons from the rainfall map of National Atlas and ematic Mapping Organisation, Kolkata, India, in a vector layer.
e rainfall map was rasterized at 25 m × 25 m spatial resolution.
e rainfall statistics of the study area computed were 4.84% and 95.16% for the rainfall classes medium and high, respectively.5.12.Earthquake.Both the destructive natural disasters, that is, earthquakes and landslides, are common in one sense that both are scary and destructive in nature.
e tremors produced by the earthquakes do not only activate currently developed landslides but also revive the older ones.e earthquake map for the study area has been prepared by using the point data of more than last 200 years, and these point data were explained with the magnitude and epicenter location.e data collected were obtained from National Centre for Seismology, New Delhi, India.e earthquake map (Figure 4(l)) has been generated by interpolation technique utilizing the Inverse Distance Weighted (IDW) tool in GIS environment; in which, the IDW tool forms a surface with respect to the value of point data.e earthquake map was subcategorized into two categories of moderate-and high-prone zones with respect to the reference of past history of earthquake magnitudes ranging from 3 to 5 (moderate) and above 5 (high) measured in the Richter scale, respectively.Earthquake statistics of the study area computed were 69.59% and 30.41% for the earthquake classes high and moderate, respectively.

Landslide Distribution.
e principal features for locating the landslides by using satellite images are spectral attributes, size, shape, contrast, and so on; the landslides are mostly bare of vegetation, and they showed high reflectance [17].In the past, for the detection of landslides, SPOT PAN and ATM high-resolution satellite images of 10 m and 7.5 m resolution, respectively, were used by Mason et al. [32].For the study area, landslides were detected by using the IRS-RESOURCESAT-II LISS-IV and CARTOSAT-I PAN high-resolution satellite images of 5 m and 2.5 m resolution, respectively.Some of the extracted landslides from the satellite images were crosschecked, and the landslide distribution map (Figure 5) was modified accordingly.e mapping of prevailing landslides is necessary to study the association between the landslide distribution and the causative factors [30].History of landslide inventory of the study area was prepared from the Special Publication Number 94, Geological Survey of India.According to this landslide inventory, 17 major landslides were falling in our study area.When these 17 landslide locations were correlated with the landslide distribution map and field verification, it was observed that most of these major historical landslide locations are still falling in the landslide distribution map of the study area.e landslide distribution map contains 91 landslides which were scattered over the complete study area.Most of the landslides falling are in the varying span from 500 sq.m to 3000 sq.m.
In the study area, ground truth verification for some of the significant landslides was carried out by using the global positioning system (GPS) device.Four significant landslide sites were selected within the study area (Figure 5), for explaining the information on actual occurrence and type of landslides with the help of field survey and Google Earth satellite image (Figure 6).In location Site 1, the type of landslide observed was translational debris slide, and the state of this slide was enlarging and multiple.For Site 2, the type of landslide observed was translational rock-cumdebris slide, and the state of this slide was successive.For Site 3, the type of landslide observed was translational debris slide, and the state of this slide was enlarging and single.However, for Site 4, the type of landslide observed was complex slide, where the slide is partly translational grading to earthflow at toe, and the state of this slide was single.

Landslide Susceptibility Mapping.
Identification of potential landslide area can be achieved by developing a rating scheme in which the factors and their classes were assigned numerical values based on the associated causative factors [17].In this study, an attempt was made to generate a grading system in which the factors and their classes have been allocated numerical values.e thematic data layers have been generated in the GIS environment.e weights and ratings allotted to each factor and their classes are given in Table 2.In this study, twelve thematic data layers were generated, and they were overlaid and mathematically added by using (1) for generating the landslide potential index (LPI) in the GIS environment for each cell: where Dr, Li, Sl, Pr, Ea, Lith, Fb, Vb, La, So, Re, and As are the representative symbols for the thematic weighted layers for drainage buffer, lineament buffer, slope, precipitation, earthquake, lithology, fault buffer, valley buffer, land use/ land cover, soil, relief, and aspect, respectively.e range for LPI adjudged was between 171 and 502, which was later on categorized to generate landslide susceptibility classes.For 8 Advances in Civil Engineering such classi cations, a judicious way is to look for values of sudden abrupt alterations [33].Boundaries for di erent susceptible classes were drawn at signi cant changes in gradient [17].Hence, a graph was plotted with LPI-frequency values, emerging many swings.In this graph (Figure 7), motion mean with the mean window lengths of 3, 7, and 11 was utilized for smoothening the curve and ner classi cation.To acquire respective susceptibility zones for di erent categories, the borders were drawn at abrupt alteration in gradient of the slope at the LPI values of 212, 302, and 381 (Table 3).e LSZ map (Figure 8) was generated by using the LPI-class boundaries in the GIS environment.

Classi cation and Prediction Using PSO-SVM Approach
Based on LPI.In PSO, the framework is started with a populace of random solutions and scans for optima by refreshing generations.In this approach, the potential arrangements, called particles, y through the issue space by following the present ideal particles.

e Algorithm for PSO-SVM.
PSO is introduced with a gathering of random particles (solution) and afterward looks for optima by refreshing generations.In each generation, every molecule is modi ed by following two "best" parameters.
e rst is the best solution ( tness) it has accomplished so far ( tness parameter is additionally put away). is value is called pbest.Another "best" parameter that is followed by the particle swarm analyzer is the best value, got so far by any particle in the populace.is best value is a global best and called gbest.e movements of the particles are directed by the values pbest and gbest.At the point when enhanced positions are being found, these values will then come to direct the movements of the swarm.e particle performs movement by updating its velocity and positions by using ( 2) and (3), respectively.Figure 9 describes the optimization of particles using PSO algorithm.
Formula ( 2) is for velocity update operation: Formula ( 3) is for position update according to velocity: where v t i and x t i denote the velocity and position of ith particle, p t best i denotes individual best known position of ith particle, g t best i denotes the entire swarm best known position, and rand 1 and rand 2 are the two random variables, whereas c 1 and c 2 are the learning parameters.
e owchart presented in Figure 9 describes that for every particle, if the calculated tness value is superior than the best tness value calculated in the past, make the present tness value as new pbest.After that the gbest value is calculated by assigning best particle's pbest value to gbest.

Procedure for PSO-SVM.
e owchart describing the working of PSO is shown in Figure 10.For the classication task, optimal LPI values are identi ed using PSO according to the minimum and maximum ranges for all Advances in Civil Engineering the 12 parameters by considering the numerical value of weights and ratings as given in Table 2. is work started with an initial population of "50" particles and processed up to "500" maximum iterations (generation).e procedure for classi cation are as follows.
Step 1: describe the LPI function as tness function and also initialize "population" and "max generation"; Step 2: set the minimum and maximum limits for each parameter used in LPI function according to Table 2; Step 3: record the movements of every particle in each generation in the vector format containing the LPI value along with the corresponding parameter values.In each generation, population moves from one place to suitable place and generates the new tness values.

Classi cation System for PSO-SVM.
In the classi cation, data are loaded from the database and divided into 70% training set and 30% testing set.In the data set, the parameter values corresponding to tness values are treated as a feature vector.SVM looks generally advantageous hyperplanes that give the maximum pro t.Input: training set, class labels, and testing set.Step 1: perform training of the SVM classi er with training set and class labels.Step 2: perform testing with testing set to test the accuracy of prediction.

Discussion for PSO-SVM Study.
e PSO-SVM study was carried out by taking selected "800" LPI values (200 for each class) along with the corresponding parameters that cover the entire range of classi cation for each class.Training is carried by taking 600 LPIs (150 from each class), and testing is performed using 200 LPIs (50 from each class).
e test accuracy described by the confusion matrix is presented in Figure 11(a), and a comparison between classi cations based on LPI values and predicted classication based on the SVM classi er the same as the confusion matrix is presented in Figure 11(b).e confusion matrix is used to explain the classi cation performance model on a set of testing data for which the original labels are known.Each row of the matrix represents the instances in a predicted class, while each column represents the instances in an actual class (or vice versa).In Figure 11(a), there are 5 columns and 5 rows, but classi cation in this task is limited to only 4 classes, and the fth column and fth row are lled with all zeros.e rst, second, third, and fourth columns and rows   Advances in Civil Engineering correspond to low, moderate, high, and very high classes, respectively.Each column related to these classes will have a sum of 50 LPIs; that is, the testing is done by 200 LPIs, by taking 50 LPIs from each class, which is the target class (actual classi cation based on LPI values), and predicted classi cation by the SVM classi er is represented by rows.In the rst row, a sum of 50 LPIs are presented that means out of 50 low-class LPIs, and the system predicted that all are the low class LPIs.In the second, third, and fourth rows, a sum of 52, 51, and 47 is present that means out of 50 moderate, 50 high, and 50 very high-class LPIs, the system predicted that 52 belongs to moderate, 51 belongs to high, and 47 belongs to very high.e prediction accuracy is also written in the sixth row corresponding to each class.A comparison of the measured LPI values generated by GIS and the predicted    12 Advances in Civil Engineering values generated by PSO-SVM in terms of frequency division is shown in Figure 7.

Classi cation and Prediction
Using Genetic Programming Approach Based on LPI.Genetic Programming (GP) is based on Darwin's theory of evolution.It uses the principle of natural selection and genetic recombination. is method transforms units that de ne a given problem, and with the progress of number of iterations, they familiarize themselves to their surroundings.A GP model consists of variables and functions represented in a tree structure.e leaf nodes of the tree are the variables that are user de ned like soil type and slope and interior nodes are the functions like +, −, sine, and cosine.
e basic steps involved in the operation of GP are the creation of random set of population for the given variables and functions which are then evaluated for tness.e ttest  Advances in Civil Engineering populations are passed on the next generation after applying genetic operators like crossover, mutation, and duplication and further checked for tness.An illustration for crossover operation is shown in Figure 12.As the number of generation increases, the possibility of more t models enhances.e GP tends to attain a model of 100% accuracy by iterating for few numbers of generations or otherwise gives a model of fairly high accuracy having reached the speci ed number of generation or the time limit.

Genetic Programming
Procedure.In this study, a computer program code for GP was run in MATLAB programming language.Before the run, training and testing data sets as inputs to the GP code were created.e input data sets were based on the variables and functions that formed the original LPI equation given by the GIS method.In this equation, the value for each variable to create the data set was randomly chosen within the limits as mentioned in Table 2. Out of the total number of data sets, 70% was used as the training set and remaining 30% as the testing set.
To begin with, the number of generations was kept very low as to check the accuracy that it attains at low generation values.Gradually the number of generations was increased and at 30th generation, the accuracy was found to be 100%, but the complexity of the model obtained was fairly high.On increasing further, the number of generations at 50th generation, the model equation obtained was the simplest, while maintaining the same accuracy of 100%. is equation obtained was the closest match to the original LPI equation given by the GIS method with a value of 0.000224 added at the end.e run results for the various numbers of generations are shown in Table 4 below.
e equations obtained for each iteration are shown in Table 5.A comparison of the measured LPI values generated by GIS and the predicted values generated by GP in terms of frequency division is shown in Figure 13.e total number of data points is di erent for LPI values of GIS and PSO-SVM methods (Figure 7) and GIS and GP methods (Figure 13) of analysis.A particular LPI value has a di erent number of occurrences while comparing the performance of GIS and PSO-SVM methods and GIS and GP methods; however, by comparing the trend at a suitable scale in the frequency axis of the comparison chart, an overall matching trend shall be observed.Hence, to compare the LPIfrequency trend of both the methods (PSO-SVM and GP) with the LPI-frequency trend using GIS (Figures 7 and 13), the scales of the frequency axes have been kept di erent in left and right ordinates.

Map Validation. Computation of landslide frequency
for each class is the dominant factor for evaluating the quality of LSZ map [34].e landslide susceptibility zones re ect the existing eld instability conditions [17].While  14 Advances in Civil Engineering verifying the LSZ map, very high and high susceptible zones show signs of soil erosion, slope instability, and so on.For evaluating the e ectiveness of the LSZ map, it is signi cant to calculate the landslide frequency for each zone.e study area has been classi ed into four zones, namely, low susceptibility zone, moderate susceptibility zone, high susceptibility zone, and very high susceptibility zone, and the number of landslides and frequency per susceptible zone have been computed (Table 6).

Conclusions
Based on the studies described above, the following conclusions are drawn:   Advances in Civil Engineering

Figure 1 :Figure 2 :
Figure 1: Location map of the study area.

Figure 3 :
Figure 3: Flow diagram showing di erent steps involved in the preparation of landslide susceptibility mapping.

Figure 6 :
Figure 6: Field photographs, Google Earth images, and types of some of the major landslide events in the study area.(a) Site 1: translational debris slide.(b) Site 2: translational rock-cum-debris slide.(c) Site 3: translational debris slide.(d) Site 4: complex slide.

Figure 7 :Figure 8 :
Figure 7: Frequency division of landslide potential index and predicted frequency using PSO-SVM.

Figure 9 :
Figure 9: Flowchart for optimization of particles using PSO algorithm.

Figure 11 :
Figure 11: Confusion matrix (a) and graph (b) describing the accuracy.

Figure 13 :
Figure13: Frequency division of landslide potential index and predicted frequency using GP.

Table 1 :
Details of satellite statistics used in this study.

Table 2 :
Weights and ratings for various factors and their classes.

Table 3 :
Distribution of landslide susceptibility zones and landslide potential index of the study area.

Table 4 :
Genetic Programming run results.
Note. # e accuracy is measured in terms of coe cient of correlation (R).* e iteration achieved best tness before reaching 200th generation.

Table 6 :
Details of landslides and frequency in different susceptible zones.