LIÊN KẾT WEBSITE
Gully head-cut distribution modeling using machine learning methods-a case study of N.W. Iran
Water (Switzerland) Số 1, năm 2020 (Tập 12, trang -)
ISSN: 20734441
ISSN: 20734441
DOI: 10.3390/w12010016
Tài liệu thuộc danh mục: Scopus
Article
English
Từ khóa: Benchmarking; Decision trees; Erosion; Forecasting; Logistic regression; Machine learning; Random forests; Alternating decision trees; Arid and semi-arid regions; Gully head-cuts; Iran; Machine learning methods; Machine learning models; Soil erosion; Topographic wetness index; Landforms; gully erosion; machine learning; regression analysis; satellite data; soil erosion; watershed; Iran
Tóm tắt tiếng anh
To more effectively prevent and manage the scourge of gully erosion in arid and semi-arid regions, we present a novel-ensemble intelligence approach-bagging-based alternating decision-tree classifier (bagging-ADTree)-and use it to model a landscape's susceptibility to gully erosion based on 18 gully-erosion conditioning factors. The model's goodness-of-fit and prediction performance are compared to three other machine learning algorithms (single alternating decision tree, rotational-forest-based alternating decision tree (RF-ADTree), and benchmark logistic regression). To achieve this, a gully-erosion inventory was created for the study area, the Chah Mousi watershed, Iran by combining archival records containing reports of gully erosion, remotely sensed data from Google Earth, and geolocated sites of gully head-cuts gathered in a field survey. A total of 119 gully head-cuts were identified and mapped. To train the models' analysis and prediction capabilities, 83 head-cuts (70% of the total) and the corresponding measures of the conditioning factors were input into each model. The results from the models were validated using the data pertaining to the remaining 36 gully locations (30%). Next, the frequency ratio is used to identify which conditioning-factor classes have the strongest correlation with gully erosion. Using random-forest modeling, the relative importance of each of the conditioning factors was determined. Based on the random-forest results, the top eight factors in this study area are distance-to-road, drainage density, distance-to-stream, LU/LC, annual precipitation, topographic wetness index, NDVI, and elevation. Finally, based on goodness-of-fit and AUROC of the success rate curve (SRC) and prediction rate curve (PRC), the results indicate that the bagging-ADTree ensemble model had the best performance, with SRC (0.964) and PRC (0.978). RF-ADTree (SRC = 0.952 and PRC = 0.971), ADTree (SRC = 0.926 and PRC = 0.965), and LR (SRC = 0.867 and PRC = 0.870) were the subsequent best performers. The results also indicate that bagging and RF, as meta-classifiers, improved the performance of the ADTree model as a base classifier. The bagging-ADTree model's results indicate that 24.28% of the study area is classified as having high and very high susceptibility to gully erosion. The new ensemble model accurately identified the areas that are susceptible to gully erosion based on the past patterns of formation, but it also provides highly accurate predictions of future gully development. The novel ensemble method introduced in this research is recommended for use to evaluate the patterns of gullying in arid and semi-arid environments and can effectively identify the most salient conditioning factors that promote the development and expansion of gullies in erosion-susceptible environments. 2019 by the authors.