This page has only limited features, please log in for full access.
Soil erosion is a form of land degradation. It is the process of moving surface soil with the action of external forces such as wind or water. Tillage also causes soil erosion. As outlined by the United Nations Sustainable Development Goal (UN SDG) #15, it is a global challenge to “combat desertification, and halt and reverse land degradation and halt biodiversity loss.” In order to advance this goal, we studied and modeled the soil erosion depth of a typical watershed in Taiwan using 26 morphometric factors derived from a digital elevation model (DEM) and 10 environmental factors. Feature selection was performed using the Boruta algorithm to determine 15 factors with confirmed importance and one tentative factor. Then, machine learning models, including the random forest (RF) and gradient boosting machine (GBM), were used to create prediction models validated by erosion pin measurements. The results show that GBM, coupled with 15 important factors (confirmed), achieved the best result in the context of root mean square error (RMSE) and Nash–Sutcliffe efficiency (NSE). Finally, we present the maps of soil erosion depth using the two machine learning models. The maps are useful for conservation planning and mitigating future soil erosion.
Kieu Nguyen; Walter Chen. DEM- and GIS-Based Analysis of Soil Erosion Depth Using Machine Learning. ISPRS International Journal of Geo-Information 2021, 10, 452 .
AMA StyleKieu Nguyen, Walter Chen. DEM- and GIS-Based Analysis of Soil Erosion Depth Using Machine Learning. ISPRS International Journal of Geo-Information. 2021; 10 (7):452.
Chicago/Turabian StyleKieu Nguyen; Walter Chen. 2021. "DEM- and GIS-Based Analysis of Soil Erosion Depth Using Machine Learning." ISPRS International Journal of Geo-Information 10, no. 7: 452.
Although machine learning has been extensively used in various fields, it has only recently been applied to soil erosion pin modeling. To improve upon previous methods of quantifying soil erosion based on erosion pin measurements, this study explored the possible application of ensemble machine learning algorithms to the Shihmen Reservoir watershed in northern Taiwan. Three categories of ensemble methods were considered in this study: (a) Bagging, (b) boosting, and (c) stacking. The bagging method in this study refers to bagged multivariate adaptive regression splines (bagged MARS) and random forest (RF), and the boosting method includes Cubist and gradient boosting machine (GBM). Finally, the stacking method is an ensemble method that uses a meta-model to combine the predictions of base models. This study used RF and GBM as the meta-models, decision tree, linear regression, artificial neural network, and support vector machine as the base models. The dataset used in this study was sampled using stratified random sampling to achieve a 70/30 split for the training and test data, and the process was repeated three times. The performance of six ensemble methods in three categories was analyzed based on the average of three attempts. It was found that GBM performed the best among the ensemble models with the lowest root-mean-square error (RMSE = 1.72 mm/year), the highest Nash-Sutcliffe efficiency (NSE = 0.54), and the highest index of agreement (d = 0.81). This result was confirmed by the spatial comparison of the absolute differences (errors) between model predictions and observations using GBM and RF in the study area. In summary, the results show that as a group, the bagging method and the boosting method performed equally well, and the stacking method was third for the erosion pin dataset considered in this study.
Kieu Anh Nguyen; Walter Chen; Bor-Shiun Lin; Uma Seeboonruang. Comparison of Ensemble Machine Learning Methods for Soil Erosion Pin Measurements. ISPRS International Journal of Geo-Information 2021, 10, 42 .
AMA StyleKieu Anh Nguyen, Walter Chen, Bor-Shiun Lin, Uma Seeboonruang. Comparison of Ensemble Machine Learning Methods for Soil Erosion Pin Measurements. ISPRS International Journal of Geo-Information. 2021; 10 (1):42.
Chicago/Turabian StyleKieu Anh Nguyen; Walter Chen; Bor-Shiun Lin; Uma Seeboonruang. 2021. "Comparison of Ensemble Machine Learning Methods for Soil Erosion Pin Measurements." ISPRS International Journal of Geo-Information 10, no. 1: 42.
The universal soil loss equation (USLE) is a widely used empirical model for estimating soil loss. Among the USLE model factors, the cover management factor (C-factor) is a critical factor that substantially impacts the estimation result. Assigning C-factor values according to a land-use/land-cover (LULC) map from field surveys is a typical traditional approach. However, this approach may have limitations caused by the difficulty and cost in conducting field surveys and updating the LULC map regularly, thus significantly affecting the feasibility of multi-temporal analysis of soil erosion. To address this issue, this study uses data mining to build a random forest (RF) model between eight geospatial factors and the C-factor for the Shihmen Reservoir watershed in northern Taiwan for multi-temporal estimation of soil loss. The eight geospatial factors were collected or derived from remotely sensed images taken in 2004, a digital elevation model, and related digital maps. Due to the memory size limitation of the R software, only 4% of the total data points (population dataset) in each C-factor class were selected as the sample dataset (input dataset) for analysis using the stratified random sampling method. Seventy percent of the input dataset was used to train the RF model, and the other 30% was used to test the model. The results show that the RF model could capture the trend of vegetation recovery and soil loss reduction after the destructive event of Typhoon Aere in 2004 for multi-temporal analysis. Although the RF model was biased by the majority class’s large sample size (C = 0.01 class), the estimated soil erosion rate was close to the measurement obtained by the erosion pins installed in the watershed (90.6 t/ha-year). After the model’s completion, we furthered our aim to address the input dataset’s imbalanced data problem to improve the model’s classification performance. An ad-hoc down-sampling of the majority class technique was used to reduce the majority class’s sampling rate to 2%, 1%, and 0.5% while keeping the other minority classes at a 4% sample rate. The results show an improvement of the Kappa coefficient from 0.574 to 0.732, the AUC from 0.780 to 0.891, and the true positive rate of all minority classes combined from 0.43 to 0.70. However, the overall accuracy decreases from 0.952 to 0.846, and the true positive rate of the majority class declines from 0.99 to 0.94. The best average C-factor was achieved when the sampling rate of the majority class was 1%. On the other hand, the best soil erosion estimate was obtained when the sampling rate was 2%.
Fuan Tsai; Jhe-Syuan Lai; Kieu Anh Nguyen; Walter Chen. Determining Cover Management Factor with Remote Sensing and Spatial Analysis for Improving Long-Term Soil Loss Estimation in Watersheds. ISPRS International Journal of Geo-Information 2021, 10, 19 .
AMA StyleFuan Tsai, Jhe-Syuan Lai, Kieu Anh Nguyen, Walter Chen. Determining Cover Management Factor with Remote Sensing and Spatial Analysis for Improving Long-Term Soil Loss Estimation in Watersheds. ISPRS International Journal of Geo-Information. 2021; 10 (1):19.
Chicago/Turabian StyleFuan Tsai; Jhe-Syuan Lai; Kieu Anh Nguyen; Walter Chen. 2021. "Determining Cover Management Factor with Remote Sensing and Spatial Analysis for Improving Long-Term Soil Loss Estimation in Watersheds." ISPRS International Journal of Geo-Information 10, no. 1: 19.
This study continues a previous study with further analysis of watershed-scale erosion pin measurements. Three machine learning (ML) algorithms—Support Vector Machine (SVM), Adaptive Neuro-Fuzzy Inference System (ANFIS), and Artificial Neural Network (ANN)—were used to analyze depth of erosion of a watershed (Shihmen reservoir) in northern Taiwan. In addition to three previously used statistical indexes (Mean Absolute Error, Root Mean Square of Error, and R-squared), Nash–Sutcliffe Efficiency (NSE) was calculated to compare the predictive performances of the three models. To see if there was a statistical difference between the three models, the Wilcoxon signed-rank test was used. The research utilized 14 environmental attributes as the input predictors of the ML algorithms. They are distance to river, distance to road, type of slope, sub-watershed, slope direction, elevation, slope class, rainfall, epoch, lithology, and the amount of organic content, clay, sand, and silt in the soil. Additionally, measurements of a total of 550 erosion pins installed on 55 slopes were used as the target variable of the model prediction. The dataset was divided into a training set (70%) and a testing set (30%) using the stratified random sampling with sub-watershed as the stratification variable. The results showed that the ANFIS model outperforms the other two algorithms in predicting the erosion rates of the study area. The average RMSE of the test data is 2.05 mm/yr for ANFIS, compared to 2.36 mm/yr and 2.61 mm/yr for ANN and SVM, respectively. Finally, the results of this study (ANN, ANFIS, and SVM) were compared with the previous study (Random Forest, Decision Tree, and multiple regression). It was found that Random Forest remains the best predictive model, and ANFIS is the second-best among the six ML algorithms.
Kieu Anh Nguyen; Walter Chen; Bor-Shiun Lin; Uma Seeboonruang. Using Machine Learning-Based Algorithms to Analyze Erosion Rates of a Watershed in Northern Taiwan. Sustainability 2020, 12, 2022 .
AMA StyleKieu Anh Nguyen, Walter Chen, Bor-Shiun Lin, Uma Seeboonruang. Using Machine Learning-Based Algorithms to Analyze Erosion Rates of a Watershed in Northern Taiwan. Sustainability. 2020; 12 (5):2022.
Chicago/Turabian StyleKieu Anh Nguyen; Walter Chen; Bor-Shiun Lin; Uma Seeboonruang. 2020. "Using Machine Learning-Based Algorithms to Analyze Erosion Rates of a Watershed in Northern Taiwan." Sustainability 12, no. 5: 2022.
Shihmen Reservoir watershed is vital to the water supply in Northern Taiwan but the reservoir has been heavily impacted by sedimentation and soil erosion since 1964. The purpose of this study was to explore the capability of machine learning algorithms, such as decision tree and random forest, to predict soil erosion (sheet and rill erosion) depths in the Shihmen reservoir watershed. The accuracy of the models was evaluated using the RMSE (Root Mean Squared Error), MAE (Mean Absolute Error), and R2. Moreover, the models were verified against the multiple regression analysis, which is commonly used in statistical analysis. The predictors of these models were 14 environmental factors which influence soil erosion, whereas the target was 550 erosion pins installed at 55 locations (on 55 slopes) and monitored over a period of approximately three years. The data sets for the models were separated into 70% for the training data and 30% for the testing data, using the simple random sampling and stratified random sampling methods. The results show that the random forest algorithm performed the best of the three methods. Moreover, the stratified random sampling method had better results among the two sampling methods, as anticipated. The average error (RMSE relative to 1:1 line) of the stratified random sampling method of the random forest algorithm is 0.93 mm/yr in the training data and 1.75 mm/yr in the testing data, respectively. Finally, the random forest algorithm predicted that type of slope, slope direction, and sub-watershed are the three most important factors of the 14 environmental factors collected and used in this study for splits in the trees and thus they are the three most important factors affecting the depth of sheet and rill erosion in the Shihmen Reservoir watershed. The results of this study can be employed by decision-makers to improve soil conservation planning and watershed remediation.
Kieu Anh Nguyen; Walter Chen; Bor-Shiun Lin; Uma Seeboonruang; Kent Thomas. Predicting Sheet and Rill Erosion of Shihmen Reservoir Watershed in Taiwan Using Machine Learning. Sustainability 2019, 11, 3615 .
AMA StyleKieu Anh Nguyen, Walter Chen, Bor-Shiun Lin, Uma Seeboonruang, Kent Thomas. Predicting Sheet and Rill Erosion of Shihmen Reservoir Watershed in Taiwan Using Machine Learning. Sustainability. 2019; 11 (13):3615.
Chicago/Turabian StyleKieu Anh Nguyen; Walter Chen; Bor-Shiun Lin; Uma Seeboonruang; Kent Thomas. 2019. "Predicting Sheet and Rill Erosion of Shihmen Reservoir Watershed in Taiwan Using Machine Learning." Sustainability 11, no. 13: 3615.
Nowadays, the storage capacity of a reservoir reduced by sediment deposition is a concern of many countries in the world. Therefore, understanding the soil erosion and transportation process is a significant matter, which helps to manage and prevent sediments entering the reservoir. The main objective of this study is to examine the sediments reaching the outlet of a basin by empirical sediment delivery ratio (SDR) equations and the gross soil erosion. The Shihmen reservoir watershed is used as the study area. Because steep terrain is a characteristic feature of the study area, two SDR models that depend on the slope of the mainstream channel and the relief-length ratio of the watershed are chosen. It is found that the Maner (1958) model, which uses the relief-length ratio, is the better model of the two. We believe that this empirical research improves our understanding of the sediment delivery process occurring in the study area.
Kieu Anh Nguyen; Walter Chen. Estimating sediment delivery ratio by stream slope and relief ratio. MATEC Web of Conferences 2018, 192, 02040 .
AMA StyleKieu Anh Nguyen, Walter Chen. Estimating sediment delivery ratio by stream slope and relief ratio. MATEC Web of Conferences. 2018; 192 ():02040.
Chicago/Turabian StyleKieu Anh Nguyen; Walter Chen. 2018. "Estimating sediment delivery ratio by stream slope and relief ratio." MATEC Web of Conferences 192, no. : 02040.
Tropical watersheds in Taiwan and Thailand face the same severe soil erosion problem that is increasing at an alarming rate. In order to evaluate the severity of soil erosion, we quantitatively investigate the issue using a common soil erosion model (Universal Soil Loss Equation, USLE) on the Shihmen reservoir watershed of Taiwan and the Lam Phra Ploeng basin of Thailand, and compare their respective erosion factors. The results show an interesting contrast between the two watersheds. Some of the factors (rainfall factor, slope-steepness factor) are higher in the Shihmen reservoir watershed, while others (soil erodibility factor, cover and management factor) are higher in the Lam Phra Ploeng basin. The net result is that these factors cancel each other out, and the amount of soil erosion of the two watersheds are very similar at 68.03 t/ha/yr and 67.57 t/ha/yr, respectively.
Yi-Hsin Liu; Kieu Anh Nguyen; Walter Chen; Jatuwat Wattanasetpong; Uma Seeboonruang. Comparing watershed soil erosion of Taiwan and Thailand. MATEC Web of Conferences 2018, 192, 02041 .
AMA StyleYi-Hsin Liu, Kieu Anh Nguyen, Walter Chen, Jatuwat Wattanasetpong, Uma Seeboonruang. Comparing watershed soil erosion of Taiwan and Thailand. MATEC Web of Conferences. 2018; 192 ():02041.
Chicago/Turabian StyleYi-Hsin Liu; Kieu Anh Nguyen; Walter Chen; Jatuwat Wattanasetpong; Uma Seeboonruang. 2018. "Comparing watershed soil erosion of Taiwan and Thailand." MATEC Web of Conferences 192, no. : 02041.