Floods, as a catastrophic phenomenon, have a profound impact on ecosystems and human life. Modeling flood susceptibility in watersheds and reducing the damages caused by flooding is an important ...component of environmental and water management. The current study employs two new algorithms for the first time in flood susceptibility analysis, namely multivariate discriminant analysis (MDA), and classification and regression trees (CART), incorporated with a widely used algorithm, the support vector machine (SVM), to create a flood susceptibility map using an ensemble modeling approach. A flood susceptibility map was developed using these models along with a flood inventory map and flood conditioning factors (including altitude, slope, aspect, curvature, distance from river, topographic wetness index, drainage density, soil depth, soil hydrological groups, land use, and lithology). The case study area was the Khiyav-Chai watershed in Iran. To ensure a more accurate ensemble model, this study proposed a framework for flood susceptibility assessment where only those models with an accuracy of >80% were permissible for use in ensemble modeling. The relative importance of factors was determined using the Jackknife test. Results indicated that the MDA model had the highest predictive accuracy (89%), followed by the SVM (88%) and CART (0.83%) models. Sensitivity analysis showed that slope percent, drainage density, and distance from river were the most important factors in flood susceptibility mapping. The ensemble modeling approach indicated that residential areas at the outlet of the watershed were very susceptible to flooding, and that these areas should, therefore, be prioritized for the prevention and remediation of floods.
Display omitted
•Ensemble machine learning (ML) predicting flood susceptibility•Contribution of models with accuracy values above 80% in ensembling process•Area under curve (AUC) for the models ranges from 0.83 to 0.89.•ML allows quick priority of prone areas for the remediation of floods.
•The GARP and QUEST machine learning techniques were used to assess flood risks.•The models provide a fast method to assess flood risks.•Distance to channel and population density were major factors ...in flood risk.
Flood risk mapping and modeling is important to prevent urban flood damage. In this study, a flood risk map was produced with limited hydrological and hydraulic data using two state-of-the-art machine learning models: Genetic Algorithm Rule-Set Production (GARP) and Quick Unbiased Efficient Statistical Tree (QUEST). The flood conditioning factors used in modeling were: precipitation, slope, curve number, distance to river, distance to channel, depth to groundwater, land use, and elevation. Based on available reports and field surveys for Sari city (Iran), 113 points were identified as flooded areas (with each flooded zone assigned a value of 1). Different conditioning factors, including urban density, quality of buildings, age of buildings, population density, and socio-economic conditions, were taken into account to analyze flood vulnerability. In addition, the weight of these conditioning factors was determined based on expert knowledge and Fuzzy Analytical Network Process (FANP). An urban flood risk map was then produced using flood hazard and flood vulnerability maps. The area under the receiver-operator characteristic curve (AUC-ROC) and Kappa statistic were applied to evaluate model performance. The results demonstrated that the GARP model (AUC-ROC = 93.5%, Kappa = 0.86) had higher performance accuracy than the QUEST model (AUC-ROC = 89.2%, Kappa = 0.79). The results also indicated that distance to channel, land use, and elevation played major roles in flood hazard determination, whereas population density, quality of buildings, and urban density were the most important factors in terms of vulnerability. These findings demonstrate that machine learning models can help in flood risk mapping, especially in areas where detailed hydraulic and hydrological data are not available.
This study aimed to develop a novel framework for risk assessment of nitrate groundwater contamination by integrating chemical and statistical analysis for an arid region. A standard method was ...applied for assessing the vulnerability of groundwater to nitrate pollution in Lenjanat plain, Iran. Nitrate concentration were collected from 102 wells of the plain and used to provide pollution occurrence and probability maps. Three machine learning models including boosted regression trees (BRT), multivariate discriminant analysis (MDA), and support vector machine (SVM) were used for the probability of groundwater pollution occurrence. Afterwards, an ensemble modeling approach was applied for production of the groundwater pollution occurrence probability map. Validation of the models was carried out using area under the receiver operating characteristic curve method (AUC); values above 80% were selected to contribute in ensembling process. Results indicated that accuracy for the three models ranged from 0.81 to 0.87, therefore all models were considered for ensemble modeling process. The resultant groundwater pollution risk (produced by vulnerability, pollution, and probability maps) indicated that the central regions of the plain have high and very high risk of nitrate pollution further confirmed by the exiting landuse map. The findings may provide very helpful information in decision making for groundwater pollution risk management especially in semi-arid regions.
Display omitted
•Novel risk assessment framework for nitrate groundwater contamination in arid regions.•Machine learning (ML) predicting vulnerability, pollution, and occurrence probability.•ML allows quick regional evaluation of risk posed by nitrate in groundwater.
Among natural disasters, flood is increasingly recognized as a serious worldwide concern that causes the most damages in parts of agriculture, fishery, housing, and infrastructure and strongly ...affects economic and social activities. Universally, there is a requirement to increase our conception of flood vulnerability and to outstretch methods and tools to assess it. Spatial analysis of flood vulnerability is part of non-structural measures to prevent and reduce flood destructive effects. Hence, the current study proposes a methodology for assessing the flood vulnerability in the area of watershed in a severely flooded area of Iran (i.e., Kashkan Watershed). First interdependency analysis among criteria (including population density (PD), livestock density (LD), percentage of farmers and ranchers (PFR), distance to industrial and mining areas (DTIM), distance to tourist and cultural heritage areas (DTTCH), land use, distance to residential areas (DTRe), distance to road (DTR), and distance to stream (DTS)) was conducted using the decision-making trial and evaluation laboratory (DEMATEL) method. Hence, the cause and effect factors and their interaction levels in the whole network were investigated. Then, using the interdependency relationships among criteria, a network structure from flood vulnerability factors to determine their importance of factors was constructed, and the analytical network process (ANP) was applied. Finally, with the aim to overcome ambiguity, reduce uncertainty, and keep the data variability, an appropriate fuzzy membership function was applied to each layer by analyzing the relationship of each layer with flood vulnerability. Importance analysis indicated that land use (0.197), DTS (0.181), PD (0.180), DTRe (0.140), and DTR (0.138) were the most important variables. The flood vulnerability map produced by the integrated method of DEMATEL-ANP-fuzzy showed that about 19.2% of the region has a high to very high flood vulnerability.
Due to the rapidly increasing demand for groundwater, as one of the principal freshwater resources, there is an urge to advance novel prediction systems to more accurately estimate the groundwater ...potential for an informed groundwater resource management. Ensemble machine learning methods are generally reported to produce more accurate results. However, proposing the novel ensemble models along with running comparative studies for performance evaluation of these models would be equally essential to precisely identify the suitable methods. Thus, the current study is designed to provide knowledge on the performance of the four ensemble models i.e., Boosted generalized additive model (GamBoost), adaptive Boosting classification trees (AdaBoost), Bagged classification and regression trees (Bagged CART), and random forest (RF). To build the models, 339 groundwater resources’ locations and the spatial groundwater potential conditioning factors were used. Thereafter, the recursive feature elimination (RFE) method was applied to identify the key features. The RFE specified that the best number of features for groundwater potential modeling was 12 variables among 15 (with a mean Accuracy of about 0.84). The modeling results indicated that the Bagging models (i.e., RF and Bagged CART) had a higher performance than the Boosting models (i.e., AdaBoost and GamBoost). Overall, the RF model outperformed the other models (with accuracy = 0.86, Kappa = 0.67, Precision = 0.85, and Recall = 0.91). Also, the topographic position index’s predictive variables, valley depth, drainage density, elevation, and distance from stream had the highest contribution in the modeling process. Groundwater potential maps predicted in this study can help water resources managers and policymakers in the fields of watershed and aquifer management to preserve an optimal exploit from this important freshwater.
Abstract
Snow avalanche is among the most harmful natural hazards with major socioeconomic and environmental destruction in the cold and mountainous regions. The devastating propagation and ...accumulation of the snow avalanche debris and mass wasting of surface rocks and vegetation particles threaten human life, transportation networks, built environments, ecosystems, and water resources. Susceptibility assessment of snow avalanche hazardous areas is of utmost importance for mitigation and development of land-use policies. This research evaluates the performance of the well-known machine learning methods, i.e., generalized additive model (GAM), multivariate adaptive regression spline (MARS), boosted regression trees (BRT), and support vector machine (SVM), in modeling the mass wasting hazard induced by snow avalanches. The key features are identified by the recursive feature elimination (RFE) method and used for the model calibration. The results indicated a good performance of the modeling process (Accuracy > 0.88, Kappa > 0.76, Precision > 0.84, Recall > 0.86, and AUC > 0.89), which the SVM model highlighted superior performance than others. Sensitivity analysis demonstrated that the topographic position index (TPI) and distance to stream (DTS) were the most important variables which had more contribution in producing the susceptibility maps.
This paper proposes a regionalization method for streamflow prediction in ungauged watersheds in the 7461 km
area above the Gharehsoo Hydrometry Station in the Ardabil Province, in the north of Iran. ...First, the Fuzzy c-means clustering method (FCM) was used to divide 46 gauged (19) and ungauged (27) watersheds into homogenous groups based on a variety of topographical and climatic factors. After identifying the homogenous watersheds, the Soil and Water Assessment Tool (SWAT) was calibrated and validated using data from the gauged watersheds in each group. The calibrated parameters were then tested in another gauged watershed that we considered as a pseudo ungauged watershed in each group. Values of R-Squared and Nash-Sutcliffe efficiency (NSE) were both ≥ 0.70 during the calibration and validation phases; and ≥ 0.80 and ≥ 0.74, respectively, during the testing in the pseudo ungauged watersheds. Based on these metrics, the validated regional models demonstrated a satisfactory result for predicting streamflow in the ungauged watersheds within each group. These models are important for managing stream quantity and quality in the intensive agriculture study area.
Urban flood-risk mapping is an important tool for the mitigation of flooding in view of continuing urbanization and climate change. However, many developing countries lack sufficiently detailed data ...to produce reliable risk maps with existing methods. Thus, improved methods are needed that can help managers and decision makers to combine existing data with more soft semi-subjective data, such as citizen observations of flood-prone and vulnerable areas in view of existing settlements. Thus, we present an innovative approach using the semi-subjective Analytic Hierarchy Process (AHP), which integrates both subjective and objective assessments, to help organize the problem framework. This approach involves measuring the consistency of decision makers’ judgments, generating pairwise comparisons for choosing a solution, and considering criteria and sub-criteria to evaluate possible options. An urban flood-risk map was created according to the vulnerabilities and hazards of different urban areas using classification and regression-tree models, and the map can serve both as a first stage in advancing flood-risk mitigation approaches and in allocating warning and forecasting systems. The findings show that machine-learning methods are efficient in urban flood zoning. Using the city Rasht in Iran, it is shown that distance to rivers, urban drainage density, and distance to vulnerable areas are the most significant parameters that influence flood hazards. Similarly, for urban flood vulnerability, population density, land use, dwelling quality, household income, distance to cultural heritage, and distance to medical centers and hospitals are the most important factors. The integrated technique for both objective and semi-subjective data as outlined in the present study shows credible results that can be obtained without complicated modeling and costly field surveys. The proposed method is especially helpful in areas with little data to describe and display flood hazards to managers and decision makers.
In recent years, there has been an increasing interest in spatial modeling, and flood hazard prediction is a major area of interest within the field of hydrology. It is necessary to consider return ...periods for identifying the flood hazard zones. In hydraulic modeling such as HEC-RAS, this is usually done, but in spatial modeling by machine learning (ML) models, this has not been taken into account so far. This study seeks to obtain data that will help to address this research gap. The Sentinel-1 Radar images have been used for identifying the flooded locations in different return periods. An embedded feature selection algorithm (i.e., recursive feature elimination random forest; RFE-RF) was used in the current research for key feature selection. Then, three ML models of neural networks using model averaging, classification and regression tree, and support vector machine were employed. The flood hazard prediction demonstrated a great performance for all the applied models (i.e., accuracy and precision > 90%, Kappa > 88%). Sensitivity analysis disclosed that the variables of elevation and distance from stream are in the first importance order, the variables of precipitation, slope, and land use are in the second importance order, and other variables are in the third importance order in all return periods. The modeling results indicated that among man-made land uses the irrigated area between 17.7 and 31.4%, dry farming from 0.5 to 2.4%, and residential areas between 8.3 and 25.1% are exposed to high and very high flood hazard areas. The current findings add to a growing body of literature on the spatial modeling of floods.
Increasing groundwater salinity has recently raised severe environmental and health concerns around the world. Advancement of the novel methods for spatial salinity modeling and prediction would be ...essential for effective management of the resources and planning mitigation policies. The current research presents the application of machine learning (ML) models in groundwater salinity mapping based on the dichotomous predictions. The groundwater salinity is predicted using the essential factors (i.e., identified by the simulated annealing feature selection methodology) through k-fold cross-validation methodology. Six ML models, namely, flexible discriminant analysis (FDA), mixture discriminant analysis (MAD), boosted regression tree (BRT), multivariate adaptive regression spline (MARS), random forest (RF), support vector machine (SVM), were employed to groundwater salinity mapping. The results of the modeling indicated that the SVM model had superior performance than other models. Variables of soil order, groundwater withdrawal, precipitation, land use, and elevation had the most contribute to groundwater salinity mapping. Results highlighted that the southern parts of the region and some parts in the north, northeast, and west have a high groundwater salinity, in which these areas are mostly matched with soil order of Entisols, bareland areas, and low elevations.