GSA Connects 2024 Meeting in Anaheim, California

Paper No. 209-4
Presentation Time: 2:25 PM

PREDICTING SOIL SATURATED HYDRAULIC CONDUCTIVITY USING MACHINE LEARNING-BASED PEDOTRANSFER FUNCTIONS


MAHMUD, Md Ilias1, HOLT, Robert M.2, WODAJO, Leti T.3, HICKEY, Craig J.3 and O'REILLY, Andrew4, (1)Department of Geology and Geological Engineering, University of Mississippi, University, MS 38677; National Center for Physical Acoustics, University of Mississippi, University, MS 38677, (2)Department of Geology and Geological Engineering, University of Mississippi, University, MS 38677, (3)National Center for Physical Acoustics, University of Mississippi, Oxford, MS 38677, (4)U.S. Department of Agriculture, Agricultural Research Service, National Sedimentation Laboratory, Oxford, MS 38655

The saturated soil hydraulic conductivity (Ks) is a crucial parameter in various hydrological and climate models, affecting rainfall partitioning between runoff and infiltration. Thus, Ks is essential in many critical applications, such as optimal irrigation design, groundwater recharge estimation, and predicting natural hazards like floods and landslides. Determination of Ks from direct methods is expensive, time-consuming, and often infeasible for large-scale applications. As an alternative, pedotransfer functions (PTFs) are widely used indirect models to estimate Ks from easily retrievable soil parameters. Despite significant efforts, the performance of conventional PTFs in predicting Ks remains poor. Machine learning-based pedotransfer functions (ML-PTFs) have been found to overcome this drawback and outperform conventional PTFs in predicting Ks. The increasing accessibility of large soil databases, coupled with advances in machine learning algorithms, offers new opportunities to enhance the robustness of ML-PTFs. This study employed machine learning algorithms, such as artificial neural networks, Boosted Regression Tree (BRT), and random forests, based on over 20,000 soil samples to develop new PTFs for estimating Ks. We also evaluated the relative importance of predictor variables (soil parameters) used in developing the ML-PTFs. The PTFs implemented with decision tree-based ensemble models outperformed other ML-based models, with BRT yielding the best performance, achieving the highest R2 of 0.873 and the lowest root mean square error of 0.336 (log10(cm/h)). Clay content was identified as the most important predictor, followed by nearly equal contributions from bulk density, sand content, and organic carbon content, with silt content being the least significant contributor. The ML-PTFs developed here can be applied to the recently available high-resolution global-scale digital maps of soil properties to generate spatial distribution maps of Ks from regional to global scales.

[This work was supported by the U.S. Department of Agriculture under Non-Assistance Cooperative Agreement 58-6060-6-009. Any opinions, findings, conclusions, or recommendations expressed in this publication are those of the author(s) and do not necessarily reflect the views of the U. S. Department of Agriculture.]