ADAPTING SUPERVISED MACHINE LEARNING APPROACHES FOR HYDROTHERMAL RESOURCE ASSESSMENTS

Mordensky, Stanley

Paper No. 187-5

Presentation Time: 2:35 PM

ADAPTING SUPERVISED MACHINE LEARNING APPROACHES FOR HYDROTHERMAL RESOURCE ASSESSMENTS

MORDENSKY, Stanley¹, BURNS, Erick¹, LIPOR, John J.², DEANGELO, Jacob³ and CARACCIOLI, Pascal⁴, (1)U.S. Geological Survey, Geology, Minerals, Energy, and Geophysics Science Center, Portland, OR 97201, (2)Electrical & Computer Engineering, Portland State University, Portland, OR 97201, (3)U.S. Geological Survey, Geology, Minerals, Energy, and Geophysics Science Center, Moffett Field, CA 94025, (4)U.S. Geological Survey / Portland State University, Portland, OR 97201

The inherent mismatch of the data requirements of machine learning (intrinsic to the mathematical strategies employed) and the inherent qualities of natural resource data (e.g., sample bias, low number of samples, high correlation of input data) are challenges during the application of supervised machine learning for the development of natural resource assessments. Herein, we demonstrate how to address problems presented by: positive-unlabeled classifications (i.e., knowing only where some hydrothermal systems [positives] are located, and no locations authoritatively classified as having no hydrothermal convection) by recognizing that most locations are negatives and that a statistically small number of true but unlabeled positives will be mislabeled as negative during the analyses; class imbalance (i.e., that hydrothermal systems [positives] are inherently sparse compared with the total area that contains no hydrothermal systems [negatives]) by training and testing using the expected natural ratio of the classes; having few labeled examples (i.e., that there are only dozens or hundreds of known hydrothermal systems for a region) by selecting appropriate supervised machine learning algorithms and using informative features; and the influence of correlated but not causative features (e.g., regional trends in elevation when positives are common in only a few regions) by ensuring that the sampling of negative sites appropriately samples the range of feature values. We find that, after properly addressing the intrinsic characteristics of machine learning strategies, data-driven approaches can account for the unique qualities of natural resource data to improve upon hydrothermal resource assessments by reducing potential bias from expert decisions.

Session No. 187

T3. The Changing Landscape of Energy Geology I

Tuesday, 17 October 2023: 1:30 PM-5:30 PM

320 (David L Lawrence Convention Center)

Geological Society of America Abstracts with Programs. Vol. 55, No. 6
doi: 10.1130/abs/2023AM-391076

© Copyright 2023 The Geological Society of America (GSA), all rights reserved. Permission is hereby granted to the author(s) of this abstract to reproduce and distribute it freely, for noncommercial purposes. Permission is hereby granted to any individual scientist to download a single copy of this electronic file and reproduce up to 20 paper copies for noncommercial purposes advancing science and education, including classroom use, providing all reproductions include the complete content shown here, including the author information. All other forms of reproduction and/or transmittal are prohibited without written permission from GSA Copyright Permissions.

Back to: T3. The Changing Landscape of Energy Geology I

<< Previous Abstract | Next Abstract >>

GSA Connects 2023 Meeting in Pittsburgh, Pennsylvania

ADAPTING SUPERVISED MACHINE LEARNING APPROACHES FOR HYDROTHERMAL RESOURCE ASSESSMENTS