PREDICTING MODERN SEDIMENT COMPOSITION – MACHINE LEARNING APPLIED TO A GLOBAL PETROGRAPHIC DATABASE

Johnson, Isaac

Paper No. 16-3

Presentation Time: 8:45 AM

JOHNSON, Isaac¹, SHARMAN, Glenn R.¹, SZYMANSKI, Eugene² and HUANG, Xiao¹, (1)Geosciences, University of Arkansas, 340 N. Campus Dr., 216 Gearhart Hall, Fayetteville, AR 72701, (2)Utah Geological Survey, 1594 West North Temple, Suite 3110, Salt Lake City, UT 84116

Sandstone petrography has long been used as a tool to infer sedimentary provenance. In the latter half of the 20th century, sandstone petrographers including William R. Dickinson and Paul E. Potter made significant advances in relating the relative abundance of framework grains in sedimentary deposits to tectonic setting. Grain proportions may also reveal details about the boundary conditions of the systems in which sediments are formed, including source terrane lithology, climate, and transport distance. This research seeks to answer the following questions: (1) can the final modal composition of sand be predicted if boundary conditions are known; and (2) can sand modal composition be used to determine the relative control of the environmental factors that generate sediments? We investigate these questions by analyzing Pleistocene-to-modern samples for which provenance and boundary conditions are known with certainty, using existing data from published studies and new data from marine sand samples across the globe. Moreover, this research will reassess the usefulness of point count data for predicting sedimentary provenance, using Python data analysis libraries to better understand how Earth-surface processes are manifested in the global sedimentary archive.

To date, we have compiled point count data from 3,026 sand samples and 48 published sources. Of these data, we used a subset of 1,554 fluvial samples to train a Random Forest Regressor from Python’s scikit-learn library. Numerical data were collected for each fluvial sample’s catchment, including precipitation, temperature, relief, slope, area, erosion rate and source rock proportions; these data comprise the Random Forest independent variables. Preliminary results reveal a positive correlation between predicted and observed composition in the test dataset, with an R² score of 0.719. Permutation feature importance was calculated for each independent variable and shows that average basin slope is the most important predictor, at a mean importance of 55.0%, followed by basin area and average basin temperature at 26.3% and 17.9%, respectively. Future research will incorporate data from marine, lacustrine and littoral depositional environments to improve the predictive capabilities of the Random Forest.
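
The workflow described above (train a Random Forest Regressor on catchment variables, score it on held-out samples, then rank the variables by permutation importance) can be sketched roughly as follows. This is a minimal illustration and not the authors' code: the data are synthetic, the feature names are hypothetical stand-ins for a subset of the catchment variables listed above, the quartz/feldspar/lithic target is an assumed form of modal composition, and rescaling the importances to percentages is only one possible way such figures could be obtained.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n_samples = 600

# Hypothetical catchment variables, each rescaled to [0, 1] (synthetic values).
feature_names = ["precipitation", "temperature", "relief",
                 "slope", "area", "erosion_rate"]
X = rng.uniform(0.0, 1.0, size=(n_samples, len(feature_names)))

# Assumed synthetic relationship: lithic content rises with slope and
# feldspar content rises with temperature; quartz takes the remainder so
# that the three fractions sum to 1 for every sample.
slope = X[:, feature_names.index("slope")]
temperature = X[:, feature_names.index("temperature")]
lithics = 0.1 + 0.5 * slope + rng.normal(0.0, 0.02, n_samples)
feldspar = 0.1 + 0.2 * temperature + rng.normal(0.0, 0.02, n_samples)
quartz = 1.0 - lithics - feldspar
y = np.column_stack([quartz, feldspar, lithics])

# Hold out a test set, then fit a multi-output Random Forest.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)
model = RandomForestRegressor(n_estimators=200, random_state=0)
model.fit(X_train, y_train)

# R² on the held-out samples, averaged uniformly over the three outputs.
r2 = model.score(X_test, y_test)

# Permutation importance: the drop in test R² when one feature is shuffled.
result = permutation_importance(
    model, X_test, y_test, n_repeats=10, random_state=0
)

# Assumption: clip negative values and rescale so the importances sum to 100%.
raw = np.clip(result.importances_mean, 0.0, None)
percent = 100.0 * raw / raw.sum()
ranking = sorted(zip(feature_names, percent),
                 key=lambda pair: pair[1], reverse=True)

print(f"Test R2: {r2:.3f}")
for name, pct in ranking:
    print(f"{name:>14s}: {pct:5.1f}%")
```

Because the synthetic target is driven mostly by slope, that variable ranks first here by construction; with real point count data the ranking would depend on the compiled samples and on how the composition is encoded.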

Session No. 16

T9. Unravelling Sedimentary Basins II: A Session in Memory of Paul E. Potter

Tuesday, 20 April 2021: 8:00 AM-12:00 PM

UALR Earth Science Room (Online)

Geological Society of America Abstracts with Programs. Vol. 53, No. 3
doi: 10.1130/abs/2021NC-362558

© Copyright 2021 The Geological Society of America (GSA), all rights reserved. Permission is hereby granted to the author(s) of this abstract to reproduce and distribute it freely, for noncommercial purposes. Permission is hereby granted to any individual scientist to download a single copy of this electronic file and reproduce up to 20 paper copies for noncommercial purposes advancing science and education, including classroom use, providing all reproductions include the complete content shown here, including the author information. All other forms of reproduction and/or transmittal are prohibited without written permission from GSA Copyright Permissions.

Joint 55th Annual North-Central / 55th Annual South-Central Section Meeting - 2021