UNDERSTANDING THE GEOCHEMISTRY OF SOUTHERN CALIFORNIA PLUTONIC ROCKS USING AUTOMATED MACHINE LEARNING

Esteban, Oscar

Paper No. 7-10

Presentation Time: 12:10 PM


ESTEBAN, Oscar¹, ALFEREZ, German¹, MARTINEZ ARDILA, Ana² and CLAUSEN, Benjamin L.³, (1)Institute of Data Science, Universidad de Montemorelos, Av Libertad 1300 Pte., Montemorelos, NL 67500, Mexico, (2)Dept of Earth and Biological Sciences, Loma Linda Univ, Loma Linda, CA 92350, (3)Dept of Earth and Biological Sciences, Loma Linda Univ, Geoscience Research Inst, Loma Linda, CA 92350

Southern California plutonic rocks found in the Transverse and Peninsular Ranges have been divided into as many as eight different groups separated by faults and distinguished by varying crustal thickness and mantle component sources. Baird et al. (1984) systematically collected about 500 samples of these rocks. Elemental and isotopic data from these samples have been displayed and analyzed using standard discrimination, bivariate, and ternary diagrams; however, displaying only two or three elements at a time does not effectively utilize the multivariate data available.

Over the past 70 years, machine learning has emerged as an option for analyzing multivariate data simultaneously and examining patterns in the geosciences. However, geologists lack a tool-supported pipeline for applying machine learning from data preparation through model evaluation. Here we present a pipeline to guide geologists in applying automated machine learning (autoML) to multivariate data. It uses an open web application developed with Python. We apply the pipeline to the approximately 500 southern California plutonic geochemistry samples using both supervised and unsupervised learning algorithms.

Five supervised learning algorithms were used to classify the samples: Decision Tree, K-Nearest Neighbors, Logistic Regression, Support Vector Machines, and Multi-Layer Perceptron. The model generated with the Decision Tree algorithm gave the best average accuracy (87%), precision (89%), and recall (89%), and it also identified the decisions made during the classification.
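As a rough illustration of the three reported metrics, the following minimal Python sketch computes accuracy together with class-averaged (macro) precision and recall from true and predicted group labels. The abstract does not state how its averages were taken, so macro-averaging over classes is an assumption here, and the function name `macro_scores` and the label lists are hypothetical, not data from the study.

```python
from collections import Counter


def macro_scores(y_true, y_pred):
    """Return (accuracy, macro precision, macro recall) for two label lists."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and equal in length")
    labels = sorted(set(y_true) | set(y_pred))
    predicted = Counter(y_pred)  # how often each class was predicted
    actual = Counter(y_true)     # how often each class truly occurs
    correct = Counter()          # correct predictions per class
    for t, p in zip(y_true, y_pred):
        if t == p:
            correct[t] += 1
    accuracy = sum(correct.values()) / len(y_true)
    # A class that is never predicted (or never occurs) contributes 0.
    precision = sum(
        correct[c] / predicted[c] if predicted[c] else 0.0 for c in labels
    ) / len(labels)
    recall = sum(
        correct[c] / actual[c] if actual[c] else 0.0 for c in labels
    ) / len(labels)
    return accuracy, precision, recall


# Hypothetical group labels, for illustration only (not data from the study).
y_true = ["A", "A", "A", "B", "B", "C", "C", "C"]
y_pred = ["A", "A", "B", "B", "B", "C", "A", "C"]
accuracy, precision, recall = macro_scores(y_true, y_pred)
```

Macro-averaging weights every group equally regardless of its sample count, which matters when some groups contain far fewer samples than others.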

Two unsupervised learning approaches were used: principal component analysis (PCA) and K-Means clustering. Up to five principal components were selected to explain 72% of the data variance. These components were input to the K-Means clustering algorithm to generate three clusters. Components and clusters may be related to: 1) mafic to felsic differentiation, with small-ionic-radius compatible elements (MgO, Co, V, Mn, and HREE) positive and large-ionic-radius incompatible elements (K₂O, Rb, and LREE) negative; 2) pressure effects and magma source depth, with Sr positive, Y negative, and the other REEs arranged between; and 3) water effects, with immobile elements (Ta, Nb) positive and mobile alkali elements (Na, K, Rb, Cs) negative, and possibly also elements enhanced in hydrothermal deposits and radioactive elements.
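The PCA-then-K-Means sequence described above can be sketched in a few lines of Python with NumPy. This is a minimal sketch under stated assumptions, not the authors' implementation: the z-score standardization, the SVD-based PCA, the plain Lloyd's-algorithm clustering, the helper names `pca` and `kmeans`, and the synthetic input table are all illustrative choices that the abstract does not specify.

```python
import numpy as np


def pca(X, n_components):
    """Project standardized data onto its leading principal components."""
    # z-score each element column (assumes no column is constant)
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    U, S, Vt = np.linalg.svd(Z, full_matrices=False)
    explained = S**2 / np.sum(S**2)  # fraction of variance per component
    loadings = Vt[:n_components]     # rows: components, columns: elements
    scores = Z @ loadings.T          # sample coordinates on the components
    return scores, loadings, explained[:n_components]


def kmeans(points, k, n_iter=100, seed=0):
    """Plain Lloyd's algorithm; returns (labels, centroids)."""
    rng = np.random.default_rng(seed)
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iter):
        # Distance from every point to every centroid, then nearest centroid
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Recompute centroids; keep the old one if a cluster went empty
        new_centroids = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids


# Synthetic stand-in for a samples-by-elements table (NOT the study's data).
rng = np.random.default_rng(42)
X = rng.normal(size=(60, 8))
scores, loadings, explained = pca(X, n_components=5)
labels, centroids = kmeans(scores, k=3)
```

In this sketch, the sign of each entry in a row of `loadings` indicates whether that element contributes positively or negatively to the component, which is the kind of positive/negative grouping the interpretation above refers to, and `explained.sum()` gives the fraction of variance retained by the selected components.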

Session No. 7

T22. Undergraduate Research I (Posters)

Wednesday, 12 May 2021: 11:25 AM-12:30 PM

Mackay School of Earth Sciences & Engineering Room (Online)

Geological Society of America Abstracts with Programs. Vol. 53, No. 4
doi: 10.1130/abs/2021CD-362505

© Copyright 2021 The Geological Society of America (GSA), all rights reserved. Permission is hereby granted to the author(s) of this abstract to reproduce and distribute it freely, for noncommercial purposes. Permission is hereby granted to any individual scientist to download a single copy of this electronic file and reproduce up to 20 paper copies for noncommercial purposes advancing science and education, including classroom use, providing all reproductions include the complete content shown here, including the author information. All other forms of reproduction and/or transmittal are prohibited without written permission from GSA Copyright Permissions.


Cordilleran Section - 117th Annual Meeting - 2021
