Northeastern Section - 59th Annual Meeting - 2024

Paper No. 36-5
Presentation Time: 9:25 AM

MACHINE LEARNING ANALYSIS OF N-ALKANES FROM WOODY AND GRASSY AFRICAN PLANTS


TWEEDY, Ruth, Earth & Planetary Science, Harvard University, Cambridge, MA 02138, SHI, Sarah C., Lamont Doherty Earth Observatory, Palisades, NY 10964-8000 and UNO, Kevin, Human Evolutionary Biology, Harvard University, 11 Divinity Ave, Cambridge, MA 02138

Differentiating past woody and grassy ecosystems insights into the timing, climate influences, and biotic mechanisms for terrestrial ecosystem change. A commonly used proxy for this is the 𝛿¹³C of n-alkanes, which relies on the fact that in modern tropical ecosystems, “woody” vegetation uses C3 photosynthesis while “grassy” vegetation uses C4 photosynthesis. However, C4 grasses only gained an abundant ecological signature on the African continent 10 million years ago. Reliance on the 𝛿¹³C proxy therefore creates a knowledge gap around early-mid Neogene ecosystem structure, impeding understanding of the cause of C4 grassland expansion, and which ecosystem classes these C4 grasslands replaced.

n-Alkane chain length distributions have been suggested as an alternative indicator of woody and grassy African plants, with studies using ensemble mean distributions of these plant types as end members for continental scale ecosystem reconstructions. However, there is significant variation in the distributions of individual plant n-alkanes. This study employs supervised machine learning to classify n-alkane distributions of modern plants, assessing the reliability of this proxy for grassy and woody plant identification.

Using a dataset of n-alkane chain lengths from ~600 modern specimens, including over 220 newly generated distributions, we employ linear (e.g., K-Nearest Neighbors) and non-linear (e.g., Neural Network) classifiers. Both achieve >80% validation accuracy in distinguishing woody and grassy plants, suggesting the reliability of n-alkane chain length distributions as indicators of woody and grassy ecosystems. Future studies applying machine learning to modern calibration soils or lake sediments which integrate plant wax n-alkanes will establish a foundation for applying this tool to the geologic record. Our method complements established isotopic practices, offering a powerful tool for reconstructing vegetation structure in pure C3 ecosystems.