AUTOMATED LEAF ANALYSIS WITH DEEP LEARNING AND ITS POTENTIAL FOR THE FOSSIL RECORD

NGUYEN, Thao¹, EBERHARDT, Sven¹, WILF, Peter², WING, Scott L.³ and SERRE, Thomas¹, (1)Dept. of Cognitive, Linguistic and Psychological Sciences, Brown University, Providence, RI 02912, (2)Dept. of Geosciences, Pennsylvania State Univ, University Park, PA 16802, (3)Department of Paleobiology, Smithsonian Institution, P.O. Box 37012 MRC 121, Washington, DC 20013, Thomas_Serre@brown.edu

Leaves are the most conspicuous, abundant, and frequently fossilized plant organs, but until now, the vast quantity of evolutionary information encoded in their complex, variable shapes and venation patterns has been largely inaccessible. Machine vision offers opportunities to analyze large numbers of specimens, to discover novel leaf features of angiosperm clades that may have phylogenetic significance, and to use those characters to classify unknowns. There is enormous potential for machine learning to guide the identification and evolutionary analysis of fossil leaves. Here, we leverage recent developments in deep learning, a branch of machine learning that is currently revolutionizing artificial intelligence. Deep learning aims to model high-level visual abstractions by training a deep neural network to classify images. The algorithm learns these representations by composing a hierarchy of simple but non-linear modules: starting from pixel intensities, each successive transformation yields an increasingly abstract visual representation. To train and test the algorithm, we have assembled a large image collection of over 25,000 cleared and x-rayed leaves. Significantly, no manual preparation of the images is necessary. We report initial results with a deep learning network demonstrating accurate categorization of thousands of angiosperm leaf images into natural botanical groups (APG IV orders and families), far outperforming an earlier computer vision approach (Wilf et al., "Computer vision cracks the leaf code," PNAS, 2016). We further explore methods for feature visualization to gain a deeper understanding of the wealth of novel botanical characters the network uses to categorize leaves. Last, we report promising initial results towards the automated analysis of fossil leaves.
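
The idea of a hierarchy of simple non-linear modules can be illustrated with a minimal sketch. It is not the network reported in this abstract: the layer sizes, the random untrained weights, the 16-value "image," and the three output classes are all assumptions chosen only for illustration. Each stage applies a weighted sum followed by a simple non-linearity to the previous stage's output, so the representation moves step by step from pixel intensities to class probabilities.

```python
import math
import random


def relu(values):
    """Simple non-linearity: negative responses are set to zero."""
    return [max(0.0, v) for v in values]


def affine(weights, biases, inputs):
    """Weighted sum of the inputs plus a bias, one output per weight row."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def softmax(scores):
    """Turn raw class scores into probabilities that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def random_layer(rng, n_in, n_out):
    """Hypothetical untrained module: random weights, zero biases."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def forward(layers, pixels):
    """Apply each module in turn and keep every intermediate representation."""
    activations = [list(pixels)]
    for index, (weights, biases) in enumerate(layers):
        out = affine(weights, biases, activations[-1])
        is_last = index == len(layers) - 1
        activations.append(softmax(out) if is_last else relu(out))
    return activations


rng = random.Random(0)
# Assumed sizes: 16 pixel intensities -> 8 -> 4 features -> 3 illustrative classes.
sizes = [16, 8, 4, 3]
layers = [random_layer(rng, a, b) for a, b in zip(sizes, sizes[1:])]
pixels = [rng.random() for _ in range(sizes[0])]

activations = forward(layers, pixels)
probabilities = activations[-1]
print([len(stage) for stage in activations])
```

Because the weights here are random, the output probabilities carry no botanical meaning. In a trained network, the weights would instead be fitted to labeled leaf images so that the final stage separates the target groups.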

Session No. 218

T49. Geobiology of Earth-Life Systems II

Tuesday, 27 September 2016: 1:30 PM-5:30 PM

Room 503 (Colorado Convention Center)

Geological Society of America Abstracts with Programs. Vol. 48, No. 7
doi: 10.1130/abs/2016AM-284813

© Copyright 2016 The Geological Society of America (GSA), all rights reserved. Permission is hereby granted to the author(s) of this abstract to reproduce and distribute it freely, for noncommercial purposes. Permission is hereby granted to any individual scientist to download a single copy of this electronic file and reproduce up to 20 paper copies for noncommercial purposes advancing science and education, including classroom use, providing all reproductions include the complete content shown here, including the author information. All other forms of reproduction and/or transmittal are prohibited without written permission from GSA Copyright Permissions.


GSA Annual Meeting in Denver, Colorado, USA - 2016
