EXTENDING THE REACH AND RESOLUTION OF THE PALEOBIOLOGY DATABASE WITH COMPUTATIONAL AND DATA INFRASTRUCTURES

PETERS, Shanan E.¹, SYVERSON, V.J.P.¹, ZAFFOS, Andrew¹, HUSSON, Jon¹, ROSS, Ian² and CZAPLEWSKI, John¹, (1)Department of Geoscience, University of Wisconsin-Madison, 1215 W Dayton St, Madison, WI 53706, (2)Computer Sciences, University of Wisconsin-Madison, Madison, WI 53706, peters@geology.wisc.edu

Many next-generation questions in paleobiology require integration of paleontological, biological, and geological data. Although some projects involve data generation in all of these areas, it is common to rely on data and information extracted from publications. Here we describe advances in the Paleobiology Database (PBDB) that emerge via integration with the GeoDeepDive (GDD) digital library and the Macrostrat geologic database.

Fossil occurrences within the PBDB derive from field-based observations and fossil specimens, some of which were collected, curated in museum collections, and described in publications. However, not all specimen numbers and specimen-specific data (e.g., size, morphology) are included in the PBDB. We used custom scripts and GDD to automatically locate and extract more than 133K unique numbered specimens from more than 13k papers for an initial set of 71 institutional collections. Specimen-based descriptors of morphology, taphonomy, geology, and more were then linked to these specimens and to PBDB occurrences. The publication footprint of museum collections was also computed. We also used GDD to extract potential PBDB fossil occurrences from the literature. Lithostratigraphic units described as fossil-bearing in the literature, but that are not included in the PBDB, are randomly distributed with respect to age among named rock units.

The utility of fossils often depends on the precision and accuracy of age constraints. However, fossil occurrence ages in the PBDB, and within much of the literature and museum collections, remain decoupled from data that constrain geochronology. A continuous-time age model for all sediments in Macrostrat has been generated using basic principles. GDD and cyberinfrastructure exposing results from geochronological lab facilities is being used to improve this age model. Repositioning museum fossil specimens and PBDB occurrences within a continuous-time stratigraphic age model improves the effective temporal resolution of the PBDB by up to an order of magnitude, enables time bin-free quantitative analysis of fossil data, and provides a mechanism for continual refinement of fossil ages as new geochronological measurements are made.

Session No. 99

T49. Advances in Computational Paleobiology: Reshaping Our Understanding of the Fossil Record

Monday, 23 October 2017: 8:00 AM-12:00 PM

Room 608 (Washington State Convention Center)

Geological Society of America Abstracts with Programs. Vol. 49, No. 6
doi: 10.1130/abs/2017AM-300954

© Copyright 2017 The Geological Society of America (GSA), all rights reserved. Permission is hereby granted to the author(s) of this abstract to reproduce and distribute it freely, for noncommercial purposes. Permission is hereby granted to any individual scientist to download a single copy of this electronic file and reproduce up to 20 paper copies for noncommercial purposes advancing science and education, including classroom use, providing all reproductions include the complete content shown here, including the author information. All other forms of reproduction and/or transmittal are prohibited without written permission from GSA Copyright Permissions.

Back to: T49. Advances in Computational Paleobiology: Reshaping Our Understanding of the Fossil Record

<< Previous Abstract | Next Abstract >>

GSA Annual Meeting in Seattle, Washington, USA - 2017

EXTENDING THE REACH AND RESOLUTION OF THE PALEOBIOLOGY DATABASE WITH COMPUTATIONAL AND DATA INFRASTRUCTURES