2015 GSA Annual Meeting in Baltimore, Maryland, USA (1-4 November 2015)

Paper No. 147-12
Presentation Time: 4:30 PM

COMMUNITY-SUPPORTED DATA REPOSITORIES IN PALEOECOLOGY AND PALEOCLIMATOLOGY:  THE ‘MIDDLE TAIL’ BETWEEN GEOSCIENTIFIC USERS AND GEOINFORMATICS


WILLIAMS, John W., Department of Geography, University of Wisconsin-Madison, 550 N Park St, Madison, WI 53706, UHEN, Mark, Department of Atmospheric, Oceanic and Earth Sciences, George Mason University, MS 6E2, Fairfax, VA 22030, GORING, Simon, Department of Geography, University of Wisconsin-Madison, 550 N Park St., Madison, WI 53706, ASHWORTH, Allan C., Department of Geosciences, North Dakota State University, Fargo, ND 58108, BILLS, Brian, Center for Environmental Informatics, The Pennsylvania State University, Earth and Environmental Systems Institute, 2217 Earth-Engineering Sciences Building, University Park, PA 16802-6813, BLOIS, Jessica, School of Natural Sciences, University of California - Merced, 1200 Castle Commerce Building, #47, Merced, CA 95343, CHARLES, Don, Philadelphia, PA 19103, GRAHAM, Russell W., Geosciences, The Pennsylvania State University, 116 Deike, University Park, PA 16802, SMITH, Alison J., Geology, Kent State University, Kent, OH 44242 and GRIMM, Eric, Illinois State Museum, Springfield, IL 62703, jww@geography.wisc.edu

Data access has become the critical bottleneck to advancing scientific understanding. In the paleoclimatic and paleobiological sciences (and the geosciences in general), there is a fundamental scale gap between the broad-scale questions that motivate research and the scale of data collection, carried out by individuals or small teams.

Community-supported data repositories (CSDRs) have emerged in response to this need. Many paleobiological CSDRs were begun decades ago by individual researchers or small teams (e.g. Jack Sepkoski, CLIMAP, COHMAP) to address particular questions (e.g. the history of biodiversity on Earth, reconstructing past climatic changes) and have matured into multi-user and multi-purpose data repositories governed by teams of geoscientists and informaticists (e.g. Neotoma Paleoecology Database, the Paleobiology Database). CSDRs facilitate large-scale research by providing open-access and curated data that employ community-supported metadata and data standards. CSDRs also serve as a ‘middle tail’ or boundary organization between information scientists and individual geoscientists, passing use cases and research priorities in one direction, best practices and common protocols in the other.

Because paleoecological expertise is highly decentralized and distributed across proxy types, taxonomic groups, time periods, and regions, an array of paleobiological and paleoclimatic CSDRs has arisen, e.g. the Neotoma Paleoecology Database, Paleobiology Database, International Tree Ring Database, NOAA NCEI for Paleoclimatology, Morphobank, iDigPaleo, and Integrated Earth Data Alliance. Recently, these groups have organized into a Paleobiology Data Consortium dedicated to improving interoperability and sharing best practices and protocols.

The Neotoma Paleoecology Database offers one model of a CSDR, designed to facilitate research into ecological and evolutionary dynamics during recent past global change. Neotoma data can be searched, viewed, and returned to users through multiple interactive and programmatic interfaces, designed to span a range of user preferences. Neotoma is governed by geoscientists via multiple virtual constituent data working groups and provides community engagement through training workshops for data contributors, stewards, and users.