INTRODUCING EXPLORATORY DATA ANALYSIS TECHNIQUES IN INTRODUCTORY PALEONTOLOGY: DIVERSITY INDICES, SIMILARITY COEFFICIENTS, AND CLUSTER ANALYSIS
The data set originates from the Lower Pennsylvanian (Langsettian) Mary Lee Coal zone in the Black Warrior Basin, Alabama, and provides a temporally and spatially constrained perspective on a Pennsylvanian peat-mire ecosystem at a community scale. Rapid coseismic subsidence resulted in estuarine (tidalite) deposition that buried the Blue Creek forest in growth position, preserving an autochthonous leaf litter associated and erect, siltstone-cast trunks. Strip mining exposed this horizon that was traced and assessed over 0.5 km2 area, from which 47 form taxa were recognized from seventeen 10 m2 quadrats.
Principles behind the Shannon-Weiner, Simpsons, and Marglef diversity measures are introduced, and students are asked to calculate each indice. Subsequently, they compare the metrics against the original data to determine which factors control the calculated results. The Sorenson’s, Jaccard, and Simple Matching binary (presence/absence) similarity coefficients are introduced, and calculations made to understand which assemblages are most similar. Subsequently, these are used in agglomerative hierarchical methods in which R-mode clustering results in an hierarchical arrangement such that the relationships between the different groups are apparent and dendrograms constructed. These relationships then are spatially plotted providing an understanding of the mosaic pattern in the assemblage.