THE DISCRIMINATION OF MORPHOLOGICALLY SIMILAR POLLEN USING MACHINE LEARNING METHODS
White spruce (Picea glauca) occupies predominately drier upland soils while black spruce (Picea mariana) populates poorly drained lowlands. Changes in their relative abundance through the Quaternary reflect changing environmental conditions in North American forests. Given their paleoclimatic importance, there is an extensive literature on Picea morphology, but despite two decades of analysis, inconsistency in identification remained a persistent problem. However, our results using instance-based learning (n-nearest neighbor) algorithms demonstrate that consistent discrimination of black and white spruce is possible, with >85% accuracy in the identification of black and white spruce species, and ~100% accuracy in the discrimination of spruce from other saccate genera (Pinus, Abies). The algorithms used are robust to some folding, tearing, and compression of the grains, suggesting that these methods would be applicable to the identification of other palynomorphs in both Quaternary and pre-Quaternary studies. By identifying and developing shape and texture measures sensitive to more subtle differences in morphology, our research complements other research programs seeking to fully automate palynological analysis, while providing identification tools that should be of immediate utility to many palynologists.