ESTIMATING THE NUMBER OF MORPHOLOGICAL RATE PARTITIONS IN A PHYLOGENETIC TREE

Wang, Steve

Paper No. 108-3

Presentation Time: 8:00 AM-5:30 PM

BEN, Jialun¹, POTTHOFF, Zachary K.¹, SHARMA POUDEL, Pradip¹, SINGH, Anhad², LLOYD, Graeme T.³ and WANG, Steve⁴, (1)Mathematics and Statistics, Swarthmore College, 500 College Ave, Swarthmore, PA 19081; Computer Science, Swarthmore College, 500 College Ave, Swarthmore, PA 19081, (2)Mathematics and Statistics, Swarthmore College, 500 College Ave, Swarthmore, PA 19081; Economics, Swarthmore College, 500 College Ave, Swarthmore, PA 19081, (3)Independent researcher, Amble, Northumberland NE65 0HN, United Kingdom, (4)Mathematics and Statistics, Swarthmore College, 500 College Ave, Swarthmore, PA 19081

In a phylogenetic tree, each branch can be thought of as having a true underlying rate of evolution. These rates are unknown, but they may be estimated from the observed number of character changes and the estimated duration of each branch. Given a tree with N_B branches, there may be as few as one rate of evolution (if all branches have the same rate) or as many as N_B rates (if each branch has a different rate).
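The two extremes described above can be illustrated with a minimal sketch. It assumes that a branch's rate is estimated as its observed changes divided by its duration, which is the maximum-likelihood estimate if changes are treated as Poisson counts; the abstract does not state the model, so this is an assumption. The function names and the data are hypothetical, and any scaling by the number of characters scored is ignored.

```python
def branch_rates(changes, durations):
    """Per-branch rate estimates: observed changes divided by branch duration."""
    if len(changes) != len(durations):
        raise ValueError("changes and durations must have the same length")
    return [c / t for c, t in zip(changes, durations)]


def pooled_rate(changes, durations):
    """Single shared rate for a set of branches: total changes over total duration."""
    return sum(changes) / sum(durations)


# Hypothetical example data: three branches, durations in millions of years.
changes = [2, 0, 6]
durations = [1.0, 2.0, 3.0]
per_branch = branch_rates(changes, durations)   # N_B distinct rates, one per branch
shared = pooled_rate(changes, durations)        # one rate for the whole tree
```

Every partition between these extremes pools changes and durations within each region in the same way.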

Here, our goal is to estimate the number of distinct rates of evolution in the tree when the observed data are discrete counts of character changes. First, we describe an exhaustive search algorithm that examines each possible partition of k rates assigned to contiguous non-overlapping regions of the tree, where k ranges from 1 to N_B. For each possible partition, we calculate how well it fits the observed data using AIC. The partition with the smallest AIC provides an estimate for the number of distinct rates, their magnitudes, and their corresponding regions of the tree. This exhaustive search algorithm is guaranteed to find the best-fitting partition, but it is impractically slow for trees with large numbers of tips (i.e., approximately 20 or more).
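The exhaustive search can be sketched as follows, under several assumptions that are not taken from the abstract: changes on a branch are Poisson with mean rate times duration, AIC is computed as 2k minus twice the log-likelihood with one parameter per rate, and contiguous regions are generated by marking "shift" branches, where each branch inherits the rate of its nearest shifted ancestor. This shift-based enumeration is one simple way to produce contiguous regions and may not cover every partition the authors consider. All names and the toy tree are hypothetical.

```python
import math
from itertools import combinations


def region_labels(parent, shifts):
    """Label each branch by its nearest ancestor (or itself) in `shifts`.

    `parent[b]` is the index of the branch above b, or None at the root.
    Branches with no shifted ancestor get the label None (the root regime).
    """
    labels = []
    for b in range(len(parent)):
        a = b
        while a is not None and a not in shifts:
            a = parent[a]
        labels.append(a)
    return labels


def partition_aic(changes, durations, labels):
    """AIC of a partition under the assumed Poisson model (durations > 0)."""
    groups = {}
    for i, lab in enumerate(labels):
        groups.setdefault(lab, []).append(i)
    loglik = 0.0
    for idx in groups.values():
        rate = sum(changes[i] for i in idx) / sum(durations[i] for i in idx)
        for i in idx:
            mean = rate * durations[i]
            if changes[i] > 0:          # rate > 0 whenever a count is positive
                loglik += changes[i] * math.log(mean)
            loglik -= mean + math.lgamma(changes[i] + 1)
    k = len(groups)                     # one rate parameter per region
    return 2 * k - 2 * loglik, k


def exhaustive_search(changes, durations, parent):
    """Score every set of shift branches and keep the lowest-AIC partition."""
    n = len(parent)
    best = None
    for size in range(n + 1):
        for shifts in combinations(range(n), size):
            labels = region_labels(parent, set(shifts))
            aic, k = partition_aic(changes, durations, labels)
            if best is None or aic < best[0]:
                best = (aic, k, labels)
    return best


# Toy tree: branches 0 and 1 descend from the root; 2 and 3 descend from 0.
parent = [None, None, 0, 0]
changes = [1, 20, 1, 1]
durations = [10.0, 10.0, 10.0, 10.0]
best_aic, best_k, best_labels = exhaustive_search(changes, durations, parent)
```

The loop visits 2^N_B shift sets, which is why a search of this kind becomes impractical as the tree grows.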

We next describe two fast algorithms that can be applied to large trees: a forward (splitting) algorithm that starts by assuming that all branches have the same rate and then checks whether adding rates improves the fit, and a backward (merging) algorithm that starts by assuming that each branch has a different rate and then checks whether merging contiguous branches improves the fit. We compare these fast algorithms with the exhaustive search algorithm and assess their performance on simulated datasets. Finally, we apply our methods to a dataset of lungfish fossils to better understand their evolutionary dynamics.
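A forward (splitting) search of this kind might look like the greedy sketch below. It is an illustration under assumptions, not the authors' implementation: counts are treated as Poisson, AIC is 2k minus twice the log-likelihood, and a new rate is introduced by marking one branch as a "shift" whose descendants inherit its rate. The stopping rule (stop when no single added shift lowers AIC) and all names are hypothetical.

```python
import math


def aic_for_shifts(changes, durations, parent, shifts):
    """AIC of the partition induced by `shifts` under an assumed Poisson model.

    `parent[b]` is the branch above b (None at the root). Each branch takes
    the rate of its nearest ancestor (or itself) in `shifts`. Durations > 0.
    """
    groups = {}
    for b in range(len(parent)):
        a = b
        while a is not None and a not in shifts:
            a = parent[a]
        groups.setdefault(a, []).append(b)
    loglik = 0.0
    for idx in groups.values():
        rate = sum(changes[i] for i in idx) / sum(durations[i] for i in idx)
        for i in idx:
            mean = rate * durations[i]
            if changes[i] > 0:
                loglik += changes[i] * math.log(mean)
            loglik -= mean + math.lgamma(changes[i] + 1)
    return 2 * len(groups) - 2 * loglik


def forward_search(changes, durations, parent):
    """Greedy splitting: start with one rate, add the best shift while AIC drops."""
    shifts = set()
    best_aic = aic_for_shifts(changes, durations, parent, shifts)
    while True:
        candidates = [
            (aic_for_shifts(changes, durations, parent, shifts | {b}), b)
            for b in range(len(parent)) if b not in shifts
        ]
        if not candidates:
            break
        aic, b = min(candidates)
        if aic >= best_aic:             # no single split improves the fit
            break
        shifts.add(b)
        best_aic = aic
    return shifts, best_aic


# Toy tree: branches 0 and 1 descend from the root; 2 and 3 descend from 0.
parent = [None, None, 0, 0]
changes = [1, 20, 1, 1]
durations = [10.0, 10.0, 10.0, 10.0]
found_shifts, found_aic = forward_search(changes, durations, parent)
```

Each pass scores at most N_B candidate splits, so the cost grows polynomially rather than exponentially, at the price of possibly missing the best-fitting partition. A backward (merging) search would mirror this loop, starting with every branch marked as a shift and removing shifts while AIC decreases.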

Session No. 108--Booth# 239

T77. Phylogenetic and Computational Approaches in Paleobiology and Paleoecology (Posters)

Monday, 16 October 2023: 8:00 AM-5:30 PM

Hall B (David L Lawrence Convention Center)

Geological Society of America Abstracts with Programs. Vol. 55, No. 6
doi: 10.1130/abs/2023AM-393286

© Copyright 2023 The Geological Society of America (GSA), all rights reserved. Permission is hereby granted to the author(s) of this abstract to reproduce and distribute it freely, for noncommercial purposes. Permission is hereby granted to any individual scientist to download a single copy of this electronic file and reproduce up to 20 paper copies for noncommercial purposes advancing science and education, including classroom use, providing all reproductions include the complete content shown here, including the author information. All other forms of reproduction and/or transmittal are prohibited without written permission from GSA Copyright Permissions.

GSA Connects 2023 Meeting in Pittsburgh, Pennsylvania
