Tuesday, November 06, 2012

Estimating Divergence Times in Large Molecular Phylogenies

Authors:


1. Koichiro Tamura (a)
2. Fabia Ursula Battistuzzi (b,c)
3. Paul Billing-Ross (b)
4. Oscar Murillo (b)
5. Alan Filipski (b)
6. Sudhir Kumar (b,d,*)

Author Affiliations:

a. Department of Biological Sciences, Tokyo Metropolitan University, Tokyo 192-0397, Japan

b. Center for Evolutionary Medicine and Informatics, Biodesign Institute, Arizona State University, Tempe, AZ 85287-5301

c. Department of Biological Sciences, Oakland University, Rochester, MI 48309

d. School of Life Sciences, Arizona State University, Tempe, AZ 85287-4501

*. To whom correspondence should be addressed. E-mail: s.kumar@asu.edu.

Abstract:

Molecular dating of species divergences has become an important means to add a temporal dimension to the Tree of Life. Increasingly large datasets encompassing greater taxonomic diversity are becoming available to generate molecular timetrees by using sophisticated methods that model rate variation among lineages. However, the practical application of these methods is challenging because of the exorbitant calculation times they require for contemporary data sizes, the difficulty in correctly modeling the rate heterogeneity in highly diverse taxonomic groups, and the lack of reliable clock calibrations and their uncertainty distributions for most groups of species. Here, we present a method that estimates relative times of divergence for all branching points (nodes) in very large phylogenetic trees without assuming a specific model for lineage rate variation or specifying any clock calibrations. The method (RelTime) performed better than existing methods when applied to very large computer-simulated datasets in which evolutionary rates were varied extensively among lineages by following autocorrelated and uncorrelated models. On average, RelTime completed calculations 1,000 times faster than the fastest Bayesian method, with an even greater speed difference for larger numbers of sequences. This speed and accuracy will enable molecular dating analysis of very large datasets. Relative time estimates will be useful for determining the relative ordering and spacing of speciation events, identifying lineages with significantly slower or faster evolutionary rates, diagnosing the effect of selected calibrations on absolute divergence times, and estimating absolute times of divergence when highly reliable calibration points are available.
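
The last use mentioned in the abstract, turning relative times into absolute times once a reliable calibration is available, can be illustrated with a minimal sketch. It assumes simple proportional scaling from a single calibrated node, which is an illustration only and not necessarily the procedure used by RelTime. The function name `scale_relative_times`, the node labels, and all numbers are hypothetical.

```python
def scale_relative_times(relative_times, calibration_node, calibration_age):
    """Rescale relative node times so the calibrated node receives its known age.

    Assumption: one calibration and purely proportional scaling. This is an
    illustrative sketch, not the RelTime algorithm itself.
    """
    ref = relative_times[calibration_node]
    if ref <= 0:
        raise ValueError("calibration node must have a positive relative time")
    factor = calibration_age / ref
    # Every node is multiplied by the same factor, so the relative ordering
    # and spacing of the nodes are preserved.
    return {node: t * factor for node, t in relative_times.items()}


# Hypothetical relative times (root scaled to 1.0) for a small example tree.
relative = {"root": 1.0, "A": 0.5, "B": 0.25, "C": 0.1}

# Hypothetical calibration: node A is dated to 50 million years ago.
absolute = scale_relative_times(relative, "A", 50.0)
```

Because all nodes share a single scaling factor, changing the calibration changes the absolute ages but not the ratios between them, which is why relative times can help diagnose how a chosen calibration affects absolute divergence times.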
