Abstract: The phylogenetic approach in biodiversity science is an important and growing set of tools used for a wide range of questions and applications from conservation assessment to testing patterns in community assembly. Yet, with the dawn of the big data age, the size and scope of datasets in biology are becom ing ever larger thanks to revolutions in information technology, increased data sharing, ‘big data’ initiatives and online repositories. Although such sources promise biodiversity analyses at an unprecedentedly large scale, there is no clear solution on how to similarly upscale the phylogenetic approach. Here I propose, implement and use a solution that I term Mass Phylogeny Estima tion (MPE). MPE is an automated programmatic pipeline for the generation of phylogenies from lists of taxon names. It can estimate custom phylogenies for sets of species derived from big data sources. Here I test and demonstrate its usefulness by generating phylogenies for a pan-global dataset of multiple communities - from many different taxa – sampled at sites along an urban gra dient. I then perform phylogenetic analyses using the resultant phylogenetic trees and the ecological data to test for phylogenetically non-random responses to urbanisation. I find a non-significant decrease in phylogenetic diversity with increasing urbanisation, and examples of non-random phylogenetic assortment. Although, MPE is more effective than searching for published phylogenies, it still requires improving in order to produce phylogenies of sufficient size to provide good statistical power to detect non-random patterns.
|Years||2012 - 2013|
|Affiliations||Imperial College London (Silwood Campus)|