Repository logo
 

Search Results

Now showing 1 - 10 of 97
  • Optimal data partitioning, multispecies coalescent and Bayesian concordance analyses resolve early divergences of the grape family (Vitaceae)
    Publication . Lu, Limin; Cox, C. J.; Mathews, Sarah; Wang, Wei; Wen, Jun; Chen, Zhiduan
    Evolutionary rate heterogeneity and rapid radiations are common phenomena in organismal evolution and represent major challenges for reconstructing deep-level phylogenies. Here we detected substantial conflicts in and among data sets as well as uncertainty concerning relationships among lineages of Vitaceae from individual gene trees, supernetworks and tree certainty values. Congruent deep-level relationships of Vitaceae were retrieved by comprehensive comparisons of results from optimal partitioning analyses, multispecies coalescent approaches and the Bayesian concordance method. We found that partitioning schemes selected by PartitionFinder were preferred over those by gene or by codon position, and the unpartitioned model usually performed the worst. For a data set with conflicting signals, however, the unpartitioned model outperformed models that included more partitions, demonstrating some limitations to the effectiveness of concatenation for these data. For a transcriptome data set, fast coalescent methods (STAR and MP-EST) and a Bayesian concordance approach yielded congruent topologies with trees from the concatenated analyses and previous studies. Our results highlight that well-resolved gene trees are critical for the effectiveness of coalescent-based methods. Future efforts to improve the accuracy of phylogenomic analyses should emphasize the development of newmethods that can accommodate multiple biological processes and tolerate missing data while remaining computationally tractable. (C) The Willi Hennig Society 2017.
  • Organellar phylogenomics of an emerging model system: Sphagnum (peatmoss)
    Publication . Shaw, A. Jonathan; Devos, Nicolas; Liu, Yang; Cox, C. J.; Goffinet, B.; Flatberg, Kjell Ivar; Shaw, Blanka
    Background and Aims Sphagnum-dominated peatlands contain approx. 30 % of the terrestrial carbon pool in the form of partially decomposed plant material (peat), and, as a consequence, Sphagnum is currently a focus of studies on biogeochemistry and control of global climate. Sphagnum species differ in ecologically important traits that scale up to impact ecosystem function, and sequencing of the genome from selected Sphagnum species is currently under-way. As an emerging model system, these resources for Sphagnum will facilitate linking nucleotide variation to plant functional traits, and through those traits to ecosystem processes. A solid phylogenetic framework for Sphagnum is crucial to comparative analyses of species-specific traits, but relationships among major clades within Sphagnum have been recalcitrant to resolution because the genus underwent a rapid radiation. Herein a well-supported hypothesis for phylogenetic relationships among major clades within Sphagnum based on organellar genome sequences (plastid, mitochondrial) is provided.Methods We obtained nucleotide sequences (273 753 nucleotides in total) from the two organellar genomes from 38 species (including three outgroups). Phylogenetic analyses were conducted using a variety of methods applied to nucleotide and amino acid sequences. The Sphagnum phylogeny was rooted with sequences from the related Sphagnopsida genera, Eosphagnum and Flatbergium.Key Results Phylogenetic analyses of the data converge on the following subgeneric relationships: (Rigida (((Subsecunda) (Cuspidata)) ((Sphagnum) (Acutifolia))). All relationships were strongly supported. Species in the two major clades (i.e. Subsecunda + Cuspidata and Sphagnum + Acutifolia), which include > 90 % of all Sphagnum species, differ in ecological niches and these differences correlate with other functional traits that impact biogeochemical cycling. Mitochondrial intron presence/absence are variable among species and genera of the Sphagnopsida. Two new nomenclatural combinations are made, in the genera Eosphagnum and Flatbergium.Conclusions Newly resolved relationships now permit phylogenetic analyses of morphological, biochemical and ecological traits among Sphagnum species. The results clarify long-standing disagreements about subgeneric relationships and intrageneric classification.
  • Peatmoss (Sphagnum) diversification associated with Miocene Northern Hemisphere climatic cooling?
    Publication . Shaw, A. J.; Devos, N.; Cox, C. J.; Boles, S. B.; Shaw, B.; Buchanan, A. M.; Cave, L.; Seppelt, R.
    Global climate changes sometimes spark biological radiations that can feed back to effect significant ecological impacts. Northern Hemisphere peatlands dominated by living and dead peatmosses (Sphagnum) harbor almost 30% of the global soil carbon pool and have functioned as a net carbon sink throughout the Holocene, and probably since the late Tertiary. Before that time, northern latitudes were dominated by tropical and temperate plant groups and ecosystems.
  • Increased diversification rates follow shifts to bisexuality in liverworts
    Publication . Laenen, Benjamin; Machac, Antonin; Gradstein, S. Robbert; Shaw, Blanka; Patino, Jairo; Desamore, Aurelie; Goffinet, B.; Cox, C. J.; Shaw, A. Jonathan; Vanderpoorten, Alain
    Shifts in sexual systems are one of the key drivers of species diversification. In contrast to angiosperms, unisexuality prevails in bryophytes. Here, we test the hypotheses that bisexuality evolved from an ancestral unisexual condition and is a key innovation in liverworts. We investigate whether shifts in sexual systems influence diversification using hidden state speciation and extinction analysis (HiSSE). This new method compares the effects of the variable of interest to the best-fitting latent variable, yielding robust and conservative tests. We find that the transitions in sexual systems are significantly biased toward unisexuality, even though bisexuality is coupled with increased diversification. Sexual systems are strongly conserved deep within the liverwort tree but become much more labile toward the present. Bisexuality appears to be a key innovation in liverworts. Its effects on diversification are presumably mediated by the interplay of high fertilization rates, massive spore production and long-distance dispersal, which may separately or together have facilitated liverwort speciation, suppressed their extinction, or both. Importantly, shifts in liverwort sexual systems have the opposite effect when compared to angiosperms, leading to contrasting diversification patterns between the two groups. The high prevalence of unisexuality among liverworts suggests, however, a strong selection for sexual dimorphism.
  • A new species of Leskeodon (Daltoniaceae) from Ecuador
    Publication . Buck, W. R.; Shaw, A. J.; Cox, C. J.
    Leskeodon caducifolius is described from recent collections made at a single site in a cloud forest in southern Ecuador. The species is distinctive in its small, caducous leaves and elongate, porose exothecial cells.
  • Data-specific substitution models improve protein-based phylogenetics
    Publication . Brazão, João; Foster, Peter G.; J. Cox, Cymon
    Calculating amino-acid substitution models that are specific for individual protein data sets is often difficult due to the computational burden of estimating large numbers of rate parameters. In this study, we tested the computational efficiency and accuracy of five methods used to estimate substitution models, namely Codeml, FastMG, IQ-TREE, P4 (maximum likelihood), and P4 (Bayesian inference). Data-specific substitution models were estimated from simulated alignments (with different lengths) that were generated from a known simulation model and simulation tree. Each of the resulting data-specific substitution models was used to calculate the maximum likelihood score of the simulation tree and simulated data that was used to calculate the model, and compared with the maximum likelihood scores of the known simulation model and simulation tree on the same simulated data. Additionally, the commonly-used empirical models, cpREV and WAG, were assessed similarly. Data-specific models performed better than the empirical models, which under-fitted the simulated alignments, had the highest difference to the simulation model maximum-likelihood score, clustered further from the simulation model in principal component analysis ordination, and inferred less accurate trees. Data-specific models and the simulation model shared statistically indistinguishable maximum-likelihood scores, indicating that the five methods were reasonably accurate at estimating substitution models by this measure. Nevertheless, tree statistics showed differences between optimal maximum likelihood trees. Unlike other model estimating methods, trees inferred using data-specific models generated with IQ-TREE and P4 (maximum likelihood) were not significantly different from the trees derived from the simulation model in each analysis, indicating that these two methods alone were the most accurate at estimating data-specific models. To show the benefits of using data-specific protein models several published data sets were reanalysed using IQ-TREE-estimated models. These newly estimated models were a better fit to the data than the empirical models that were used by the original authors, often inferred longer trees, and resulted in different tree topologies in more than half of the re-analysed data sets. The results of this study show that software availability and high computation burden are not limitations to generating better-fitting data-specific amino-acid substitution models for phylogenetic analyses.
  • Prokaryotic diversity in stream sediments affected by acid mine drainage
    Publication . Carlier, Jorge; Ettamimi, Sara; Cox, Cymon J.; Hammani, Khalil; Ghazal, Hassan; Costa, Maria Clara
    The microbial communities in mining impacted areas rely on a variety of mechanisms to survive in such extreme environments. In this work, a meta-taxonomic approach using 16S rRNA gene sequences was used to investigate the prokaryotic diversity of sediment samples from water bodies affected by acid mine drainage at the Sao Domingos mining area in the south of Portugal. Samples were collected in summer and winter from the most contaminated sites from where the water flows downstream to the freshwater of Chanca's river reservoir. The prokaryotic diversity on water bodies' sediments allowed us to distinguish the highly contaminated sites (pH approximate to 2) from sites with intermediate levels of contamination (pH approximate to 3-6.5), and from sites without contamination (pH approximate to 7.5). The abundances of acidophiles of generaAcidiphilium, Acidibacter, Acidobacterium and Acidocellain the sediments were correlated with the level of acid mine drainage contamination. The two first genera were among the 30 most abundant prokaryotes in all contaminated samples, including one (SS2w), where the contamination was very diluted, thereby emphasizing the impact that such type of pollution can have in the microbial communities of sediments. In addition, the high abundances of archaeal taxa from classThermoplasmataand of bacteria from family RCP1-48 in the sediments from the most contaminated site corroborate their importance in such ecosystems and a putative role in the generation of acid mine drainage.
  • Mitochondrial genes from 18 angiosperms fill sampling gaps for phylogenomic inferences of the early diversification of flowering plants
    Publication . Xue, Jia-Yu; Dong, Shan-Shan; Wang, Ming-Qiang; Song, Tian-Qiang; Zhou, Guang-Can; Li, Zhen; Van de Peer, Yves; Shao, Zhu-Qing; Wang, Wei; Chen, Min; Zhang, Yan-Mei; Sun, Xiao-Qin; Chen, Hong-Feng; Zhang, Yong-Xia; Zhang, Shou-Zhou; Chen, Fei; Zhang, Liang-Sheng; Cox, Cymon; Liu, Yang; Wang, Qiang; Hang, Yue-Yu
    The early diversification of angiosperms is thought to have been a rapid process, which may complicate phylogenetic analyses of early angiosperm relationships. Plastid and nuclear phylogenomic studies have raised several conflicting hypotheses regarding overall angiosperm phylogeny, but mitochondrial genomes have been largely ignored as a relevant source of information. Here we sequenced mitochondrial genomes from 18 angiosperms to fill taxon-sampling gaps in Austrobaileyales, magnoliids, Chloranthales, Ceratophyllales, and major lineages of eudicots and monocots. We assembled a data matrix of 38 mitochondrial genes from 107 taxa to assess how well mitochondrial genomic data address current uncertainties in angiosperm relationships. Although we recovered conflicting phylogenies based on different data sets and analytical methods, we also observed congruence regarding deep relationships of several major angiosperm lineages: Chloranthales were always inferred to be the sister group of Ceratophyllales, Austrobaileyales to mesangiosperms, and the unplaced Dilleniales was consistently resolved as the sister to superasterids. Substitutional saturation, GC compositional heterogeneity, and codon-usage bias are possible reasons for the noise/conflict that may impact phylogenetic reconstruction; and angiosperm mitochondrial genes may not be substantially affected by these factors. The third codon positions of the mitochondrial genes appear to contain more parsimony-informative sites than the first and second codon positions, and therefore produced better resolved phylogenetic relationships with generally strong support. The relationships among these major lineages remain incompletely resolved, perhaps as a result of the rapidity of early radiations. Nevertheless, data from mitochondrial genomes provide additional evidence and alternative hypotheses for exploring the early evolution and diversification of the angiosperms.
  • The archaebacterial origin of eukaryotes
    Publication . Cox, C. J.; Foster, P. G.; Hirt, R. P.; Harris, S. R.; Embley, T. M.
    The origin of the eukaryotic genetic apparatus is thought to be central to understanding the evolution of the eukaryotic cell. Disagreement about the source of the relevant genes has spawned competing hypotheses for the origins of the eukaryote nuclear lineage. The iconic rooted 3-domains tree of life shows eukaryotes and archaebacteria as separate groups that share a common ancestor to the exclusion of eubacteria. By contrast, the eocyte hypothesis has eukaryotes originating within the archaebacteria and sharing a common ancestor with a particular group called the Crenarchaeota or eocytes. Here, we have investigated the relative support for each hypothesis from analysis of 53 genes spanning the 3 domains, including essential components of the eukaryotic nucleic acid replication, transcription, and translation apparatus. As an important component of our analysis, we investigated the fit between model and data with respect to composition. Compositional heterogeneity is a pervasive problem for reconstruction of ancient relationships, which, if ignored, can produce an incorrect tree with strong support. To mitigate its effects, we used phylogenetic models that allow for changing nucleotide or amino acid compositions over the tree and data. Our analyses favor a topology that supports the eocyte hypothesis rather than archaebacterial monophyly and the 3-domains tree of life.
  • The primary divisions of life: a phylogenomic approach employing composition-heterogeneous methods
    Publication . Foster, P. G.; Cox, C. J.; Martin Embley, T.
    The three-domains tree, which depicts eukaryotes and archaebacteria as monophyletic sister groups, is the dominant model for early eukaryotic evolution. By contrast, the ‘eocyte hypothesis’, where eukaryotes are proposed to have originated from within the archaebacteria as sister to the Crenarchaeota (also called the eocytes), has been largely neglected in the literature. We have investigated support for these two competing hypotheses from molecular sequence data using methods that attempt to accommodate the across-site compositional heterogeneity and across-tree compositional and rate matrix heterogeneity that are manifest features of these data. When ribosomal RNA genes were analysed using standard methods that do not adequately model these kinds of heterogeneity, the three-domains tree was supported. However, this support was eroded or lost when composition-heterogeneous models were used, with concomitant increase in support for the eocyte tree for eukaryotic origins. Analysis of combined amino acid sequences from 41 protein-coding genes supported the eocyte tree, whether or not composition-heterogeneous models were used. The possible effects of substitutional saturation of our data were examined using simulation; these results suggested that saturation is delayed by among-site rate variation in the sequences, and that phylogenetic signal for ancient relationships is plausibly present in these data.