Publicaciones
Phylogenetic analysis of ferlin genes reveals ancient eukaryotic origins
Nuclear-mitochondrial sequences as witnesses of past interbreeding and population diversity in the jumping bristletail Mesomachilis
Nuclear-mitochondrial sequences (NUMTs) can be especially powerful tools for evolutionary studies by providing a window into the nature of ancestral mitochondrial (mt) lineages. Here we illustrate this property of NUMTs through a survey of such sequences in two species of the jumping bristletail genus Mesomachilis. An ~1800 basepair fragment encompassing three mt genes – COI, tRNA Leu and COII – was cloned twice for a sample of specimens and colonies were screened. NUMTs in this genus are abundant and genetically diverse (up to 23% different from the associated mt sequence). Numerous independent nuclear integration events are scattered at different time depths, suggesting a continuous process of NUMT generation. By combining phylogenetic reconstruction with analyses of the pattern of nucleotide substitution on mt and NUMT branches, we inferred the dynamics of NUMT origin and evolution. The majority of NUMTs in the dataset have apparently evolved from source mt lineages that were substantially different from the currently associated mt sequence, indicating NUMT transfer via matings among divergent lineages. The original source mt lineages either were unsampled or are extinct. Translocations of NUMTs to new nuclear genomes can preserve NUMTs following extinction of their ancestral mt lineages, and can be used to detect mixing between divergent mt lineages and past levels of mt diversity.
Interference with histidyl-tRNA synthetase by a CRISPR spacer sequence as a factor in the evolution of Pelobacter carbinolicus
Intragenomic conflict in populations infected by Parthenogenesis Inducing Wolbachia ends with irreversible loss of sexual reproduction.
History of myxozoan character evolution on the basis of rDNA and EF-2 data
Phosphorylated and non-phosphorylated serine and threonine residues evolve at different rates in mammals
Protein phosphorylation plays an important role in the regulation of protein function. Phosphorylated residues are generally assumed to be subject to functional constraint, but it has recently been suggested from a comparison of distantly related vertebrate species that most phosphorylated residues evolve at the rates consistent with the surrounding regions. To resolve the controversy, we infer the ancestral phosphoproteome of human and mouse to compare the evolutionary rates of phosphorylated and non-phosphorylated serine (S), threonine (T) and tyrosine (Y) residues. This approach enables accurate estimation of evolutionary rates as it does not assume deep conservation of phosphorylated residues. We show that phosphorylated S/T residues tend to evolve more slowly than non-phosphorylated S/T residues not only in disordered but also in ordered protein regions, indicating evolutionary conservation of phosphorylated S/T residues in mammals. Thus, phosphorylated S/T residues tend to be subject to stronger functional constraint than non-phosophorylated residues regardless of the protein regions in which they reside. In contrast, phosphorylated Y residues evolve at similar rates as non-phosphorylated ones. We also find that the human lineage has gained more phosphorylated T residues and lost fewer phosphorylated Y residues than the mouse lineage. The cause of the gain/loss imbalance remains a mystery but should be worth exploring.
Length polymorphism and head shape association among genes with polyglutamine repeats in the stalk-eyed fly, Teleopsis dalmanni
Evolutionary History and Functional Diversification of Phosphomannomutase Genes
Abstract Phosphomannomutases (PMMs) catalyze the interconversion of mannose-6-phosphate to mannose-1-phosphate. In humans, two PMM enzymes exist—PMM1 and PMM2; yet, they have different functional specificities. PMM2 presents PMM activity, and its deficiency causes a Congenital Disorder of Glycosylation (PMM2-CDG). On the other hand, PMM1 can also act as glucose-1,6-bisphosphatase in the brain after stimulation with inosine monophosphate and thus far has not been implicated in any human disease. This study aims to refine the evolutionary time frame at which gene duplication gave rise to PMM1 and PMM2, and to identify the most likely amino acid positions underlying the proteins’ different functions. The phylogenetic analysis using available protein sequences, allowed us to establish that duplication occurred early in vertebrate evolution. In order to understand the molecular basis underlying the functional divergence, conserved and most likely functional divergence-related sites were identified, through the analysis of site-specific evolutionary rates. This analysis indicates that most of the sites known to be important in the homodimer formation and in the catalytic activity are conserved in both proteins. Among those potentially related to functional divergence, two positions (183 and 186 in human PMM1) emerge as the most interesting ones. The residues at these positions have different side-chain conformations in the protein structure in the unbound and bound states, and are highly but differently conserved in PMM1 and in PMM2 proteins. Altogether, these results provide new data into the evolutionary history of PMM1 and PMM2 duplicates and highlight the most probable sites that evolved to distinct functional specificities.
- Content Type Journal Article
- DOI 10.1007/s00239-010-9368-5
- Authors
- Rita Quental, Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP) Rua Dr. Roberto Frias s/n 4200-465 Porto Portugal
- Ana Moleirinho, Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP) Rua Dr. Roberto Frias s/n 4200-465 Porto Portugal
- Luísa Azevedo, Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP) Rua Dr. Roberto Frias s/n 4200-465 Porto Portugal
- António Amorim, Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP) Rua Dr. Roberto Frias s/n 4200-465 Porto Portugal
- Journal Journal of Molecular Evolution
- Online ISSN 1432-1432
- Print ISSN 0022-2844
A Bayesian Approach for Fast and Accurate Gene Tree Reconstruction
Recent sequencing and computing advances have enabled phylogenetic analyses to expand to both entire genomes and large clades, thus requiring more efficient and accurate methods designed specifically for the phylogenomic context. Here we present SPIMAP, an efficient Bayesian method for reconstructing gene trees in the presence of a known species tree. We observe many improvements in reconstruction accuracy, achieved by modeling multiple aspects of evolution, including gene duplication and loss rates, speciation times, and correlated substitution rate variation across both species and loci. We have implemented and applied this method on two clades of fully-sequenced species, 12 Drosophila and 16 fungal genomes as well as simulated phylogenies, and find dramatic improvements in reconstruction accuracy as compared to the most popular existing methods, including those that take the species tree into account. We find that reconstruction inaccuracies of traditional phylogenetic methods overestimate the number of duplication and loss events by as much as 2 to 3 fold, while our method achieves significantly higher accuracy. We feel the results and methods presented here will have many important implications for future investigations of gene evolution.
Expansion of hexose transporter genes was associated with the evolution of aerobic fermentation in yeasts
The genetic basis of organisms’ adaptation to different environments is a central issue of molecular evolution. The budding yeast Saccharomyces cerevisiae and its relatives predominantly ferment glucose into ethanol even in the presence of oxygen. This was suggested to be an adaptation to glucose-rich habitats, but the underlying genetic basis of the evolution of aerobic fermentation remains unclear. In S. cerevisiae, the first step of glucose metabolism is transporting glucose across the plasma membrane, which is carried out by hexose transporter (Hxt) proteins. Although several studies have recognized that the rate of glucose uptake can affect how glucose is metabolized, the role of HXT genes in the evolution of aerobic fermentation has not been fully explored. In this study, we identified all members of the HXT gene family in 23 fully sequenced fungal genomes, reconstructed their evolutionary history to pinpoint gene gain and loss events and evaluated their adaptive significance in the evolution of aerobic fermentation. We found that the HXT genes have been extensively amplified in the two fungal lineages that have independently evolved aerobic fermentation. In contrast, reduction of the number of HXT genes has occurred in aerobic respiratory species. Our study reveals a strong positive correlation between the copy number of HXT genes and the strength of aerobic fermentation, suggesting that HXT gene expansion has facilitated the evolution of aerobic fermentation.
Biased gene conversion affects patterns of codon usage and amino acid usage in the Saccharomyces senso-strictu group of yeasts
Patterns of synonymous codon usage vary between organisms and are controlled by neutral processes (such as drift and mutation) as well as by selection. Here we show that an additional neutral process, GC-biased gene conversion (gBGC), plays a part in shaping patterns of both synonymous codon usage and amino acid composition in a manner dependent upon the local recombination rate. We obtain estimates of the strength of gBGC acting on synonymous sites in five species of yeast, which we find to be a much weaker force than selection. We use this to correct estimates of the strength of selection on codon usage bias, which are normally confounded by the action of gBGC. Our estimate of the rate of gBGC agrees well with an experimentally determined value obtained from Saccharomyces cerevisiae. We also find that, contrary to expectation, codon usage bias is highest in areas of intermediate levels of recombination for GC-ending optimal codons. Possible reasons for this are discussed.
Evolution of the insect yellow gene family
The yellow gene family is intriguing for a number of reasons. To date, yellow-like genes have only been identified in insect species and a number of bacteria. The function of the yellows is largely unknown, although a few have been associated with melanisation and behaviour in Drosophila, and a unique clade of genes from Apis mellifera may be involved in caste specification. Here we show that yellow-like sequences are present in bacteria, insects and fungi, but absent from other eukaryotes apart from isolated putative sequences in Amphioxus, the Salmon Louse and Naegleria. The yellow-like family forms a discrete gene class characterised by the presence of a major royal jelly protein (MRJP) domain, but eukaryote yellow-like proteins are not monophyletic. The unusual phylogenetic distribution of yellow-like sequences suggests either multiple horizontal transfer from bacteria into eukaryotes, or extensive gene loss in eukaryote lineages. Comparative analysis of yellow family synteny and gene order demonstrates that a highly conserved block of three to five genes has been maintained throughout insect diversification despite extensive genome rearrangements. We show strong purifying selection on seven yellow genes over approximately 100 million years separating the silkmoth and Heliconius butterflies, and an association between spatial regulation of gene expression and distribution of melanic pigment in the developing butterfly wing. A single, ancestral yellow-like gene has therefore undergone multiple rounds of duplication within the insects, accompanied by functional constraint on both genomic location and protein evolution.
Molecular footprints of local adaptation in two Mediterranean conifers
This study combines neutrality tests and environmental correlations to identify non-neutral patterns of evolution in candidate genes related to drought stress in two closely-related Mediterranean conifers, Pinus pinaster Ait. and Pinus halepensis Mill. Based on previous studies, we selected 12 amplicons covering six candidate genes that were sequenced in a large sample spanning the full range of these two species. Neutrality tests relatively robust to demography (DHEW compound test and ML-HKA test) were used to detect selection events at different temporal scales. Environmental associations between variation at candidate genes and climatic variables were also examined. These combined approaches detected distinct genes that may be targeted by selection, most of them specific to only one of the two conifers, despite their recent divergence (< 10 Ma). An exception was 4-coumarate: CoA ligase (4cl), a gene involved in the production of various important secondary products that appeared to play a role in local adaptation processes of both pines. Another remarkable result was that all significant environmental correlations involved temperature indices, highlighting the importance of this climatic factor as a selective driver on Mediterranean pines. The ability to detect natural selection at the DNA sequence level depends on the nature and the strength of the selection events, on the timescale at which they occurred and on the sensitivity of the methods to other evolutionary forces that can mimic selection (e.g. demography, population structure). Using complementary approaches can help to capture different aspects of the evolutionary processes that govern molecular variation at both intra- and interspecific levels.
Phylogenetic and coalescent analysis of three loci suggest that the Water Rail is divisible into two species, Rallus aquaticus and R. indicus
Frequent gene conversion events between the X and Y homologous chromosomal regions in primates
Gene Family Size Conservation Is a Good Indicator of Evolutionary Rates
The evolution of duplicate genes has been a topic of broad interest. Here, we propose that the conservation of gene family size is a good indicator of the rate of sequence evolution and some other biological properties. By comparing the human–chimpanzee–macaque orthologous gene families with and without family size conservation, we demonstrate that genes with family size conservation evolve more slowly than those without family size conservation. Our results further demonstrate that both family expansion and contraction events may accelerate gene evolution, resulting in elevated evolutionary rates in the genes without family size conservation. In addition, we show that the duplicate genes with family size conservation evolve significantly more slowly than those without family size conservation. Interestingly, the median evolutionary rate of singletons falls in between those of the above two types of duplicate gene families. Our results thus suggest that the controversy on whether duplicate genes evolve more slowly than singletons can be resolved when family size conservation is taken into consideration. Furthermore, we also observe that duplicate genes with family size conservation have the highest level of gene expression/expression breadth, the highest proportion of essential genes, and the lowest gene compactness, followed by singletons and then by duplicate genes without family size conservation. Such a trend accords well with our observations of evolutionary rates. Our results thus point to the importance of family size conservation in the evolution of duplicate genes.
An Alignment Confidence Score Capturing Robustness to Guide Tree Uncertainty
Multiple sequence alignment (MSA) is the basis for a wide range of comparative sequence analyses from molecular phylogenetics to 3D structure prediction. Sophisticated algorithms have been developed for sequence alignment, but in practice, many errors can be expected and extensive portions of the MSA are unreliable. Hence, it is imperative to understand and characterize the various sources of errors in MSAs and to quantify site-specific alignment confidence. In this paper, we show that uncertainties in the guide tree used by progressive alignment methods are a major source of alignment uncertainty. We use this insight to develop a novel method for quantifying the robustness of each alignment column to guide tree uncertainty. We build on the widely used bootstrap method for perturbing the phylogenetic tree. Specifically, we generate a collection of trees and use each as a guide tree in the alignment algorithm, thus producing a set of MSAs. We next test the consistency of every column of the MSA obtained from the unperturbed guide tree with respect to the set of MSAs. We name this measure the "GUIDe tree based AligNment ConfidencE" (GUIDANCE) score. Using the Benchmark Alignment data BASE benchmark as well as simulation studies, we show that GUIDANCE scores accurately identify errors in MSAs. Additionally, we compare our results with the previously published Heads-or-Tails score and show that the GUIDANCE score is a better predictor of unreliably aligned regions.
Bayesian Estimation of Divergence Times from Large Sequence Alignments
Bayesian estimation of divergence times from molecular sequences relies on sophisticated Markov chain Monte Carlo techniques, and Metropolis–Hastings (MH) samplers have been successfully used in that context. This approach involves heavy computational burdens that can hinder the analysis of large phylogenomic data sets. Reliable estimation of divergence times can also be extremely time consuming, if not impossible, for sequence alignments that convey weak or conflicting phylogenetic signals, emphasizing the need for more efficient sampling methods. This article describes a new approach that estimates the posterior density of substitution rates and node times. The prior distribution of rates accounts for their potential autocorrelation along lineages, whereas priors on node ages are modeled with uniform densities. Also, the likelihood function is approximated by a multivariate normal density. The combination of these components leads to convenient mathematical simplifications, allowing the posterior distribution of rates and times to be estimated using a Gibbs sampling algorithm. The analysis of four real-world data sets shows that this sampler outperforms the standard MH approach and demonstrates the suitability of this new method for analyzing large and/or difficult data sets.
Structural Considerations in the Fitness Landscape of a Virus
Viral fitness is determined by replication within hosts and transmission between them. We examine how pleiotropic mutations that have antagonistic effects (i.e., antibody evasion vs. receptor binding) on viral replication within hosts can impact viral immune escape in the host population. When the host population is vaccinated, the virus escapes from passive immunity by mutations in the antibody-binding region on the surface of the target protein. However, the reduced ability of the antibody to bind the virus is often accompanied by a reduced ability of the virus to bind the cell receptor because the antibody-binding region overlaps with the receptor-binding domain (RBD). The types of permitted mutations are limited. To investigate the causal relation between a mutation in a viral genome and adaptive evolution of a viral population, we developed a mathematical model that describes the population dynamics of viruses, antibodies, and normal/infected cells within a host. The coefficients describe the binding affinity between the virus and the induced antibody and that between the virus and its receptor. Our knowledge-based index enables us to estimate the effect of a mutation in a binding region on the binding affinity. Using population genetic theory, we evaluated the probability that a mutant is fixed in a host population. The mutations that can be fixed with high probabilities may determine how long a vaccine remains effective. We simulate the adaptive evolution of coronavirus, the etiological agent of severe acute respiratory syndrome, and show that some of mutations in the RBD may have high fixation probabilities in the vaccinated host population.





