Publicaciones

Phylogenetic analysis of ferlin genes reveals ancient eukaryotic origins

BMC Evolutionary Biology - hace 14 horas 42 mins
Background: The ferlin gene family possesses a rare and identifying feature consisting of multiple tandem C2 domains and a C-terminal transmembrane domain. Much currently remains unknown about the fundamental function of this gene family, however, mutations in its two most well-characterised members, dysferlin and otoferlin, have been implicated in human disease. The availability of genome sequences from a wide range of species makes it possible to explore the evolution of the ferlin family, providing contextual insight into characteristic features that define the ferlin gene family in its present form in humans. Results: Ferlin genes were detected from all species of representative phyla, with two ferlin subgroups partitioned within the ferlin phylogenetic tree based on the presence or absence of a DysF domain. Invertebrates generally possessed two ferlin genes (one with DysF and one without), with six ferlin genes in most vertebrates (three DysF, three non-DysF). Expansion of the ferlin gene family is evident between the divergence of lamprey (jawless vertebrates) and shark (cartilaginous fish). Common to almost all ferlins is an N-terminal C2-FerI-C2 sandwich, a FerB motif, and two C-terminal C2 domains (C2E and C2F) adjacent to the transmembrane domain. Preservation of these structural elements throughout eukaryotic evolution suggests a fundamental role of these motifs for ferlin function. In contrast, DysF, C2DE, and FerA are optional, giving rise to subtle differences in domain topologies of ferlin genes. Despite conservation of multiple C2 domains in all ferlins, the C-terminal C2 domains (C2E and C2F) displayed higher sequence conservation and greater conservation of putative calcium binding residues across paralogs and orthologs. Interestingly, the two most studied non-mammalian ferlins (Fer-1 and Misfire) in model organisms C. elegans and D. melanogaster, present as outgroups in the phylogenetic analysis, with results suggesting reproduction-related divergence and specialization of species-specific functions within their genus. Conclusions: Our phylogenetic studies provide evolutionary insight into the ferlin gene family. We highlight the existence of ferlin-like proteins throughout eukaryotic evolution, from unicellular phytoplankton and apicomplexan parasites, through to humans. We characterise the preservation of ferlin structural motifs, not only of C2 domains, but also the more poorly characterised ferlin-specific motifs representing the DysF, FerA and FerB domains. Our data suggest an ancient role of ferlin proteins, with lessons from vertebrate biology and human disease suggesting a role relating to vesicle fusion and plasma membrane specialization.

Nuclear-mitochondrial sequences as witnesses of past interbreeding and population diversity in the jumping bristletail Mesomachilis

Molecular Biology & Evolution. AOP - 28 Julio, 2010 - 12:39

Nuclear-mitochondrial sequences (NUMTs) can be especially powerful tools for evolutionary studies by providing a window into the nature of ancestral mitochondrial (mt) lineages. Here we illustrate this property of NUMTs through a survey of such sequences in two species of the jumping bristletail genus Mesomachilis. An ~1800 basepair fragment encompassing three mt genes – COI, tRNA Leu and COII – was cloned twice for a sample of specimens and colonies were screened. NUMTs in this genus are abundant and genetically diverse (up to 23% different from the associated mt sequence). Numerous independent nuclear integration events are scattered at different time depths, suggesting a continuous process of NUMT generation. By combining phylogenetic reconstruction with analyses of the pattern of nucleotide substitution on mt and NUMT branches, we inferred the dynamics of NUMT origin and evolution. The majority of NUMTs in the dataset have apparently evolved from source mt lineages that were substantially different from the currently associated mt sequence, indicating NUMT transfer via matings among divergent lineages. The original source mt lineages either were unsampled or are extinct. Translocations of NUMTs to new nuclear genomes can preserve NUMTs following extinction of their ancestral mt lineages, and can be used to detect mixing between divergent mt lineages and past levels of mt diversity.

Interference with histidyl-tRNA synthetase by a CRISPR spacer sequence as a factor in the evolution of Pelobacter carbinolicus

BMC Evolutionary Biology - 28 Julio, 2010 - 01:00
Background: Pelobacter carbinolicus, a bacterium of the family Geobacteraceae, cannot reduce Fe(III) directly or produce electricity like its relatives. How P. carbinolicus evolved is an intriguing problem. The genome of P. carbinolicus contains clustered regularly interspaced short palindromic repeats (CRISPR) separated by unique spacer sequences, which recent studies have shown to produce RNA molecules that interfere with genes containing identical sequences. Results: CRISPR spacer #1, which matches a sequence within hisS, the histidyl-tRNA synthetase gene of P. carbinolicus, was shown to be expressed. Phylogenetic analysis and genetics demonstrated that a gene paralogous to hisS in the genomes of Geobacteraceae is unlikely to compensate for interference with hisS. Spacer #1 inhibited growth of a transgenic strain of Geobacter sulfurreducens in which the native hisS was replaced with that of P. carbinolicus. The prediction that interference with hisS would result in an attenuated histidyl-tRNA pool insufficient for translation of proteins with multiple closely spaced histidines, predisposing them to mutation and elimination during evolution, was investigated by comparative genomics of P. carbinolicus and related species. Several ancestral genes with high histidine demand have been lost or modified in the P. carbinolicus lineage, providing an explanation for its physiological differences from other Geobacteraceae. Conclusions: The disappearance of multiheme c-type cytochromes and other genes typical of a metal-respiring ancestor from the P. carbinolicus lineage may be the consequence of spacer #1 interfering with hisS, a condition that can be reproduced in a heterologous host. This is the first successful co-introduction of an active CRISPR spacer and its target in the same cell, the first application of a chimeric CRISPR construct consisting of a spacer from one species in the context of repeats of another species, and the first report of a potential impact of CRISPR on genome-scale evolution by interference with an essential gene.

Intragenomic conflict in populations infected by Parthenogenesis Inducing Wolbachia ends with irreversible loss of sexual reproduction.

BMC Evolutionary Biology - 28 Julio, 2010 - 01:00
Background: The maternally inherited, bacterial symbiont, parthenogenesis inducing (PI) Wolbachia, causes females in some haplodiploid insects to produce daughters from both fertilized and unfertilized eggs. The symbionts, with their maternal inheritance, benefit from inducing the production of exclusively daughters, however the optimal sex ratio for the nuclear genome is more male-biased. Here we examine through models how an infection with PI-Wolbachia in a previously uninfected population leads to a genomic conflict between PI-Wolbachia and the nuclear genome. In most natural populations infected with PI-Wolbachia the infection has gone to fixation and sexual reproduction is impossible, specifically because the females have lost their ability to fertilize eggs, even when mated with functional males. Results: The PI Wolbachia infection by itself does not interfere with the fertilization process in infected eggs, fertilized infected eggs develop into biparental infected females. Because of the increasingly female-biased sex ratio in the population during a spreading PI-Wolbachia infection, sex allocation alleles in the host that cause the production of more sons are rapidly selected. In haplodiploid species a reduced fertilization rate leads to the production of more sons. Selection for the reduced fertilization rate leads to a spread of these alleles through both the infected and uninfected population, eventually resulting in the population becoming fixed for both the PI-Wolbachia infection and the reduced fertilization rate. Fertilization rate alleles that completely interfere with fertilization ("virginity alleles") will be selected over alleles that still allow for some fertilization. This drives the final resolution of the conflict: the irreversible loss of sexual reproduction and the complete dependence of the host on its symbiont. Conclusions: This study shows that dependence among organisms can evolve rapidly due to the resolution of the conflicts between cytoplasmic and nuclear genes, and without requiring a mutualism between the partners.

History of myxozoan character evolution on the basis of rDNA and EF-2 data

BMC Evolutionary Biology - 28 Julio, 2010 - 01:00
Background: Phylogenetic relationships among myxosporeans based on ribosomal DNA data disagree with traditional taxonomic classification: a number of myxosporeans with very similar spore morphology are assigned to the same genera even though they are phylogenetically distantly related. The credibility of rDNA as a suitable marker for Myxozoa is uncertain and needs to be proved. Furthermore, we need to know the history of myxospore evolution to understand the great diversity of modern species. Results: Phylogenetic analysis of elongation factor 2 supports the ribosomal DNA-based reconstruction of myxozoan evolution. We propose that SSU rDNA is a reliable marker for inferring myxozoan relationships, even though SSU rDNA analysis markedly disagrees with the current taxonomy. The analyses of character evolution of 15 morphological and 5 bionomical characters show the evolution of individual characters and uncover the main evolutionary changes in the myxosporean spore morphology and bionomy. Most bionomical and several morphological characters were found to be congruent with the phylogeny. The summary of character analyses leads to the simulation of myxozoan ancestral morphotypes and their evolution to the current species. As such, the ancestor of all myxozoans appears to have infected the renal tubules of freshwater fish, was sphaerosporid in shape, and had a spore with polar capsules that discharged slightly sideways. After the separation of Malacosporea, the spore of the common myxosporean ancestor then changed to the typical sphaerosporid morphotype. This species inhabited the marine environment as a parasite of the gall bladder of marine fish and ultimately separated into the three main myxosporean lineages evident today. Two of these lineages re-entered the freshwater environment, one as a myxosporean with Chloromyxum and another with a primitive sphaerosporid morphotype. The common ancestor of all marine myxosporeans had a ceratomyxid shape of spore. Conclusions: We support rDNA based myxozoan phylogeny by the analysis of a protein coding gene and demonstrate the reliability of rDNA as a marker explaining myxozoan relationships. Our tracing the history of myxozoan character evolution discloses ancestral morphotypes and shows their development over the course of evolution. We point out several myxozoan characters that are to a certain extent congruent with the phylogeny and determined that the discrepancy between phylogeny and current taxonomy based on spore morphology is due to an extreme myxospore plasticity occurring during myxozoan evolution.

Phosphorylated and non-phosphorylated serine and threonine residues evolve at different rates in mammals

Molecular Biology & Evolution. AOP - 27 Julio, 2010 - 21:43

Protein phosphorylation plays an important role in the regulation of protein function. Phosphorylated residues are generally assumed to be subject to functional constraint, but it has recently been suggested from a comparison of distantly related vertebrate species that most phosphorylated residues evolve at the rates consistent with the surrounding regions. To resolve the controversy, we infer the ancestral phosphoproteome of human and mouse to compare the evolutionary rates of phosphorylated and non-phosphorylated serine (S), threonine (T) and tyrosine (Y) residues. This approach enables accurate estimation of evolutionary rates as it does not assume deep conservation of phosphorylated residues. We show that phosphorylated S/T residues tend to evolve more slowly than non-phosphorylated S/T residues not only in disordered but also in ordered protein regions, indicating evolutionary conservation of phosphorylated S/T residues in mammals. Thus, phosphorylated S/T residues tend to be subject to stronger functional constraint than non-phosophorylated residues regardless of the protein regions in which they reside. In contrast, phosphorylated Y residues evolve at similar rates as non-phosphorylated ones. We also find that the human lineage has gained more phosphorylated T residues and lost fewer phosphorylated Y residues than the mouse lineage. The cause of the gain/loss imbalance remains a mystery but should be worth exploring.

Length polymorphism and head shape association among genes with polyglutamine repeats in the stalk-eyed fly, Teleopsis dalmanni

BMC Evolutionary Biology - 27 Julio, 2010 - 01:00
Background: Polymorphisms of single amino acid repeats (SARPs) are a potential source of genetic variation for rapidly evolving morphological traits. Here, we characterize variation in and test for an association between SARPs and head shape, a trait under strong sexual selection, in the stalk-eyed fly, Teleopsis dalmanni. Using an annotated expressed sequence tag database developed from eye-antennal imaginal disc tissues in T. dalmanni we identified 98 genes containing nine or more consecutive copies of a single amino acid. We then quantify variation in length and allelic diversity for 32 codon and 15 noncodon repeat regions in a large outbred population. We also assessed the frequency with which amino acid repeats are either gained or lost by identifying sequence similarities between T. dalmanni SARP loci and their orthologs in Drosophila melanogaster. Finally, to identify SARP containing genes that may influence head development we conducted a two-generation association study after assortatively mating for extreme relative eyespan. Results: We found that glutamine repeats occur more often than expected by amino acid abundance among 3,400 head development genes in T. dalmanni and D. melanogaster. Furthermore, glutamine repeats occur disproportionately in transcription factors. Loci with glutamine repeats exhibit heterozygosities and allelic diversities that do not differ from noncoding dinucleotide microsatellites, including greater variation among X-linked than autosomal regions. In the majority of cases, repeat tracts did not overlap between T. dalmanni and D. melanogaster indicating that large glutamine repeats are gained or lost frequently during Dipteran evolution. Analysis of covariance reveals a significant effect of parental genotype on mean progeny eyespan, with body length as a covariate, at six SARP loci [CG33692, ptip, band4.1 inhibitor LRP interactor, corto, 3531953:1, and ecdysone-induced protein 75B (Eip75B)]. Mixed model analysis of covariance using the eyespan of siblings segregating for repeat length variation confirms that significant genotype-phenotype associations exist for at least one sex at five of these loci and for one gene, CG33692, longer repeats were associated with longer relative eyespan in both sexes. Conclusion: Among genes expressed during head development in stalk-eyed flies, long codon repeats typically contain glutamine, occur in transcription factors and exhibit high levels of heterozygosity. Furthermore, the presence of significant associations within families between repeat length and head shape indicates that six genes, or genes linked to them, contribute genetic variation to the development of this extremely sexually dimorphic trait.

Evolutionary History and Functional Diversification of Phosphomannomutase Genes

Journal of Molecular Evolution - 26 Julio, 2010 - 23:09

Abstract  Phosphomannomutases (PMMs) catalyze the interconversion of mannose-6-phosphate to mannose-1-phosphate. In humans, two PMM enzymes exist—PMM1 and PMM2; yet, they have different functional specificities. PMM2 presents PMM activity, and its deficiency causes a Congenital Disorder of Glycosylation (PMM2-CDG). On the other hand, PMM1 can also act as glucose-1,6-bisphosphatase in the brain after stimulation with inosine monophosphate and thus far has not been implicated in any human disease. This study aims to refine the evolutionary time frame at which gene duplication gave rise to PMM1 and PMM2, and to identify the most likely amino acid positions underlying the proteins’ different functions. The phylogenetic analysis using available protein sequences, allowed us to establish that duplication occurred early in vertebrate evolution. In order to understand the molecular basis underlying the functional divergence, conserved and most likely functional divergence-related sites were identified, through the analysis of site-specific evolutionary rates. This analysis indicates that most of the sites known to be important in the homodimer formation and in the catalytic activity are conserved in both proteins. Among those potentially related to functional divergence, two positions (183 and 186 in human PMM1) emerge as the most interesting ones. The residues at these positions have different side-chain conformations in the protein structure in the unbound and bound states, and are highly but differently conserved in PMM1 and in PMM2 proteins. Altogether, these results provide new data into the evolutionary history of PMM1 and PMM2 duplicates and highlight the most probable sites that evolved to distinct functional specificities.

  • Content Type Journal Article
  • DOI 10.1007/s00239-010-9368-5
  • Authors
    • Rita Quental, Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP) Rua Dr. Roberto Frias s/n 4200-465 Porto Portugal
    • Ana Moleirinho, Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP) Rua Dr. Roberto Frias s/n 4200-465 Porto Portugal
    • Luísa Azevedo, Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP) Rua Dr. Roberto Frias s/n 4200-465 Porto Portugal
    • António Amorim, Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP) Rua Dr. Roberto Frias s/n 4200-465 Porto Portugal

A Bayesian Approach for Fast and Accurate Gene Tree Reconstruction

Molecular Biology & Evolution. AOP - 26 Julio, 2010 - 05:21

Recent sequencing and computing advances have enabled phylogenetic analyses to expand to both entire genomes and large clades, thus requiring more efficient and accurate methods designed specifically for the phylogenomic context. Here we present SPIMAP, an efficient Bayesian method for reconstructing gene trees in the presence of a known species tree. We observe many improvements in reconstruction accuracy, achieved by modeling multiple aspects of evolution, including gene duplication and loss rates, speciation times, and correlated substitution rate variation across both species and loci. We have implemented and applied this method on two clades of fully-sequenced species, 12 Drosophila and 16 fungal genomes as well as simulated phylogenies, and find dramatic improvements in reconstruction accuracy as compared to the most popular existing methods, including those that take the species tree into account. We find that reconstruction inaccuracies of traditional phylogenetic methods overestimate the number of duplication and loss events by as much as 2 to 3 fold, while our method achieves significantly higher accuracy. We feel the results and methods presented here will have many important implications for future investigations of gene evolution.

Expansion of hexose transporter genes was associated with the evolution of aerobic fermentation in yeasts

Molecular Biology & Evolution. AOP - 26 Julio, 2010 - 05:21

The genetic basis of organisms’ adaptation to different environments is a central issue of molecular evolution. The budding yeast Saccharomyces cerevisiae and its relatives predominantly ferment glucose into ethanol even in the presence of oxygen. This was suggested to be an adaptation to glucose-rich habitats, but the underlying genetic basis of the evolution of aerobic fermentation remains unclear. In S. cerevisiae, the first step of glucose metabolism is transporting glucose across the plasma membrane, which is carried out by hexose transporter (Hxt) proteins. Although several studies have recognized that the rate of glucose uptake can affect how glucose is metabolized, the role of HXT genes in the evolution of aerobic fermentation has not been fully explored. In this study, we identified all members of the HXT gene family in 23 fully sequenced fungal genomes, reconstructed their evolutionary history to pinpoint gene gain and loss events and evaluated their adaptive significance in the evolution of aerobic fermentation. We found that the HXT genes have been extensively amplified in the two fungal lineages that have independently evolved aerobic fermentation. In contrast, reduction of the number of HXT genes has occurred in aerobic respiratory species. Our study reveals a strong positive correlation between the copy number of HXT genes and the strength of aerobic fermentation, suggesting that HXT gene expansion has facilitated the evolution of aerobic fermentation.

Biased gene conversion affects patterns of codon usage and amino acid usage in the Saccharomyces senso-strictu group of yeasts

Molecular Biology & Evolution. AOP - 23 Julio, 2010 - 13:07

Patterns of synonymous codon usage vary between organisms and are controlled by neutral processes (such as drift and mutation) as well as by selection. Here we show that an additional neutral process, GC-biased gene conversion (gBGC), plays a part in shaping patterns of both synonymous codon usage and amino acid composition in a manner dependent upon the local recombination rate. We obtain estimates of the strength of gBGC acting on synonymous sites in five species of yeast, which we find to be a much weaker force than selection. We use this to correct estimates of the strength of selection on codon usage bias, which are normally confounded by the action of gBGC. Our estimate of the rate of gBGC agrees well with an experimentally determined value obtained from Saccharomyces cerevisiae. We also find that, contrary to expectation, codon usage bias is highest in areas of intermediate levels of recombination for GC-ending optimal codons. Possible reasons for this are discussed.

Evolution of the insect yellow gene family

Molecular Biology & Evolution. AOP - 23 Julio, 2010 - 13:07

The yellow gene family is intriguing for a number of reasons. To date, yellow-like genes have only been identified in insect species and a number of bacteria. The function of the yellows is largely unknown, although a few have been associated with melanisation and behaviour in Drosophila, and a unique clade of genes from Apis mellifera may be involved in caste specification. Here we show that yellow-like sequences are present in bacteria, insects and fungi, but absent from other eukaryotes apart from isolated putative sequences in Amphioxus, the Salmon Louse and Naegleria. The yellow-like family forms a discrete gene class characterised by the presence of a major royal jelly protein (MRJP) domain, but eukaryote yellow-like proteins are not monophyletic. The unusual phylogenetic distribution of yellow-like sequences suggests either multiple horizontal transfer from bacteria into eukaryotes, or extensive gene loss in eukaryote lineages. Comparative analysis of yellow family synteny and gene order demonstrates that a highly conserved block of three to five genes has been maintained throughout insect diversification despite extensive genome rearrangements. We show strong purifying selection on seven yellow genes over approximately 100 million years separating the silkmoth and Heliconius butterflies, and an association between spatial regulation of gene expression and distribution of melanic pigment in the developing butterfly wing. A single, ancestral yellow-like gene has therefore undergone multiple rounds of duplication within the insects, accompanied by functional constraint on both genomic location and protein evolution.

Molecular footprints of local adaptation in two Mediterranean conifers

Molecular Biology & Evolution. AOP - 23 Julio, 2010 - 13:07

This study combines neutrality tests and environmental correlations to identify non-neutral patterns of evolution in candidate genes related to drought stress in two closely-related Mediterranean conifers, Pinus pinaster Ait. and Pinus halepensis Mill. Based on previous studies, we selected 12 amplicons covering six candidate genes that were sequenced in a large sample spanning the full range of these two species. Neutrality tests relatively robust to demography (DHEW compound test and ML-HKA test) were used to detect selection events at different temporal scales. Environmental associations between variation at candidate genes and climatic variables were also examined. These combined approaches detected distinct genes that may be targeted by selection, most of them specific to only one of the two conifers, despite their recent divergence (< 10 Ma). An exception was 4-coumarate: CoA ligase (4cl), a gene involved in the production of various important secondary products that appeared to play a role in local adaptation processes of both pines. Another remarkable result was that all significant environmental correlations involved temperature indices, highlighting the importance of this climatic factor as a selective driver on Mediterranean pines. The ability to detect natural selection at the DNA sequence level depends on the nature and the strength of the selection events, on the timescale at which they occurred and on the sensitivity of the methods to other evolutionary forces that can mimic selection (e.g. demography, population structure). Using complementary approaches can help to capture different aspects of the evolutionary processes that govern molecular variation at both intra- and interspecific levels.

Phylogenetic and coalescent analysis of three loci suggest that the Water Rail is divisible into two species, Rallus aquaticus and R. indicus

BMC Evolutionary Biology - 23 Julio, 2010 - 01:00
Background: Water Rails (Rallus aquaticus) inhabit fragmented freshwater wetlands across their Palearctic distribution. Disjunct populations are now thought to be morphologically similar over their vast geographic range, though four subspecies had been recognized previously. The fossil record suggests that Water Rails (R. aquaticus) were already spread across the Palearctic by the Pleistocene ~2 million years ago, and the oldest fossil remains thought to be closely related to the common ancestor of water rails date from the Pliocene. Results: To investigate population structure in Water Rails at the genetic level we sequenced three independent loci: 686 base pairs (bp) of the mitochondrial DNA COI barcode; 618 bp of the intron ADH5; and 746 bp of the exon PTPN12. Phylogeographic analysis revealed that Water Rails breeding in eastern Asia (R. a. indicus, also known as the Brown-cheeked Rail) are strongly differentiated from the Water Rails in Western and Middle Asia and Europe (R. a. aquaticus and R. a. korejewi). The Kimura 3-parameter plus Gamma COI genetic distance between these two geographic groups was > 3 %, and they differed by 18 diagnostic substitutions commensurate with differences between recently diverged sister species of birds. In spite of the low number of variable sites, the two nuclear loci supported this split. We estimated the split of the Brown-cheeked Rail and the Water Rail to have occurred ~534,000 years ago (95% CI 275,000-990,000 years ago). Fragmentation of the widespread ancestral population and eventual speciation of water rails is likely attributable to vicariance by a barrier formed by glacial cycles, continuous uplift of the Tibetan Plateau and increased sedimentation in deserts in southern Asia that originated in the Miocene. Conclusions: Water Rails from East Asia were genetically differentiated from the ones breeding in Europe and Western to Middle Asia. Most of the genetic signal was from mitochondrial COI, and was corroborated by polymorphic sites in the two nuclear loci we employed. The split between these two lineages was estimated to occur in the Middle Pleistocene, when populations were isolated in disjunct wetlands with little or no gene flow. Independent evidence from differences in morphology and vocalizations in concert with genetic differentiation and a long history of isolation support recognition of the Brown-cheeked Rail breeding in East Asia as a separate species, R. indicus. The use of several independent loci is invaluable in inferring species trees from gene trees and in recognizing species limits.

Frequent gene conversion events between the X and Y homologous chromosomal regions in primates

BMC Evolutionary Biology - 23 Julio, 2010 - 01:00
Background: Mammalian sex-chromosomes originated from a pair of autosomes. A step-wise cessation of recombination is necessary for the proper maintenance of sex-determination and, consequently, generates a four strata structure on the X chromosome. Each stratum shows a specific per-site nucleotide sequence difference (p-distance) between the X and Y chromosomes, depending on the time of recombination arrest. Stratum 4 covers the distal half of the human X chromosome short arm and the p-distance of the stratum is ~10%, on average. However, a 100-kb region, which includes KALX and VCX, in the middle of stratum 4 shows a significantly lower p-distance (1-5%), suggesting frequent sequence exchanges or gene conversions between the X and Y chromosomes in humans. To examine the evolutionary mechanism for this low p-distance region, sequences of a corresponding region including KALX/Y from seven species of non-human primates were analyzed. Results: Phylogenetic analysis of this low p-distance region in humans and non-human primate species revealed that gene conversion like events have taken place at least ten times after the divergence of the New World monkeys and the Catarrhini (i.e., Old World monkeys and hominoids). A KALY-converted KALX allele in white-handed gibbons also suggests a possible recent gene conversion between the X and Y chromosomes. In these primate sequences, the proximal boundary of this low p-distance region is located in a LINE element shared between the X and Y chromosomes, suggesting the involvement of this element in frequent gene conversions. A segmental palindrome structure on the X chromosome at the distal boundary near VCX, in humans and chimpanzees, may mediate frequent sequence exchanges between X and Y chromosomes together with a palindrome on the Y chromosome. Conclusion: Gene conversion events between the X and Y homologous regions have been suggested, mainly in humans. Here, we found frequent gene conversions in the evolutionary course of primates, which were a common trait among primates, but not among humans. An insertion of a LINE element at the proximal end of the region may be a cause for these frequent conversions. This gene conversion in humans may also be one of the genetic causes of Kallmann syndrome.

Gene Family Size Conservation Is a Good Indicator of Evolutionary Rates

Molecular Biology & Evolution - 22 Julio, 2010 - 22:06

The evolution of duplicate genes has been a topic of broad interest. Here, we propose that the conservation of gene family size is a good indicator of the rate of sequence evolution and some other biological properties. By comparing the human–chimpanzee–macaque orthologous gene families with and without family size conservation, we demonstrate that genes with family size conservation evolve more slowly than those without family size conservation. Our results further demonstrate that both family expansion and contraction events may accelerate gene evolution, resulting in elevated evolutionary rates in the genes without family size conservation. In addition, we show that the duplicate genes with family size conservation evolve significantly more slowly than those without family size conservation. Interestingly, the median evolutionary rate of singletons falls in between those of the above two types of duplicate gene families. Our results thus suggest that the controversy on whether duplicate genes evolve more slowly than singletons can be resolved when family size conservation is taken into consideration. Furthermore, we also observe that duplicate genes with family size conservation have the highest level of gene expression/expression breadth, the highest proportion of essential genes, and the lowest gene compactness, followed by singletons and then by duplicate genes without family size conservation. Such a trend accords well with our observations of evolutionary rates. Our results thus point to the importance of family size conservation in the evolution of duplicate genes.

An Alignment Confidence Score Capturing Robustness to Guide Tree Uncertainty

Molecular Biology & Evolution - 22 Julio, 2010 - 22:06

Multiple sequence alignment (MSA) is the basis for a wide range of comparative sequence analyses from molecular phylogenetics to 3D structure prediction. Sophisticated algorithms have been developed for sequence alignment, but in practice, many errors can be expected and extensive portions of the MSA are unreliable. Hence, it is imperative to understand and characterize the various sources of errors in MSAs and to quantify site-specific alignment confidence. In this paper, we show that uncertainties in the guide tree used by progressive alignment methods are a major source of alignment uncertainty. We use this insight to develop a novel method for quantifying the robustness of each alignment column to guide tree uncertainty. We build on the widely used bootstrap method for perturbing the phylogenetic tree. Specifically, we generate a collection of trees and use each as a guide tree in the alignment algorithm, thus producing a set of MSAs. We next test the consistency of every column of the MSA obtained from the unperturbed guide tree with respect to the set of MSAs. We name this measure the "GUIDe tree based AligNment ConfidencE" (GUIDANCE) score. Using the Benchmark Alignment data BASE benchmark as well as simulation studies, we show that GUIDANCE scores accurately identify errors in MSAs. Additionally, we compare our results with the previously published Heads-or-Tails score and show that the GUIDANCE score is a better predictor of unreliably aligned regions.

Bayesian Estimation of Divergence Times from Large Sequence Alignments

Molecular Biology & Evolution - 22 Julio, 2010 - 22:06

Bayesian estimation of divergence times from molecular sequences relies on sophisticated Markov chain Monte Carlo techniques, and Metropolis–Hastings (MH) samplers have been successfully used in that context. This approach involves heavy computational burdens that can hinder the analysis of large phylogenomic data sets. Reliable estimation of divergence times can also be extremely time consuming, if not impossible, for sequence alignments that convey weak or conflicting phylogenetic signals, emphasizing the need for more efficient sampling methods. This article describes a new approach that estimates the posterior density of substitution rates and node times. The prior distribution of rates accounts for their potential autocorrelation along lineages, whereas priors on node ages are modeled with uniform densities. Also, the likelihood function is approximated by a multivariate normal density. The combination of these components leads to convenient mathematical simplifications, allowing the posterior distribution of rates and times to be estimated using a Gibbs sampling algorithm. The analysis of four real-world data sets shows that this sampler outperforms the standard MH approach and demonstrates the suitability of this new method for analyzing large and/or difficult data sets.

Structural Considerations in the Fitness Landscape of a Virus

Molecular Biology & Evolution - 22 Julio, 2010 - 22:06

Viral fitness is determined by replication within hosts and transmission between them. We examine how pleiotropic mutations that have antagonistic effects (i.e., antibody evasion vs. receptor binding) on viral replication within hosts can impact viral immune escape in the host population. When the host population is vaccinated, the virus escapes from passive immunity by mutations in the antibody-binding region on the surface of the target protein. However, the reduced ability of the antibody to bind the virus is often accompanied by a reduced ability of the virus to bind the cell receptor because the antibody-binding region overlaps with the receptor-binding domain (RBD). The types of permitted mutations are limited. To investigate the causal relation between a mutation in a viral genome and adaptive evolution of a viral population, we developed a mathematical model that describes the population dynamics of viruses, antibodies, and normal/infected cells within a host. The coefficients describe the binding affinity between the virus and the induced antibody and that between the virus and its receptor. Our knowledge-based index enables us to estimate the effect of a mutation in a binding region on the binding affinity. Using population genetic theory, we evaluated the probability that a mutant is fixed in a host population. The mutations that can be fixed with high probabilities may determine how long a vaccine remains effective. We simulate the adaptive evolution of coronavirus, the etiological agent of severe acute respiratory syndrome, and show that some of mutations in the RBD may have high fixation probabilities in the vaccinated host population.

Distribuir contenido