Genetics (Current Issue)
James F. Crow: His Life in Public Service [Perspectives]
The readers of this journal may well be aware of Professor Crow’s scientific achievements and his role as the editor of Perspectives. In addition, for many thousands of students at the University of Wisconsin over many generations, James F. Crow was one of the most memorable teachers at both the undergraduate and graduate levels. What is less known is his major role in public service where he served as chair of many important committees for the National Academy of Sciences, the National Institutes of Health, the National Institutes of Justice as well as various international programs. In all of these efforts, Professor Crow has left a lasting impact.
The Effects of Deleterious Mutations on Evolution at Linked Sites [Review]
The process of evolution at a given site in the genome can be influenced by the action of selection at other sites, especially when these are closely linked to it. Such selection reduces the effective population size experienced by the site in question (the Hill–Robertson effect), reducing the level of variability and the efficacy of selection. In particular, deleterious variants are continually being produced by mutation and then eliminated by selection at sites throughout the genome. The resulting reduction in variability at linked neutral or nearly neutral sites can be predicted from the theory of background selection, which assumes that deleterious mutations have such large effects that their behavior in the population is effectively deterministic. More weakly selected mutations can accumulate by Muller’s ratchet after a shutdown of recombination, as in an evolving Y chromosome. Many functionally significant sites are probably so weakly selected that Hill–Robertson interference undermines the effective strength of selection upon them, when recombination is rare or absent. This leads to large departures from deterministic equilibrium and smaller effects on linked neutral sites than under background selection or Muller’s ratchet. Evidence is discussed that is consistent with the action of these processes in shaping genome-wide patterns of variation and evolution.
The Regulation of Filamentous Growth in Yeast [Cell Signaling [amp ] Development]
Filamentous growth is a nutrient-regulated growth response that occurs in many fungal species. In pathogens, filamentous growth is critical for host–cell attachment, invasion into tissues, and virulence. The budding yeast Saccharomyces cerevisiae undergoes filamentous growth, which provides a genetically tractable system to study the molecular basis of the response. Filamentous growth is regulated by evolutionarily conserved signaling pathways. One of these pathways is a mitogen activated protein kinase (MAPK) pathway. A remarkable feature of the filamentous growth MAPK pathway is that it is composed of factors that also function in other pathways. An intriguing challenge therefore has been to understand how pathways that share components establish and maintain their identity. Other canonical signaling pathways—rat sarcoma/protein kinase A (RAS/PKA), sucrose nonfermentable (SNF), and target of rapamycin (TOR)—also regulate filamentous growth, which raises the question of how signals from multiple pathways become integrated into a coordinated response. Together, these pathways regulate cell differentiation to the filamentous type, which is characterized by changes in cell adhesion, cell polarity, and cell shape. How these changes are accomplished is also discussed. High-throughput genomics approaches have recently uncovered new connections to filamentous growth regulation. These connections suggest that filamentous growth is a more complex and globally regulated behavior than is currently appreciated, which may help to pave the way for future investigations into this eukaryotic cell differentiation behavior.
Morphogenesis and the Cell Cycle [Cell Cycle]
Studies of the processes leading to the construction of a bud and its separation from the mother cell in Saccharomyces cerevisiae have provided foundational paradigms for the mechanisms of polarity establishment, cytoskeletal organization, and cytokinesis. Here we review our current understanding of how these morphogenetic events occur and how they are controlled by the cell-cycle-regulatory cyclin-CDK system. In addition, defects in morphogenesis provide signals that feed back on the cyclin-CDK system, and we review what is known regarding regulation of cell-cycle progression in response to such defects, primarily acting through the kinase Swe1p. The bidirectional communication between morphogenesis and the cell cycle is crucial for successful proliferation, and its study has illuminated many elegant and often unexpected regulatory mechanisms. Despite considerable progress, however, many of the most puzzling mysteries in this field remain to be resolved.
Integrating Global Regulatory Input Into the Salmonella Pathogenicity Island 1 Type III Secretion System [Gene Expression]
Salmonella enterica serovar Typhimurium uses the Salmonella pathogenicity island 1 (SPI1) type III secretion system to induce inflammatory diarrhea and bacterial uptake into intestinal epithelial cells. The expression of hilA, encoding the transcriptional activator of the SPI1 structural genes, is directly controlled by three AraC-like regulators, HilD, HilC, and RtsA, each of which can activate the hilD, hilC, rtsA, and hilA genes, forming a complex feed-forward regulatory loop. A large number of factors and environmental signals have been implicated in SPI1 regulation. We have developed a series of genetic tests that allows us to determine where these factors feed into the SPI1 regulatory circuit. Using this approach, we have grouped 21 of the known SPI1 regulators and environmental signals into distinct classes on the basis of observed regulatory patterns, anchored by those few systems where the mechanism of regulation is best understood. Many of these factors are shown to work post-transcriptionally at the level of HilD, while others act at the hilA promoter or affect all SPI1 promoters. Analysis of the published transcriptomic data reveals apparent coregulation of the SPI1 and flagellar genes in various conditions. However, we show that in most cases, the factors that affect both systems control SPI1 independently of the flagellar protein FliZ, despite its role as an important SPI1 regulator and coordinator of the two systems. These results provide a comprehensive model for SPI1 regulation that serves as a framework for future molecular analyses of this complex regulatory network.
dSet1 Is the Main H3K4 Di- and Tri-Methyltransferase Throughout Drosophila Development [Gene Expression]
In eukaryotes, the post-translational addition of methyl groups to histone H3 lysine 4 (H3K4) plays key roles in maintenance and establishment of appropriate gene expression patterns and chromatin states. We report here that an essential locus within chromosome 3L centric heterochromatin encodes the previously uncharacterized Drosophila melanogaster ortholog (dSet1, CG40351) of the Set1 H3K4 histone methyltransferase (HMT). Our results suggest that dSet1 acts as a "global" or general H3K4 di- and trimethyl HMT in Drosophila. Levels of H3K4 di- and trimethylation are significantly reduced in dSet1 mutants during late larval and post-larval stages, but not in animals carrying mutations in genes encoding other well-characterized H3K4 HMTs such as trr, trx, and ash1. The latter results suggest that Trr, Trx, and Ash1 may play more specific roles in regulating key cellular targets and pathways and/or act as global H3K4 HMTs earlier in development. In yeast and mammalian cells, the HMT activity of Set1 proteins is mediated through an evolutionarily conserved protein complex known as Complex of Proteins Associated with Set1 (COMPASS). We present biochemical evidence that dSet1 interacts with members of a putative Drosophila COMPASS complex and genetic evidence that these members are functionally required for H3K4 methylation. Taken together, our results suggest that dSet1 is responsible for the bulk of H3K4 di- and trimethylation throughout Drosophila development, thus providing a model system for better understanding the requirements for and functions of these modifications in metazoans.
A Protosilencer of Subtelomeric Gene Expression in Candida glabrata with Unique Properties [Gene Expression]
Adherence to host cells is an important step in the pathogenicity of the opportunistic fungal pathogen Candida glabrata. This adherence is mediated by some members of the large family of cell wall proteins encoded by the EPA (Epithelial Adhesin) genes present in the C. glabrata genome. The majority of the EPA genes are localized close to different telomeres in C. glabrata, resulting in a negative regulation of transcription of these genes through chromatin-based subtelomeric silencing. In vitro, adherence to epithelial cells is mainly mediated by Epa1, the only member of the EPA family that is expressed in vitro. EPA1 forms a cluster with EPA2 and EPA3 at the subtelomeric region of telomere E-R. EPA2 and EPA3 are subject to silencing that propagates from this telomere in a process that depends on the Sir2, -3, -4, and Rif1 proteins, but surprisingly not on the yKu70 and yKu80 proteins. Here we describe that the yKu70/yKu80-independent silencing of telomere E-R is due to the presence of a cis-acting protosilencer (Sil2126) located between EPA3 and the telomere. This element can silence a reporter gene when placed 31.9 kb away from this telomere, but not when it is removed from the telomere context, or when it is placed near other telomeres, or inverted with respect to the reporter. Importantly, we show that the cis-acting Sil2126 element is required for the yKu70/80-independent silencing of this telomere, underscoring the importance of cis-elements for repressive chromatin formation and spreading on some telomeres in C. glabrata.
Molecular Characterization of Vegetative Incompatibility Genes That Restrict Hypovirus Transmission in the Chestnut Blight Fungus Cryphonectria parasitica [Cellular Genetics]
Genetic nonself recognition systems such as vegetative incompatibility operate in many filamentous fungi to regulate hyphal fusion between genetically dissimilar individuals and to restrict the spread of virulence-attenuating mycoviruses that have potential for biological control of pathogenic fungi. We report here the use of a comparative genomics approach to identify seven candidate polymorphic genes associated with four vegetative incompatibility (vic) loci of the chestnut blight fungus Cryphonectria parasitica. Disruption of candidate alleles in one of two strains that were heteroallelic at vic2, vic6, or vic7 resulted in enhanced virus transmission, but did not prevent barrage formation associated with mycelial incompatibility. Detailed characterization of the vic6 locus revealed the involvement of nonallelic interactions between two tightly linked genes in barrage formation, heterokaryon formation, and asymmetric, gene-specific influences on virus transmission. The combined results establish molecular identities of genes associated with four C. parasitica vic loci and provide insights into how these recognition factors interact to trigger incompatibility and restrict virus transmission.
Distinct Cell Guidance Pathways Controlled by the Rac and Rho GEF Domains of UNC-73/TRIO in Caenorhabditis elegans [Developmental and Behavioral Genetics]
Sperm Development and Motility are Regulated by PP1 Phosphatases in Caenorhabditis elegans [Developmental and Behavioral Genetics]
Sperm from different species have evolved distinctive motility structures, including tubulin-based flagella in mammals and major sperm protein (MSP)-based pseudopods in nematodes. Despite such divergence, we show that sperm-specific PP1 phosphatases, which are required for male fertility in mouse, function in multiple processes in the development and motility of Caenorhabditis elegans amoeboid sperm. We used live-imaging analysis to show the PP1 phosphatases GSP-3 and GSP-4 (GSP-3/4) are required to partition chromosomes during sperm meiosis. Postmeiosis, tracking fluorescently labeled sperm revealed that both male and hermaphrodite sperm lacking GSP-3/4 are immotile. Genetic and in vitro activation assays show lack of GSP-3/4 causes defects in pseudopod development and the rate of pseudopodial treadmilling. Further, GSP-3/4 are required for the localization dynamics of MSP. GSP-3/4 shift localization in concert with MSP from fibrous bodies that sequester MSP at the base of the pseudopod, where directed MSP disassembly facilitates pseudopod contraction. Consistent with a role for GSP-3/4 as a spatial regulator of MSP disassembly, MSP is mislocalized in sperm lacking GSP-3/4. Although a requirement for PP1 phosphatases in nematode and mammalian sperm suggests evolutionary conservation, we show PP1s have independently evolved sperm-specific paralogs in separate lineages. Thus PP1 phosphatases are highly adaptable and employed across a broad range of sexually reproducing species to regulate male fertility.
Combining Markers into Haplotypes Can Improve Population Structure Inference [Population and Evolutionary Genetics]
High-throughput genotyping and sequencing technologies can generate dense sets of genetic markers for large numbers of individuals. For most species, these data will contain many markers in linkage disequilibrium (LD). To utilize such data for population structure inference, we investigate the use of haplotypes constructed by combining the alleles at single-nucleotide polymorphisms (SNPs). We introduce a statistic derived from information theory, the gain of informativeness for assignment (GIA), which quantifies the additional information for assigning individuals to populations using haplotype data compared to using individual loci separately. Using a two-loci–two-allele model, we demonstrate that combining markers in linkage equilibrium into haplotypes always leads to nonpositive GIA, suggesting that combining the two markers is not advantageous for ancestry inference. However, for loci in LD, GIA is often positive, suggesting that assignment can be improved by combining markers into haplotypes. Using GIA as a criterion for combining markers into haplotypes, we demonstrate for simulated data a significant improvement of assigning individuals to candidate populations. For the many cases that we investigate, incorrect assignment was reduced between 26% and 97% using haplotype data. For empirical data from French and German individuals, the incorrectly assigned individuals can, for example, be decreased by 73% using haplotypes. Our results can be useful for challenging population structure and assignment problems, in particular for studies where large-scale population–genomic data are available.
Measuring Selection Coefficients Below 10-3: Method, Questions, and Prospects [Population and Evolutionary Genetics]
Measuring fitness with precision is a key issue in evolutionary biology, particularly in studying mutations of small effects. It is usually thought that sampling error and drift prevent precise measurement of very small fitness effects. We circumvented these limits by using a new combined approach to measuring and analyzing fitness. We estimated the mutational fitness effect (MFE) of three independent mini-Tn10 transposon insertion mutations by conducting competition experiments in large populations of Escherichia coli under controlled laboratory conditions. Using flow cytometry to assess genotype frequencies from very large samples alleviated the problem of sampling error, while the effect of drift was controlled by using large populations and massive replication of fitness measures. Furthermore, with a set of four competition experiments between ancestral and mutant genotypes, we were able to decompose fitness measures into four estimated parameters that account for fitness effects of our fluorescent marker (α), the mutation (β), epistasis between the mutation and the marker (), and departure from transitivity (). Our method allowed us to estimate mean selection coefficients to a precision of 2 x 10–4. We also found small, but significant, epistatic interactions between the allelic effects of mutations and markers and confirmed that fitness effects were transitive in most cases. Unexpectedly, we also detected variation in measures of s that were significantly bigger than expected due to drift alone, indicating the existence of cryptic variation, even in fully controlled experiments. Overall our results indicate that selection coefficients are best understood as being distributed, representing a limit on the precision with which selection can be measured, even under controlled laboratory conditions.
Complex Population Dynamics and the Coalescent Under Neutrality [Population and Evolutionary Genetics]
Estimates of the coalescent effective population size Ne can be poorly correlated with the true population size. The relationship between Ne and the population size is sensitive to the way in which birth and death rates vary over time. The problem of inference is exacerbated when the mechanisms underlying population dynamics are complex and depend on many parameters. In instances where nonparametric estimators of Ne such as the skyline struggle to reproduce the correct demographic history, model-based estimators that can draw on prior information about population size and growth rates may be more efficient. A coalescent model is developed for a large class of populations such that the demographic history is described by a deterministic nonlinear dynamical system of arbitrary dimension. This class of demographic model differs from those typically used in population genetics. Birth and death rates are not fixed, and no assumptions are made regarding the fraction of the population sampled. Furthermore, the population may be structured in such a way that gene copies reproduce both within and across demes. For this large class of models, it is shown how to derive the rate of coalescence, as well as the likelihood of a gene genealogy with heterochronous sampling and labeled taxa, and how to simulate a coalescent tree conditional on a complex demographic history. This theoretical framework encapsulates many of the models used by ecologists and epidemiologists and should facilitate the integration of population genetics with the study of mathematical population dynamics.
Altitudinal Variation at Duplicated {beta}-Globin Genes in Deer Mice: Effects of Selection, Recombination, and Gene Conversion [Population and Evolutionary Genetics]
Spatially varying selection on a given polymorphism is expected to produce a localized peak in the between-population component of nucleotide diversity, and theory suggests that the chromosomal extent of elevated differentiation may be enhanced in cases where tandemly linked genes contribute to fitness variation. An intriguing example is provided by the tandemly duplicated β-globin genes of deer mice (Peromyscus maniculatus), which contribute to adaptive differentiation in blood–oxygen affinity between high- and low-altitude populations. Remarkably, the two β-globin genes segregate the same pair of functionally distinct alleles due to a history of interparalog gene conversion and alleles of the same functional type are in perfect coupling-phase linkage disequilibrium (LD). Here we report a multilocus analysis of nucleotide polymorphism and LD in highland and lowland mice with different genetic backgrounds at the β-globin genes. The analysis of haplotype structure revealed a paradoxical pattern whereby perfect LD between the two β-globin paralogs (which are separated by 16.2 kb) is maintained in spite of the fact that LD within both paralogs decays to background levels over physical distances of less than 1 kb. The survey of nucleotide polymorphism revealed that elevated levels of altitudinal differentiation at each of the β-globin genes drop away quite rapidly in the external flanking regions (upstream of the 5' paralog and downstream of the 3' paralog), but the level of differentiation remains unexpectedly high across the intergenic region. Observed patterns of diversity and haplotype structure are difficult to reconcile with expectations of a two-locus selection model with multiplicative fitness.
Linkage Disequilibrium Under Recurrent Bottlenecks [Population and Evolutionary Genetics]
To model deviations from selectively neutral genetic variation caused by different forms of selection, it is necessary to first understand patterns of neutral variation. Best understood is neutral genetic variation at a single locus. But, as is well known, additional insights can be gained by investigating multiple loci. The resulting patterns reflect the degree of association (linkage) between loci and provide information about the underlying multilocus gene genealogies. The statistical properties of two-locus gene genealogies have been intensively studied for populations of constant size, as well as for simple demographic histories such as exponential population growth and single bottlenecks. By contrast, the combined effect of recombination and sustained demographic fluctuations is poorly understood. Addressing this issue, we study a two-locus Wright–Fisher model of a population subject to recurrent bottlenecks. We derive coalescent approximations for the covariance of the times to the most recent common ancestor at two loci in samples of two chromosomes. This covariance reflects the degree of association and thus linkage disequilibrium between these loci. We find, first, that an effective population-size approximation describes the numerically observed association between two loci provided that recombination occurs either much faster or much more slowly than the population-size fluctuations. Second, when recombination occurs frequently between but rarely within bottlenecks, we observe that the association of gene histories becomes independent of physical distance over a certain range of distances. Third, we show that in this case, a commonly used measure of linkage disequilibrium, $${\sigma }_{\hbox{ d }}^{2}$$ (closely related to $${\widehat{r}}^{2}$$), fails to capture the long-range association between two loci. The reason is that constituent terms, each reflecting the long-range association, cancel. Fourth, we analyze a limiting case in which the long-range association can be described in terms of a Xi coalescent allowing for simultaneous multiple mergers of ancestral lines.
Estimation of Quantitative Trait Locus Effects with Epistasis by Variational Bayes Algorithms [Genetics of Complex Traits]
Bayesian hierarchical shrinkage methods have been widely used for quantitative trait locus mapping. From the computational perspective, the application of the Markov chain Monte Carlo (MCMC) method is not optimal for high-dimensional problems such as the ones arising in epistatic analysis. Maximum a posteriori (MAP) estimation can be a faster alternative, but it usually produces only point estimates without providing any measures of uncertainty (i.e., interval estimates). The variational Bayes method, stemming from the mean field theory in theoretical physics, is regarded as a compromise between MAP and MCMC estimation, which can be efficiently computed and produces the uncertainty measures of the estimates. Furthermore, variational Bayes methods can be regarded as the extension of traditional expectation-maximization (EM) algorithms and can be applied to a broader class of Bayesian models. Thus, the use of variational Bayes algorithms based on three hierarchical shrinkage models including Bayesian adaptive shrinkage, Bayesian LASSO, and extended Bayesian LASSO is proposed here. These methods performed generally well and were found to be highly competitive with their MCMC counterparts in our example analyses. The use of posterior credible intervals and permutation tests are considered for decision making between quantitative trait loci (QTL) and non-QTL. The performance of the presented models is also compared with R/qtlbim and R/BhGLM packages, using a previously studied simulated public epistatic data set.
Transcriptional Regulation and the Diversification of Metabolism in Wine Yeast Strains [Genetics of Complex Traits]
Transcription factors and their binding sites have been proposed as primary targets of evolutionary adaptation because changes to single transcription factors can lead to far-reaching changes in gene expression patterns. Nevertheless, there is very little concrete evidence for such evolutionary changes. Industrial wine yeast strains, of the species Saccharomyces cerevisiae, are a geno- and phenotypically diverse group of organisms that have adapted to the ecological niches of industrial winemaking environments and have been selected to produce specific styles of wine. Variation in transcriptional regulation among wine yeast strains may be responsible for many of the observed differences and specific adaptations to different fermentative conditions in the context of commercial winemaking. We analyzed gene expression profiles of wine yeast strains to assess the impact of transcription factor expression on metabolic networks. The data provide new insights into the molecular basis of variations in gene expression in industrial strains and their consequent effects on metabolic networks important to wine fermentation. We show that the metabolic phenotype of a strain can be shifted in a relatively predictable manner by changing expression levels of individual transcription factors, opening opportunities to modify transcription networks to achieve desirable outcomes.
Diversity of Long Terminal Repeat Retrotransposon Genome Distribution in Natural Populations of the Wild Diploid Wheat Aegilops speltoides [Genome and Systems Biology]
The environment can have a decisive influence on the structure of the genome, changing it in a certain direction. Therefore, the genomic distribution of environmentally sensitive transposable elements may vary measurably across a species area. In the present research, we aimed to detect and evaluate the level of LTR retrotransposon intraspecific variability in Aegilops speltoides (2n = 2x = 14), a wild cross-pollinated relative of cultivated wheat. The interretrotransposon amplified polymorphism (IRAP) protocol was applied to detect and evaluate the level of retrotransposon intraspecific variability in Ae. speltoides and closely related species. IRAP analysis revealed significant diversity in TE distribution. Various genotypes from the 13 explored populations significantly differ with respect to the patterns of the four explored LTR retrotransposons (WIS2, Wilma, Daniela, and Fatima). This diversity points to a constant ongoing process of LTR retrotransposon fraction restructuring in populations of Ae. speltoides throughout the species’ range and within single populations in time. Maximum changes were recorded in genotypes from small stressed populations. Principal component analysis showed that the dynamics of the Fatima element significantly differ from those of WIS2, Wilma, and Daniela. In terms of relationships between Sitopsis species, IRAP analysis revealed a grouping with Ae. sharonensis and Ae. longissima forming a separate unit, Ae. speltoides appearing as a dispersed group, and Ae. bicornis being in an intermediate position. IRAP display data revealed dynamic changes in LTR retrotransposon fractions in the genome of Ae. speltoides. The process is permanent and population specific, ultimately leading to the separation of small stressed populations from the main group.
"SNP Snappy": A Strategy for Fast Genome-Wide Association Studies Fitting a Full Mixed Model [Methods, Technology, and Resources]
A strategy to reduce computational demands of genome-wide association studies fitting a mixed model is presented. Improvements are achieved by utilizing a large proportion of calculations that remain constant across the multiple analyses for individual markers involved, with estimates obtained without inverting large matrices.


