The genetic basis of gene expression variation has long been studied with the aim of understanding the landscape of regulatory variants, but also more recently to assist in the interpretation and elucidation of disease signals. To date, many studies have looked in specific tissues and population-based samples, but there has been limited assessment of the degree of inter-population variability in regulatory variation. We analyzed genome-wide gene expression in lymphoblastoid cell lines from a total of 726 individuals from 8 global populations from the HapMap3 project and correlated gene expression levels with HapMap3 SNPs located in cis to the genes. We describe the influence of ancestry on gene expression levels within and between these diverse human populations and uncover a non-negligible impact on global patterns of gene expression. We further dissect the specific functional pathways differentiated between populations. We also identify 5,691 expression quantitative trait loci (eQTLs) after controlling for both non-genetic factors and population admixture and observe that half of the cis-eQTLs are replicated in one or more of the populations. We highlight patterns of eQTL-sharing between populations, which are partially determined by population genetic relatedness, and discover significant sharing of eQTL effects between Asians, European-admixed, and African subpopulations. Specifically, we observe that both the effect size and the direction of effect for eQTLs are highly conserved across populations. We observe an increasing proximity of eQTLs toward the transcription start site (TSS) as sharing of eQTLs among populations increases, highlighting that variants close to the TSS have stronger effects and therefore are more likely to be detected across a wider panel of populations.
Together these results offer a unique picture and resource of the degree of differentiation among human populations in functional regulatory variation and provide an estimate for the transferability of complex trait variants across populations.
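The cis association step described in the abstract above (correlating expression levels with SNPs located near each gene) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names, the 1 Mb window around the TSS, the use of a plain Pearson correlation, and the toy genotype and expression values. It is not the authors' pipeline, which also controls for non-genetic factors and admixture.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant vector carries no association signal
    return sxy / sqrt(sxx * syy)


def cis_scan(expression, snps, tss, window=1_000_000):
    """Correlate expression with every SNP within `window` bp of the TSS.

    `snps` maps a SNP id to (position, allele dosages coded 0/1/2).
    The 1 Mb default window is an assumed value, not taken from the paper.
    Returns (snp_id, distance_to_tss, r) sorted by decreasing |r|.
    """
    hits = []
    for snp_id, (pos, dosages) in snps.items():
        if abs(pos - tss) <= window:
            hits.append((snp_id, pos - tss, pearson_r(dosages, expression)))
    return sorted(hits, key=lambda h: -abs(h[2]))


# Toy data (invented): expression of one gene in six individuals.
expression = [1.0, 1.2, 2.1, 1.9, 3.0, 3.2]
snps = {
    "snp_near": (1_050_000, [0, 0, 1, 1, 2, 2]),  # tracks expression closely
    "snp_flat": (1_200_000, [0, 1, 2, 0, 1, 2]),  # weaker relationship
    "snp_far": (9_000_000, [0, 0, 1, 1, 2, 2]),   # outside the cis window
}
results = cis_scan(expression, snps, tss=1_000_000)
for snp_id, distance, r in results:
    print(snp_id, distance, round(r, 3))
```

In this toy case the SNP outside the window is never tested, which mirrors why a cis scan involves far fewer tests than a genome-wide trans scan.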
Extensive studies are currently being performed to associate disease susceptibility with one form of genetic variation, namely, single-nucleotide polymorphisms (SNPs). In recent years, another type of common genetic variation has been characterized, namely, structural variation, including copy number variants (CNVs). To determine the overall contribution of CNVs to complex phenotypes, we have performed association analyses of expression levels of 14,925 transcripts with SNPs and CNVs in individuals who are part of the International HapMap project. SNPs and CNVs captured 83.6% and 17.7% of the total detected genetic variation in gene expression, respectively, but the signals from the two types of variation had little overlap. Interrogation of the genome for both types of variants may be an effective way to elucidate the causes of complex phenotypes and disease in humans.
Population genomics of human gene expression Montgomery, Stephen; Dermitzakis, Emmanouil T; Bird, Christine P ...
Nature Genetics, 10/2007, Volume 39, Issue 10
Journal Article
Peer reviewed
Open access
Genetic variation influences gene expression, and this variation in gene expression can be efficiently mapped to specific genomic regions and variants. Here we have used gene expression profiling of Epstein-Barr virus-transformed lymphoblastoid cell lines of all 270 individuals genotyped in the HapMap Consortium to elucidate the detailed features of genetic variation underlying gene expression variation. We find that gene expression is heritable and that differentiation between populations is in agreement with earlier small-scale studies. A detailed association analysis of over 2.2 million common SNPs per population (5% frequency in HapMap) with gene expression identified at least 1,348 genes with association signals in cis and at least 180 in trans. Replication in at least one independent population was achieved for 37% of cis signals and 15% of trans signals, respectively. Our results strongly support an abundance of cis-regulatory variation in the human genome. Detection of trans effects is limited but suggests that regulatory variation may be the key primary effect contributing to phenotypic variation in humans. We also explore several methodologies that improve the current state of analysis of gene expression variation.
The JAK2V617F mutation is associated with distinct myeloproliferative neoplasms, including polycythemia vera (PV) and essential thrombocythemia (ET), but it remains unclear how it generates disparate disorders. By comparing clonally-derived mutant and wild-type cells from individual patients, we demonstrate that the transcriptional consequences of JAK2V617F are subtle, and that JAK2V617F-heterozygous erythroid cells from ET and PV patients exhibit differential interferon signaling and STAT1 phosphorylation. Increased STAT1 activity in normal CD34-positive progenitors produces an ET-like phenotype, whereas downregulation of STAT1 activity in JAK2V617F-heterozygous ET progenitors produces a PV-like phenotype. Our results illustrate the power of clonal analysis, indicate that the consequences of JAK2V617F reflect a balance between STAT5 and STAT1 activation and are relevant for other neoplasms associated with signaling pathway mutations.
► Effect of JAK2V617F on transcriptome is subtle and less than interindividual variation ► STAT1 is activated in JAK2V617F-heterozygous erythroblasts in ET and not PV ► STAT1 activity provides molecular mechanism for difference between ET and PV ► Power of comparing clonally-derived mutant and wild-type cells from same patient
Infections with the malaria parasite Plasmodium falciparum result in more than 1 million deaths each year worldwide. Deciphering the evolutionary history and genetic variation of P. falciparum is critical for understanding the evolution of drug resistance, identifying potential vaccine candidates and appreciating the effect of parasite variation on prevalence and severity of malaria in humans. Most studies of natural variation in P. falciparum have been either in depth over small genomic regions (up to the size of a small chromosome) or genome wide but only at low resolution. In an effort to complement these studies with genome-wide data, we undertook shotgun sequencing of a Ghanaian clinical isolate (with fivefold coverage), the IT laboratory isolate (with onefold coverage) and the chimpanzee parasite P. reichenowi (with twofold coverage). We compared these sequences with the fully sequenced P. falciparum 3D7 isolate genome. We describe the most salient features of P. falciparum polymorphism and adaptive evolution with relation to gene function, transcript and protein expression and cellular localization. This analysis uncovers the primary evolutionary changes that have occurred since the P. falciparum-P. reichenowi speciation and changes that are occurring within P. falciparum.
NOTE: In the original version of this paper, the authors failed to acknowledge that sequencing of the P. falciparum IT laboratory isolate was funded by a European Union 6th Framework Program grant to the BioMalPar Consortium (contract number LSHP-LT-2004-503578). This error has been corrected in the PDF version of the article.
Gene regulation is considered one of the driving forces of evolution. Although protein-coding DNA sequences and RNA genes have been subject to recent evolutionary events in the human lineage, it has been hypothesized that the large phenotypic divergence between humans and chimpanzees has been driven mainly by changes in gene regulation rather than altered protein-coding gene sequences. Comparative analysis of vertebrate genomes has revealed an abundance of evolutionarily conserved but noncoding sequences. These conserved noncoding (CNC) sequences may well harbor critical regulatory variants that have driven recent human evolution.
Here we identify 1,356 CNC sequences that appear to have undergone dramatic human-specific changes in selective pressures, at least 15% of which have substitution rates significantly above that expected under neutrality. The 1,356 'accelerated CNC' (ANC) sequences are enriched in recent segmental duplications, suggesting a recent change in selective constraint following duplication. In addition, single nucleotide polymorphisms within ANC sequences have a significant excess of high frequency derived alleles and high F(ST) values relative to controls, indicating that acceleration and positive selection are recent in human populations. Finally, a significant number of single nucleotide polymorphisms within ANC sequences are associated with changes in gene expression. The probability of variation in an ANC sequence being associated with a gene expression phenotype is fivefold higher than variation in a control CNC sequence.
Our analysis suggests that ANC sequences have until very recently played a role in human evolution, potentially through lineage-specific changes in gene regulation.
Genome-wide associations have shown a lot of promise in dissecting the genetics of complex traits in humans with single variants, yet a large fraction of the genetic effects is still unaccounted for. Analyzing genetic interactions between variants (epistasis) is one of the potential ways forward. We investigated the abundance and functional impact of a specific type of epistasis, namely the interaction between regulatory and protein-coding variants. Using genotype and gene expression data from the 210 unrelated individuals of the original four HapMap populations, we have explored the combined effects of regulatory and protein-coding single nucleotide polymorphisms (SNPs). We predict that about 18% (1,502 out of 8,233 nsSNPs) of protein-coding variants are differentially expressed among individuals and demonstrate that regulatory variants can modify the functional effect of a coding variant in cis. Furthermore, we show that such interactions in cis can affect the expression of downstream targets of the gene containing the protein-coding SNP. In this way, a cis interaction between regulatory and protein-coding variants has a trans impact on gene expression. Given the abundance of both types of variants in human populations, we propose that joint consideration of regulatory and protein-coding variants may reveal additional genetic effects underlying complex traits and disease and may shed light on causes of differential penetrance of known disease variants.
Mutations in whole organisms are powerful ways of interrogating gene function in a realistic context. We describe a program, the Sanger Institute Mouse Genetics Project, that provides a step toward the aim of knocking out all genes and screening each line for a broad range of traits. We found that hitherto unpublished genes were as likely to reveal phenotypes as known genes, suggesting that novel genes represent a rich resource for investigating the molecular basis of disease. We found many unexpected phenotypes detected only because we screened for them, emphasizing the value of screening all mutants for a wide range of traits. Haploinsufficiency and pleiotropy were both surprisingly common. Forty-two percent of genes were essential for viability, and these were less likely to have a paralog and more likely to contribute to a protein complex than other genes. Phenotypic data and more than 900 mutants are openly available for further analysis.
• Large openly available resource of targeted mouse mutants and phenotypic data • Screen for broad range of disease features and traits • Many novel phenotypes suggest functions for both studied and unstudied genes • Haploinsufficiency and pleiotropy are common
More than 900 new mutant mice lines and a multifaceted phenotypic screening platform reveal unanticipated pleiotropies, widespread effects of haploinsufficiency, potential disease models, and functions for unstudied genes.
Biomarkers to optimize extended adjuvant endocrine therapy for women with estrogen receptor (ER)-positive breast cancer are limited. The HOXB13/IL17BR (H/I) biomarker predicts recurrence risk in ER-positive, lymph node-negative breast cancer patients. H/I was evaluated in the MA.17 trial for prognostic performance for late recurrence and treatment benefit from extended adjuvant letrozole.
A prospective-retrospective, nested case-control design of 83 recurrences matched to 166 nonrecurrences from letrozole- and placebo-treated patients within MA.17 was conducted. Expression of H/I within primary tumors was determined by reverse-transcription polymerase chain reaction with a prespecified cutpoint. The predictive ability of H/I for ascertaining benefit from letrozole was determined using multivariable conditional logistic regression including standard clinicopathological factors as covariates. All statistical tests were two-sided.
High H/I was statistically significantly associated with a decrease in late recurrence in patients receiving extended letrozole therapy (odds ratio [OR] = 0.35; 95% confidence interval [CI] = 0.16 to 0.75; P = .007). In an adjusted model with standard clinicopathological factors, high H/I remained statistically significantly associated with patient benefit from letrozole (OR = 0.33; 95% CI = 0.15 to 0.73; P = .006). Reduction in the absolute risk of recurrence at 5 years was 16.5% for patients with high H/I (P = .007). The interaction between H/I and letrozole treatment was statistically significant (P = .03).
In the absence of extended letrozole therapy, high H/I identifies a subgroup of ER-positive patients disease-free after 5 years of tamoxifen who are at risk for late recurrence. When extended endocrine therapy with letrozole is prescribed, high H/I predicts benefit from therapy and a decreased probability of late disease recurrence.
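As a rough reading aid for the odds ratios reported in this abstract, the sketch below back-calculates a standard error, z statistic, and two-sided P value from a published OR and its 95% confidence interval. It assumes a Wald-type interval that is symmetric on the log-odds scale; the function name is hypothetical, and this is a back-of-envelope consistency check, not the conditional logistic regression used in the study. Because the published OR and CI are rounded, the result is only approximate.

```python
from math import erfc, log, sqrt


def wald_from_ci(or_est, ci_low, ci_high, z_crit=1.96):
    """Recover SE, z and a two-sided P value from an OR and its 95% CI.

    Assumes the interval was built as exp(log(OR) +/- z_crit * SE),
    which is an assumption about the reporting, not a documented fact.
    """
    se = (log(ci_high) - log(ci_low)) / (2 * z_crit)  # SE of log(OR)
    z = log(or_est) / se
    p = erfc(abs(z) / sqrt(2))  # two-sided normal tail probability
    return se, z, p


# Values taken from the unadjusted result quoted in the abstract.
se, z, p = wald_from_ci(0.35, 0.16, 0.75)
print(f"SE(log OR) = {se:.3f}, z = {z:.2f}, two-sided P = {p:.4f}")
```

The recovered P value lands in the same range as the reported one, which is about as much agreement as rounded inputs allow.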