Renal cell carcinoma (RCC) represents between 80 and 90% of kidney cancers. Previous genome-wide association studies of RCC have identified five variants conferring risk of the disease. Here we ...report the results from a discovery RCC genome-wide association study and replication analysis, including a total of 2,411 patients and 71,497 controls. One variant, rs35252396CG located at 8q24.21, is significantly associated with RCC after combining discovery and replication results (OR=1.27, P(combined)=5.4 × 10(-11)) and has an average risk allele frequency in controls of 46%. rs35252396CG does not have any strongly correlated variants in the genome and is located within a region predicted to have regulatory functions in several cell lines, including six originating from the kidney. This is the first RCC variant reported at 8q24.21 and it is largely independent (r(2)≤0.02) of the numerous previously reported cancer risk variants at this locus.
To search for new sequence variants that confer risk of cutaneous basal cell carcinoma (BCC), we conducted a genome-wide SNP association study of 930 Icelanders with BCC and 33,117 controls. After ...analyzing 304,083 SNPs, we observed signals from loci at 1p36 and 1q42, and replicated these associations in additional sample sets from Iceland and Eastern Europe. Overall, the most significant signals were from rs7538876 on 1p36 (OR = 1.28, P = 4.4 × 10−12) and rs801114 on 1q42 (OR = 1.28, P = 5.9 × 10−12). The 1p36 locus contains the candidate genes PADI4, PADI6, RCC2 and ARHGEF10L, and the gene nearest to the 1q42 locus is the ras-homolog RHOU. Neither locus was associated with fair pigmentation traits that are known risk factors for BCC, and no risk was observed for melanoma. Approximately 1.6% of individuals of European ancestry are homozygous for both variants, and their estimated risk of BCC is 2.68 times that of noncarriers.
Celotno besedilo
Dostopno za:
DOBA, IJS, IZUM, KILJ, NUK, PILJ, PNG, SAZU, UILJ, UKNU, UL, UM, UPUK
We have accumulated considerable data on the genetic makeup of the Icelandic population by sequencing the whole genomes of 2,636 Icelanders to depth of at least 10X and by chip genotyping 101,584 ...more. The sequencing was done with Illumina technology. The median sequencing depth was 20X and 909 individuals were sequenced to a depth of at least 30X. We found 20 million single nucleotide polymorphisms (SNPs) and 1.5 million insertions/deletions (indels) that passed stringent quality control. Almost all the common SNPs (derived allele frequency (DAF) over 2%) that we identified in Iceland have been observed by either dbSNP (build 137) or the Exome Sequencing Project (ESP) while only 60 and 20% of rare (DAF<0.5%) SNPs and indels in coding regions, the most heavily studied parts of the genome, have been observed in the public databases. Features of our variant data, such as the transition/transversion ratio and the length distribution of indels, are similar to published reports.
The genetic basis of the human vocal system is largely unknown, as are the sequence variants that give rise to individual differences in voice and speech. Here, we couple data on diversity in the ...sequence of the genome with voice and vowel acoustics in speech recordings from 12,901 Icelanders. We show how voice pitch and vowel acoustics vary across the life span and correlate with anthropometric, physiological, and cognitive traits. We found that voice pitch and vowel acoustics have a heritable component and discovered correlated common variants in
that associate with voice pitch. The
variants also associate with adrenal gene expression and cardiovascular traits. By showing that voice and vowel acoustics are influenced by genetics, we have taken important steps toward understanding the genetics and evolution of the human vocal system.
We used an approach that we term ancestry-shift refinement mapping to investigate an association, originally discovered in a GWAS of a Chinese population, between rs2046210T and breast cancer ...susceptibility. The locus is on 6q25.1 in proximity to the C6orf97 and estrogen receptor alpha (ESR1) genes. We identified a panel of SNPs that are correlated with rs2046210 in Chinese, but not necessarily so in other ancestral populations, and genotyped them in breast cancer case:control samples of Asian, European, and African origin, a total of 10,176 cases and 13,286 controls. We found that rs2046210T does not confer substantial risk of breast cancer in Europeans and Africans (OR = 1.04, P = 0.099, and OR = 0.98, P = 0.77, respectively). Rather, in those ancestries, an association signal arises from a group of less common SNPs typified by rs9397435. The rs9397435G allele was found to confer risk of breast cancer in European (OR = 1.15, P = 1.2 x 10(-3)), African (OR = 1.35, P = 0.014), and Asian (OR = 1.23, P = 2.9 x 10(-4)) population samples. Combined over all ancestries, the OR was 1.19 (P = 3.9 x 10(-7)), was without significant heterogeneity between ancestries (P(het) = 0.36) and the SNP fully accounted for the association signal in each ancestry. Haplotypes bearing rs9397435G are well tagged by rs2046210T only in Asians. The rs9397435G allele showed associations with both estrogen receptor positive and estrogen receptor negative breast cancer. Using early-draft data from the 1,000 Genomes project, we found that the risk allele of a novel SNP (rs77275268), which is closely correlated with rs9397435, disrupts a partially methylated CpG sequence within a known CTCF binding site. These studies demonstrate that shifting the analysis among ancestral populations can provide valuable resolution in association mapping.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Intracranial volume, measured through magnetic resonance imaging and/or estimated from head circumference, is heritable and correlates with cognitive traits and several neurological disorders. We ...performed a genome-wide association study meta-analysis of intracranial volume (
= 79 174) and found 64 associating sequence variants explaining 5.0% of its variance. We used coding variation, transcript and protein levels, to uncover 12 genes likely mediating the effect of these variants, including
and
that affect cranial synostosis and microcephaly, respectively. Intracranial volume correlates genetically with volumes of cortical and sub-cortical regions, cognition, learning, neonatal and neurological traits. Parkinson's disease cases have greater and attention deficit hyperactivity disorder cases smaller intracranial volume than controls. Our Mendelian randomization studies indicate that intracranial volume associated variants either increase the risk of Parkinson's disease and decrease the risk of attention deficit hyperactivity disorder and neuroticism or correlate closely with a confounder.
Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic ...variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data
. Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank
. This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation.
Our aim was to create a general-purpose relational data format and analysis tools to provide an efficient and coherent framework for working with large volumes of DNA sequence data.
For this purpose ...we developed the GORpipe software system. It is based on a genomic ordered architecture and uses a declarative query language that combines features from SQL and shell pipe syntax in a novel manner. The system can for instance be used to annotate sequence variants, find genomic spatial overlap between various types of genomic features, filter and aggregate them in various ways.
The GORpipe software is freely available for non-commercial academic usage and can be downloaded from www.nextcode.com/gorpipe CONTACT: hakon@wuxinextcode.comSupplementary information: Supplementary data are available at Bioinformatics online.
Mutations in genes encoding subunits of the phagocyte NADPH oxidase complex are recognized to cause chronic granulomatous disease (CGD), a severe primary immunodeficiency. Here we describe how ...deficiency of CYBC1, a previously uncharacterized protein in humans (C17orf62), leads to reduced expression of NADPH oxidase's main subunit (gp91
) and results in CGD. Analyzing two brothers diagnosed with CGD we identify a homozygous loss-of-function mutation, p.Tyr2Ter, in CYBC1. Imputation of p.Tyr2Ter into 155K chip-genotyped Icelanders reveals six additional homozygotes, all with signs of CGD, manifesting as colitis, rare infections, or a severely impaired PMA-induced neutrophil oxidative burst. Homozygosity for p.Tyr2Ter consequently associates with inflammatory bowel disease (IBD) in Iceland (P = 8.3 × 10
; OR = 67.6), as well as reduced height (P = 3.3 × 10
; -8.5 cm). Overall, we find that CYBC1 deficiency results in CGD characterized by colitis and a distinct profile of infections indicative of macrophage dysfunction.
The great majority of thyroid cancers are of the non-medullary type. Here we report findings from a genome-wide association study of non-medullary thyroid cancer, including in total 3,001 patients ...and 287,550 controls from five study groups of European descent. Our results yield five novel loci (all with P
<3 × 10
): 1q42.2 (rs12129938 in PCNXL2), 3q26.2 (rs6793295 a missense mutation in LRCC34 near TERC), 5q22.1 (rs73227498 between NREP and EPB41L4A), 10q24.33 (rs7902587 near OBFC1), and two independently associated variants at 15q22.33 (rs2289261 and rs56062135; both in SMAD3). We also confirm recently published association results from a Chinese study of a variant on 5p15.33 (rs2736100 near the TERT gene) and present a stronger association result for a moderately correlated variant (rs10069690; OR=1.20, P=3.2 × 10
) based on our study of individuals of European ancestry. In combination, these results raise several opportunities for future studies of the pathogenesis of thyroid cancer.