Population stratification is a well-known confounding factor in both common and rare variant association analyses. Rare variants tend to be more geographically clustered than common variants, because ...of their more recent origin. However, it is not yet clear if population stratification at a very fine scale (neighboring administrative regions within a country) would lead to statistical bias in rare variant analyses. As the inclusion of convenience controls from external studies is indeed a common procedure, in order to increase the power to detect genetic associations, this problem is important. We studied through simulation the impact of a fine scale population structure on different rare variant association strategies, assessing type I error and power. We showed that principal component analysis (PCA) based methods of adjustment for population stratification adequately corrected type I error inflation at the largest geographical scales, but not at finest scales. We also showed in our simulations that adding controls obviously increased power, but at a considerably lower level when controls were drawn from another population.
Structural variations of DNA greater than 1 kilobase in size account for most bases that vary among human genomes, but are still relatively under-ascertained. Here we use tiling oligonucleotide ...microarrays, comprising 42 million probes, to generate a comprehensive map of 11,700 copy number variations (CNVs) greater than 443 base pairs, of which most (8,599) have been validated independently. For 4,978 of these CNVs, we generated reference genotypes from 450 individuals of European, African or East Asian ancestry. The predominant mutational mechanisms differ among CNV size classes. Retrotransposition has duplicated and inserted some coding and non-coding DNA segments randomly around the genome. Furthermore, by correlation with known trait-associated single nucleotide polymorphisms (SNPs), we identified 30 loci with CNVs that are candidates for influencing disease susceptibility. Despite this, having assessed the completeness of our map and the patterns of linkage disequilibrium between CNVs and SNPs, we conclude that, for complex traits, the heritability void left by genome-wide association studies will not be accounted for by common CNVs.
Starch consumption is a prominent characteristic of agricultural societies and hunter-gatherers in arid environments. In contrast, rainforest and circum-arctic hunter-gatherers and some pastoralists ...consume much less starch. This behavioral variation raises the possibility that different selective pressures have acted on amylase, the enzyme responsible for starch hydrolysis. We found that copy number of the salivary amylase gene (AMY1) is correlated positively with salivary amylase protein level and that individuals from populations with high-starch diets have, on average, more AMY1 copies than those with traditionally low-starch diets. Comparisons with other loci in a subset of these populations suggest that the extent of AMY1 copy number differentiation is highly unusual. This example of positive selection on a copy number-variable gene is, to our knowledge, one of the first discovered in the human genome. Higher AMY1 copy numbers and protein levels probably improve the digestion of starchy foods and may buffer against the fitness-reducing effects of intestinal disease.
Extensive studies are currently being performed to associate disease susceptibility with one form of genetic variation, namely, single-nucleotide polymorphisms (SNPs). In recent years, another type ...of common genetic variation has been characterized, namely, structural variation, including copy number variants (CNVs). To determine the overall contribution of CNVs to complex phenotypes, we have performed association analyses of expression levels of 14,925 transcripts with SNPs and CNVs in individuals who are part of the International HapMap project. SNPs and CNVs captured 83.6% and 17.7% of the total detected genetic variation in gene expression, respectively, but the signals from the two types of variation had little overlap. Interrogation of the genome for both types of variants may be an effective way to elucidate the causes of complex phenotypes and disease in humans.
Copy number variation (CNV) of DNA sequences is functionally significant but has yet to be fully ascertained. We have constructed a first-generation CNV map of the human genome through the study of ...270 individuals from four populations with ancestry in Europe, Africa or Asia (the HapMap collection). DNA from these individuals was screened for CNV using two complementary technologies: single-nucleotide polymorphism (SNP) genotyping arrays, and clone-based comparative genomic hybridization. A total of 1,447 copy number variable regions (CNVRs), which can encompass overlapping or adjacent gains or losses, covering 360 megabases (12% of the genome) were identified in these populations. These CNVRs contained hundreds of genes, disease loci, functional elements and segmental duplications. Notably, the CNVRs encompassed more nucleotide content per genome than SNPs, underscoring the importance of CNV in genetic diversity and evolution. The data obtained delineate linkage disequilibrium patterns for many CNVs, and reveal marked variation in copy number among populations. We also demonstrate the utility of this resource for genetic disease studies.
DNA copy number variation has long been associated with specific chromosomal rearrangements and genomic disorders, but its ubiquity in mammalian genomes was not fully realized until recently. ...Although our understanding of the extent of this variation is still developing, it seems likely that, at least in humans, copy number variants (CNVs) account for a substantial amount of genetic variation. Since many CNVs include genes that result in differential levels of gene expression, CNVs may account for a significant proportion of normal phenotypic variation. Current efforts are directed toward a more comprehensive cataloging and characterization of CNVs that will provide the basis for determining how genomic diversity impacts biological function, evolution, and common human diseases.
Brugada syndrome (BrS) remains genetically heterogeneous and is associated with slowed cardiac conduction. We aimed to identify genetic variation in BrS cases at loci associated with QRS duration.
A ...multi-centre study sequenced seven candidate genes (SCN10A, HAND1, PLN, CASQ2, TKT, TBX3, and TBX5) in 156 Caucasian SCN5A mutation-negative BrS patients (80% male; mean age 48) with symptoms (64%) and/or a family history of sudden death (47%) or BrS (18%). Forty-nine variants were identified: 18 were rare (MAF <1%) and non-synonymous; and 11/18 (61.1%), mostly in SCN10A, were predicted as pathogenic using multiple bioinformatics tools. Allele frequencies were compared with the Exome Sequencing and UK10K Projects. SKAT methods tested rare variation in SCN10A finding no statistically significant difference between cases and controls. Co-segregation analysis was possible for four of seven probands carrying a novel pathogenic variant. Only one pedigree (I671V/G1299A in SCN10A) showed co-segregation. The SCN10A SNP V1073 was, however, associated strongly with BrS 66.9 vs. 40.1% (UK10K) OR (95% CI) = 3.02 (2.35-3.87), P = 8.07 × 10-19. Voltage-clamp experiments for NaV1.8 were performed for SCN10A common variants V1073, A1073, and rare variants of interest: A200V and I671V. V1073, A200V and I671V, demonstrated significant reductions in peak INa compared with ancestral allele A1073 (rs6795970).
Rare variants in the screened QRS-associated genes (including SCN10A) are not responsible for a significant proportion of SCN5A mutation negative BrS. The common SNP SCN10A V1073 was strongly associated with BrS and demonstrated loss of NaV1.8 function, as did rare variants in isolated patients.
Sodium channel NaV1.5 underlies cardiac excitability and conduction. The last 3 residues of NaV1.5 (Ser-Ile-Val) constitute a PDZ domain-binding motif that interacts with PDZ proteins such as ...syntrophins and SAP97 at different locations within the cardiomyocyte, thus defining distinct pools of NaV1.5 multiprotein complexes. Here, we explored the in vivo and clinical impact of this motif through characterization of mutant mice and genetic screening of patients.
To investigate in vivo the regulatory role of this motif, we generated knock-in mice lacking the SIV domain (ΔSIV). ΔSIV mice displayed reduced NaV1.5 expression and sodium current (INa), specifically at the lateral myocyte membrane, whereas NaV1.5 expression and INa at the intercalated disks were unaffected. Optical mapping of ΔSIV hearts revealed that ventricular conduction velocity was preferentially decreased in the transversal direction to myocardial fiber orientation, leading to increased anisotropy of ventricular conduction. Internalization of wild-type and ΔSIV channels was unchanged in HEK293 cells. However, the proteasome inhibitor MG132 rescued ΔSIV INa, suggesting that the SIV motif is important for regulation of NaV1.5 degradation. A missense mutation within the SIV motif (p.V2016M) was identified in a patient with Brugada syndrome. The mutation decreased NaV1.5 cell surface expression and INa when expressed in HEK293 cells.
Our results demonstrate the in vivo significance of the PDZ domain-binding motif in the correct expression of NaV1.5 at the lateral cardiomyocyte membrane and underline the functional role of lateral NaV1.5 in ventricular conduction. Furthermore, we reveal a clinical relevance of the SIV motif in cardiac disease.
While 3D chromatin organization in topologically associating domains (TADs) and loops mediating regulatory element-promoter interactions is crucial for tissue-specific gene regulation, the extent of ...their involvement in human Mendelian disease is largely unknown. Here, we identify 7 families presenting a new cardiac entity associated with a heterozygous deletion of 2 CTCF binding sites on 4q25, inducing TAD fusion and chromatin conformation remodeling. The CTCF binding sites are located in a gene desert at 1 Mb from the Paired-like homeodomain transcription factor 2 gene (PITX2). By introducing the ortholog of the human deletion in the mouse genome, we recapitulate the patient phenotype and characterize an opposite dysregulation of PITX2 expression in the sinoatrial node (ectopic activation) and ventricle (reduction), respectively. Chromatin conformation assay performed in human induced pluripotent stem cell-derived cardiomyocytes harboring the minimal deletion identified in family#1 reveals a conformation remodeling and fusion of TADs. We conclude that TAD remodeling mediated by deletion of CTCF binding sites causes a new autosomal dominant Mendelian cardiac disorder.
Abstract
Aims
In 2003, an Australian woman was convicted by a jury of smothering and killing her four children over a 10-year period. Each child died suddenly and unexpectedly during a sleep period, ...at ages ranging from 19 days to 18 months. In 2019 we were asked to investigate if a genetic cause could explain the children’s deaths as part of an inquiry into the mother’s convictions.
Methods and results
Whole genomes or exomes of the mother and her four children were sequenced. Functional analysis of a novel CALM2 variant was performed by measuring Ca2+-binding affinity, interaction with calcium channels and channel function. We found two children had a novel calmodulin variant (CALM2 G114R) that was inherited maternally. Three genes (CALM1-3) encode identical calmodulin proteins. A variant in the corresponding residue of CALM3 (G114W) was recently reported in a child who died suddenly at age 4 and a sibling who suffered a cardiac arrest at age 5. We show that CALM2 G114R impairs calmodulin's ability to bind calcium and regulate two pivotal calcium channels (CaV1.2 and RyR2) involved in cardiac excitation contraction coupling. The deleterious effects of G114R are similar to those produced by G114W and N98S, which are considered arrhythmogenic and cause sudden cardiac death in children.
Conclusion
A novel functional calmodulin variant (G114R) predicted to cause idiopathic ventricular fibrillation, catecholaminergic polymorphic ventricular tachycardia, or mild long QT syndrome was present in two children. A fatal arrhythmic event may have been triggered by their intercurrent infections. Thus, calmodulinopathy emerges as a reasonable explanation for a natural cause of their deaths.
Graphical Abstract