With the recent burst of technological developments in genomics, and the clinical implementation of genome-wide assays, our understanding of the molecular basis of genomic disorders, specifically the ...contribution of structural variation to disease burden, is evolving quickly. Ongoing studies have revealed a ubiquitous role for genome architecture in the formation of structural variants at a given locus, both in DNA recombination-based processes and in replication-based processes. These reports showcase the influence of repeat sequences on genomic stability and structural variant complexity and also highlight the tremendous plasticity and dynamic nature of our genome in evolution, health and disease susceptibility.
Complex chromosomal rearrangements (CCRs) are rearrangements involving more than two chromosomes or more than two breakpoints. Whole genome sequencing (WGS) allows for outstanding high resolution ...characterization on the nucleotide level in unique sequences of such rearrangements, but problems remain for mapping breakpoints in repetitive regions of the genome, which are known to be prone to rearrangements. Hence, multiple complementary WGS experiments are sometimes needed to solve the structures of CCRs. We have studied three individuals with CCRs: Case 1 and Case 2 presented with de novo karyotypically balanced, complex interchromosomal rearrangements (46,XX,t(2;8;15)(q35;q24.1;q22) and 46,XY,t(1;10;5)(q32;p12;q31)), and Case 3 presented with a de novo, extremely complex intrachromosomal rearrangement on chromosome 1. Molecular cytogenetic investigation revealed cryptic deletions in the breakpoints of chromosome 2 and 8 in Case 1, and on chromosome 10 in Case 2, explaining their clinical symptoms. In Case 3, 26 breakpoints were identified using WGS, disrupting five known disease genes. All rearrangements were subsequently analyzed using optical maps, linked-read WGS, and short-read WGS. In conclusion, we present a case series of three unique de novo CCRs where we by combining the results from the different technologies fully solved the structure of each rearrangement. The power in combining short-read WGS with long-molecule sequencing or optical mapping in these unique de novo CCRs in a clinical setting is demonstrated.
Copy number variation (CNV) is a major source of genetic variation among humans. In addition to existing as benign polymorphisms, CNVs can also convey clinical phenotypes, including genomic ...disorders, sporadic diseases and complex human traits. CNV results from genomic rearrangements that can represent simple deletion or duplication of a genomic segment, or be more complex. Complex chromosomal rearrangements (CCRs) have been known for some time but their mechanisms have remained elusive. Recent technology advances and high-resolution human genome analyses have revealed that complex genomic rearrangements can account for a large fraction of non-recurrent rearrangements at a given locus. Various mechanisms, most of which are DNA-replication-based, for example fork stalling and template switching (FoSTeS) and microhomology-mediated break-induced replication (MMBIR), have been proposed for generating such complex genomic rearrangements and are probably responsible for CCR.
During the last two decades, the importance of human genome copy number variation (CNV) in disease has become widely recognized. However, much is not understood about underlying mechanisms. We show ...how, although model organism research guides molecular understanding, important insights are gained from study of the wealth of information available in the clinic. We describe progress in explaining nonallelic homologous recombination (NAHR), a major cause of copy number change occurring when control of allelic recombination fails, highlight the growing importance of replicative mechanisms to explain complex events, and describe progress in understanding extreme chromosome reorganization (chromothripsis). Both nonhomologous end-joining and aberrant replication have significant roles in chromothripsis. As we study CNV, the processes underlying human genome evolution are revealed.
Bardet-Biedl syndrome (BBS) is a defining ciliopathy, notable for extensive allelic and genetic heterogeneity, almost all of which has been identified through sequencing. Recent data have suggested ...that copy-number variants (CNVs) also contribute to BBS. We used a custom oligonucleotide array comparative genomic hybridization (aCGH) covering 20 genes that encode intraflagellar transport (IFT) components and 74 ciliopathy loci to screen 92 unrelated individuals with BBS, irrespective of their known mutational burden. We identified 17 individuals with exon-disruptive CNVs (18.5%), including 13 different deletions in eight BBS genes (BBS1, BBS2, ARL6/BBS3, BBS4, BBS5, BBS7, BBS9, and NPHP1) and a deletion and a duplication in other ciliopathy-associated genes (ALMS1 and NPHP4, respectively). By contrast, we found a single heterozygous exon-disruptive event in a BBS-associated gene (BBS9) in 229 control subjects. Superimposing these data with resequencing revealed CNVs to (1) be sufficient to cause disease, (2) Mendelize heterozygous deleterious alleles, and (3) contribute oligogenic alleles by combining point mutations and exonic CNVs in multiple genes. Finally, we report a deletion and a splice site mutation in IFT74, inherited under a recessive paradigm, defining a candidate BBS locus. Our data suggest that CNVs contribute pathogenic alleles to a substantial fraction of BBS-affected individuals and highlight how either deletions or point mutations in discrete splice isoforms can induce hypomorphic mutations in genes otherwise intolerant to deleterious variation. Our data also suggest that CNV analyses and resequencing studies unbiased for previous mutational burden is necessary to delineate the complexity of disease architecture.
We identified complex genomic rearrangements consisting of intermixed duplications and triplications of genomic segments at the MECP2 and PLP1 loci. These complex rearrangements were characterized by ...a triplicated segment embedded within a duplication in 11 unrelated subjects. Notably, only two breakpoint junctions were generated during each rearrangement formation. All the complex rearrangement products share a common genomic organization, duplication-inverted triplication-duplication (DUP-TRP/INV-DUP), in which the triplicated segment is inverted and located between directly oriented duplicated genomic segments. We provide evidence that the DUP-TRP/INV-DUP structures are mediated by inverted repeats that can be separated by >300 kb, a genomic architecture that apparently leads to susceptibility to such complex rearrangements. A similar inverted repeat-mediated mechanism may underlie structural variation in many other regions of the human genome. We propose a mechanism that involves both homology-driven events, via inverted repeats, and microhomologous or nonhomologous events.
Chromosomal insertions are genomic rearrangements with a chromosome segment inserted into a non-homologous chromosome or a non-adjacent locus on the same chromosome or the other homologue, ...constituting ~2% of nonrecurrent copy-number gains. Little is known about the molecular mechanisms of their formation. We identified 16 individuals with complex insertions among 56,000 individuals tested at Baylor Genetics using clinical array comparative genomic hybridization (aCGH) and fluorescence in situ hybridization (FISH). Custom high-density aCGH was performed on 10 individuals with available DNA, and breakpoint junctions were fine-mapped at nucleotide resolution by long-range PCR and DNA sequencing in 6 individuals to glean insights into potential mechanisms of formation. We observed microhomologies and templated insertions at the breakpoint junctions, resembling the breakpoint junction signatures found in complex genomic rearrangements generated by replication-based mechanism(s) with iterative template switches. In addition, we analyzed 5 families with apparently balanced insertion in one parent detected by FISH analysis and found that 3 parents had additional small copy-number variants (CNVs) at one or both sides of the inserting fragments as well as at the inserted sites. We propose that replicative repair can result in interchromosomal complex insertions generated through chromothripsis-like chromoanasynthesis involving two or three chromosomes, and cause a significant fraction of apparently balanced insertions harboring small flanking CNVs.
Duplication at the Xq28 band including the MECP2 gene is one of the most common genomic rearrangements identified in neurodevelopmentally delayed males. Such duplications are non-recurrent and can be ...generated by a non-homologous end joining (NHEJ) mechanism. We investigated the potential mechanisms for MECP2 duplication and examined whether genomic architectural features may play a role in their origin using a custom designed 4-Mb tiling-path oligonucleotide array CGH assay. Each of the 30 patients analyzed showed a unique duplication varying in size from ∼250 kb to ∼2.6 Mb. Interestingly, in 77% of these non-recurrent duplications, the distal breakpoints grouped within a 215 kb genomic interval, located 47 kb telomeric to the MECP2 gene. The genomic architecture of this region contains both direct and inverted low-copy repeat (LCR) sequences; this same region undergoes polymorphic structural variation in the general population. Array CGH revealed complex rearrangements in eight patients; in six patients the duplication contained an embedded triplicated segment, and in the other two, stretches of non-duplicated sequences occurred within the duplicated region. Breakpoint junction sequencing was achieved in four duplications and identified an inversion in one patient, demonstrating further complexity. We propose that the presence of LCRs in the vicinity of the MECP2 gene may generate an unstable DNA structure that can induce DNA strand lesions, such as a collapsed fork, and facilitate a Fork Stalling and Template Switching event producing the complex rearrangements involving MECP2.
Increased dosage of methyl-CpG-binding protein-2 (MeCP2) results in a dramatic neurodevelopmental phenotype with onset at birth. We generated induced pluripotent stem cells (iPSCs) from patients with ...the MECP2 duplication syndrome (MECP2dup), carrying different duplication sizes, to study the impact of increased MeCP2 dosage in human neurons. We show that cortical neurons derived from these different MECP2dup iPSC lines have increased synaptogenesis and dendritic complexity. In addition, using multi-electrodes arrays, we show that neuronal network synchronization was altered in MECP2dup-derived neurons. Given MeCP2 functions at the epigenetic level, we tested whether these alterations were reversible using a library of compounds with defined activity on epigenetic pathways. One histone deacetylase inhibitor, NCH-51, was validated as a potential clinical candidate. Interestingly, this compound has never been considered before as a therapeutic alternative for neurological disorders. Our model recapitulates early stages of the human MECP2 duplication syndrome and represents a promising cellular tool to facilitate therapeutic drug screening for severe neurodevelopmental disorders.
Inverted repeats (IRs) can facilitate structural variation as crucibles of genomic rearrangement. Complex duplication-inverted triplication-duplication (DUP-TRP/INV-DUP) rearrangements that contain ...breakpoint junctions within IRs have been recently associated with both MECP2 duplication syndrome (MIM#300260) and Pelizaeus-Merzbacher disease (PMD, MIM#312080). We investigated 17 unrelated PMD subjects with copy number gains at the PLP1 locus including triplication and quadruplication of specific genomic intervals-16/17 were found to have a DUP-TRP/INV-DUP rearrangement product. An IR distal to PLP1 facilitates DUP-TRP/INV-DUP formation as well as an inversion structural variation found frequently amongst normal individuals. We show that a homology-or homeology-driven replicative mechanism of DNA repair can apparently mediate template switches within stretches of microhomology. Moreover, we provide evidence that quadruplication and potentially higher order amplification of a genomic interval can occur in a manner consistent with rolling circle amplification as predicted by the microhomology-mediated break induced replication (MMBIR) model.