Abstract
Emerging evidence links genes within human-specific segmental duplications (HSDs) to traits and diseases unique to our species. Strikingly, despite being nearly identical by sequence (>98.5%), paralogous HSD genes are differentially expressed across human cell and tissue types, though the underlying mechanisms have not been examined. We compared cross-tissue mRNA levels of 75 HSD genes from 30 families between humans and chimpanzees and found expression patterns consistent with relaxed selection on or neofunctionalization of derived paralogs. In general, ancestral paralogs exhibited the greatest expression conservation with chimpanzee orthologs, though exceptions suggest certain derived paralogs may retain or supplant ancestral functions. Concordantly, analysis of long-read isoform sequencing data sets from diverse human tissues and cell lines found that about half of derived paralogs exhibited globally lower expression. To understand mechanisms underlying these differences, we leveraged data from human lymphoblastoid cell lines (LCLs) and found no relationship between paralogous expression divergence and post-transcriptional regulation, sequence divergence, or copy-number variation. Considering cis-regulation, we reanalyzed ENCODE data and recovered hundreds of previously unidentified candidate cis-regulatory elements (CREs) in HSDs. We also generated large-insert ChIP-sequencing data for active chromatin features in an LCL to better distinguish paralogous regions. Some duplicated CREs were sufficient to drive differential reporter activity, suggesting they may contribute to divergent cis-regulation of paralogous genes. This work provides evidence that cis-regulatory divergence contributes to novel expression patterns of recent gene duplicates in humans.
Microrchidia (MORC) proteins are GHKL (gyrase, heat-shock protein 90, histidine kinase, MutL) ATPases that function in gene regulation in multiple organisms. Animal MORCs also contain CW-type zinc finger domains, which are known to bind to modified histones. We solved the crystal structure of the murine MORC3 ATPase-CW domain bound to the nucleotide analog AMPPNP (phosphoaminophosphonic acid-adenylate ester) and in complex with a trimethylated histone H3 lysine 4 (H3K4) peptide (H3K4me3). We observed that the MORC3 N-terminal ATPase domain forms a dimer when bound to AMPPNP. We used native mass spectrometry to show that dimerization is ATP-dependent, and that dimer formation is enhanced in the presence of nonhydrolyzable ATP analogs. The CW domain uses an aromatic cage to bind trimethylated Lys4 and forms extensive hydrogen bonds with the H3 tail. We found that MORC3 localizes to promoters marked by H3K4me3 throughout the genome, consistent with its binding to H3K4me3 in vitro. Our work sheds light on aspects of the molecular dynamics and function of MORC3.
Structural variants (SVs), including duplications, deletions, and inversions of DNA, can have significant genomic and functional impacts but are technically difficult to identify and assay compared with single-nucleotide variants. With the aid of new genomic technologies, it has become clear that SVs account for significant differences across and within species. This phenomenon is particularly well-documented for humans and other primates due to the wealth of sequence data available. In great apes, SVs affect a larger number of nucleotides than single-nucleotide variants, with many identified SVs exhibiting population and species specificity. In this review, we highlight the importance of SVs in human evolution by discussing (1) how they have shaped great ape genomes, resulting in sensitized regions associated with traits and diseases, (2) their impact on gene functions and regulation, which subsequently has played a role in natural selection, and (3) the role of gene duplications in human brain evolution. We further discuss how to incorporate SVs in research, including the strengths and limitations of various genomic approaches. Finally, we propose future considerations in integrating existing data and biospecimens with the ever-expanding SV compendium propelled by biotechnology advancements.
ARC syndrome (OMIM 208085) is an autosomal recessive multisystem disorder characterized by neurogenic arthrogryposis multiplex congenita, renal tubular dysfunction and neonatal cholestasis with bile duct hypoplasia and low gamma glutamyl transpeptidase (gGT) activity. Platelet dysfunction is common. Affected infants do not thrive and usually die in the first year of life. To elucidate the molecular basis of ARC, we mapped the disease to a 7-cM interval on 15q26.1 and then identified germline mutations in the gene VPS33B in 14 kindreds with ARC. VPS33B encodes a homolog of the class C yeast vacuolar protein sorting gene, Vps33, that contains a Sec1-like domain important in the regulation of vesicle-to-target SNARE complex formation and subsequent membrane fusion.