Many genomes have been sequenced to high-quality draft status using Sanger capillary electrophoresis and/or newer short-read sequence data and whole genome assembly techniques. However, even the best draft genomes contain gaps and other imperfections due to limitations in the input data and the techniques used to build draft assemblies. Sequencing biases, repetitive genomic features, genomic polymorphism, and other complicating factors all come together to make some regions difficult or impossible to assemble. Traditionally, draft genomes were upgraded to "phase 3 finished" status using time-consuming and expensive Sanger-based manual finishing processes. For more facile assembly and automated finishing of draft genomes, we present here an automated approach to finishing using long-reads from the Pacific Biosciences RS (PacBio) platform. Our algorithm and associated software tool, PBJelly (publicly available at https://sourceforge.net/projects/pb-jelly/), automates the finishing process using long sequence reads in a reference-guided assembly process. PBJelly also provides "lift-over" co-ordinate tables to easily port existing annotations to the upgraded assembly. Using PBJelly and long PacBio reads, we upgraded the draft genome sequences of a simulated Drosophila melanogaster, the version 2 draft Drosophila pseudoobscura, an assembly of the Assemblathon 2.0 budgerigar dataset, and a preliminary assembly of the Sooty mangabey. With 24× mapped coverage of PacBio long-reads, we addressed 99% of gaps and were able to close 69% and improve 12% of all gaps in D. pseudoobscura. With 4× mapped coverage of PacBio long-reads we saw reads address 63% of gaps in our budgerigar assembly, of which 32% were closed and 63% improved. With 6.8× mapped coverage of mangabey PacBio long-reads we addressed 97% of gaps and closed 66% of addressed gaps and improved 19%. The accuracy of gap closure was validated by comparison to Sanger sequencing on gaps from the original D. pseudoobscura draft assembly and shown to be dependent on initial reference quality.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
IMPORTANCE: Clinical whole-exome sequencing is increasingly used for diagnostic evaluation of patients with suspected genetic disorders. OBJECTIVE: To perform clinical whole-exome sequencing and report (1) the rate of molecular diagnosis among phenotypic groups, (2) the spectrum of genetic alterations contributing to disease, and (3) the prevalence of medically actionable incidental findings such as FBN1 mutations causing Marfan syndrome. DESIGN, SETTING, AND PATIENTS: Observational study of 2000 consecutive patients with clinical whole-exome sequencing analyzed between June 2012 and August 2014. Whole-exome sequencing tests were performed at a clinical genetics laboratory in the United States. Results were reported by clinical molecular geneticists certified by the American Board of Medical Genetics and Genomics. Tests were ordered by the patient’s physician. The patients were primarily pediatric (1756 [88%]; mean age, 6 years; 888 females [44%], 1101 males [55%], and 11 fetuses [1%] of unknown gender), demonstrating diverse clinical manifestations most often including nervous system dysfunction such as developmental delay. MAIN OUTCOMES AND MEASURES: Whole-exome sequencing diagnosis rate overall and by phenotypic category, mode of inheritance, spectrum of genetic events, and reporting of incidental findings. RESULTS: A molecular diagnosis was reported for 504 patients (25.2%) with 58% of the diagnostic mutations not previously reported. Molecular diagnosis rates for each phenotypic category were 143/526 (27.2%; 95% CI, 23.5%-31.2%) for the neurological group, 282/1147 (24.6%; 95% CI, 22.1%-27.2%) for the neurological plus other organ systems group, 30/83 (36.1%; 95% CI, 26.1%-47.5%) for the specific neurological group, and 49/244 (20.1%; 95% CI, 15.6%-25.8%) for the nonneurological group.
The Mendelian disease patterns of the 527 molecular diagnoses included 280 (53.1%) autosomal dominant, 181 (34.3%) autosomal recessive (including 5 with uniparental disomy), 65 (12.3%) X-linked, and 1 (0.2%) mitochondrial. Of 504 patients with a molecular diagnosis, 23 (4.6%) had blended phenotypes resulting from 2 single gene defects. About 30% of the positive cases harbored mutations in disease genes reported since 2011. There were 95 medically actionable incidental findings in genes unrelated to the phenotype but with immediate implications for management in 92 patients (4.6%), including 59 patients (3%) with mutations in genes recommended for reporting by the American College of Medical Genetics and Genomics. CONCLUSIONS AND RELEVANCE: Whole-exome sequencing provided a potential molecular diagnosis for 25% of a large cohort of patients referred for evaluation of suspected genetic conditions, including detection of rare genetic events and new mutations contributing to disease. The yield of whole-exome sequencing may offer advantages over traditional molecular diagnostic approaches in certain patients.
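The diagnosis rates and 95% confidence intervals quoted in the abstract above can be approximated from the reported counts. The sketch below is illustrative only: the abstract does not state which interval method was used, so the Wilson score interval is an assumption, and the function and variable names are hypothetical. Computed bounds may differ slightly from the published ones.

```python
import math

def wilson_interval(successes, total, z=1.96):
    """Wilson score interval for a binomial proportion (assumed method)."""
    if total <= 0:
        raise ValueError("total must be positive")
    p = successes / total
    denom = 1 + z * z / total
    center = (p + z * z / (2 * total)) / denom
    half_width = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return center - half_width, center + half_width

# (diagnosed, tested) counts per phenotypic group, as reported in the abstract
groups = {
    "neurological": (143, 526),
    "neurological plus other organ systems": (282, 1147),
    "specific neurological": (30, 83),
    "nonneurological": (49, 244),
}

for name, (diagnosed, tested) in groups.items():
    low, high = wilson_interval(diagnosed, tested)
    rate = diagnosed / tested
    print(f"{name}: {rate:.1%} (95% CI, {low:.1%}-{high:.1%})")
```

The four groups sum to the 504 diagnosed patients and 2000 tested patients reported for the cohort, which is a quick consistency check on the counts.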
This study aims to identify the causative strain of SARS-CoV-2 in a cluster of vaccine breakthroughs. Vaccine breakthrough by a highly transmissible SARS-CoV-2 strain is a risk to global public health.
Nasopharyngeal swabs from suspected vaccine breakthrough cases were tested for SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) by qPCR (quantitative polymerase chain reaction) for Wuhan-Hu1 and the alpha variant. Positive samples were then sequenced with Swift Normalase Amplicon Panels to determine the causal variant. Variants called with GATK (Genome Analysis Toolkit) were filtered to an allele fraction ≥80% and a minimum read depth of 30×.
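The variant filter described above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes the allele-fraction threshold is 80% of reads at the site, and the `VariantCall` record, function names, and example calls are all hypothetical.

```python
from dataclasses import dataclass

@dataclass
class VariantCall:
    pos: int        # genomic position (illustrative values only)
    ref: str
    alt: str
    depth: int      # total reads covering the site
    alt_reads: int  # reads supporting the alternate allele

MIN_ALLELE_FRACTION = 0.80  # assumed to mean 80% of reads
MIN_DEPTH = 30              # minimum read depth of 30x

def allele_fraction(call):
    """Fraction of reads supporting the alternate allele."""
    return call.alt_reads / call.depth if call.depth else 0.0

def passes_filters(call, min_af=MIN_ALLELE_FRACTION, min_depth=MIN_DEPTH):
    """Keep a call only if it meets both the depth and allele-fraction cutoffs."""
    return call.depth >= min_depth and allele_fraction(call) >= min_af

def filter_calls(calls, min_af=MIN_ALLELE_FRACTION, min_depth=MIN_DEPTH):
    return [c for c in calls if passes_filters(c, min_af, min_depth)]

# Made-up calls to exercise the filter
example_calls = [
    VariantCall(100, "A", "G", depth=120, alt_reads=114),  # deep, high fraction
    VariantCall(200, "C", "T", depth=25, alt_reads=25),    # too shallow
    VariantCall(300, "G", "A", depth=60, alt_reads=30),    # fraction too low
]
kept = filter_calls(example_calls)
print([c.pos for c in kept])
```

Applying both cutoffs together keeps only well-supported, near-fixed variants, which is the usual goal when assigning a consensus lineage to a viral sample.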
Viral sequencing revealed an infection cluster of 6 vaccinated patients infected with the delta (B.1.617.2) SARS-CoV-2 variant. With no history of vaccine breakthrough, this suggests the delta variant may possess immune evasion in patients who received the Pfizer BNT162b2, Moderna mRNA-1273, and Covaxin BBV152 vaccines.
The delta variant may pose the highest risk of any currently circulating SARS-CoV-2 variant, with previously described increased transmissibility over the alpha variant and, now, possible vaccine breakthrough.
Parts of this work were supported by the National Institute of Allergy and Infectious Diseases (1U19AI144297) and Baylor College of Medicine internal funding.
Of over 7000 patients referred to a diagnostic laboratory, 28% had diagnoses based on DNA sequencing, 5% of whom had two or more diagnoses. Their phenotypes could be better understood by considering whether the implicated genes affect independent biologic processes or organ systems.
Medical genetics focuses on the relationship between observed phenotypes and their underlying genotypes, modes of transmission, and risks of recurrence. Expected patterns of mendelian inheritance are often used to confirm the identification of disease genes, and deviations from mendelian expectations have led to the discovery of more complicated genetic underpinnings of disease (Fig. S1 in the Supplementary Appendix, available with the full text of this article at NEJM.org).[1–8]
Multiple (or dual) molecular diagnoses involve more than one clinical diagnosis and more than one genetic locus (Figure 1), each segregating independently.
Diagnostic whole-exome sequencing affords opportunities for providing insights into relationships . . .
Our understanding of the evolutionary history of primates is undergoing continual revision due to ongoing genome sequencing efforts. Bolstered by growing fossil evidence, these data have led to increased acceptance of once controversial hypotheses regarding phylogenetic relationships, hybridization and introgression, and the biogeographical history of primate groups. Among these findings is a pattern of recent introgression between species within all major primate groups examined to date, though little is known about introgression deeper in time. To address this and other phylogenetic questions, here, we present new reference genome assemblies for 3 Old World monkey (OWM) species: Colobus angolensis ssp. palliatus (the black and white colobus), Macaca nemestrina (southern pig-tailed macaque), and Mandrillus leucophaeus (the drill). We combine these data with 23 additional primate genomes to estimate both the species tree and individual gene trees using thousands of loci. While our species tree is largely consistent with previous phylogenetic hypotheses, the gene trees reveal high levels of genealogical discordance associated with multiple primate radiations. We use strongly asymmetric patterns of gene tree discordance around specific branches to identify multiple instances of introgression between ancestral primate lineages. In addition, we exploit recent fossil evidence to perform fossil-calibrated molecular dating analyses across the tree. Taken together, our genome-wide data help to resolve multiple contentious sets of relationships among primates, while also providing insight into the biological processes and technical artifacts that led to the disagreements in the first place.
Megacystis-microcolon-intestinal hypoperistalsis syndrome (MMIHS) is a rare disorder of enteric smooth muscle function affecting the intestine and bladder. Patients with this severe phenotype are dependent on total parenteral nutrition and urinary catheterization. The cause of this syndrome has remained a mystery since Berdon's initial description in 1976. No genes have been clearly linked to MMIHS. We used whole-exome sequencing for gene discovery followed by targeted Sanger sequencing in a cohort of patients with MMIHS and intestinal pseudo-obstruction. We identified heterozygous ACTG2 missense variants in 15 unrelated subjects, ten being apparent de novo mutations. Ten unique variants were detected, of which six affected CpG dinucleotides and resulted in missense mutations at arginine residues, perhaps related to biased usage of CpG containing codons within actin genes. We also found some of the same heterozygous mutations that we observed as apparent de novo mutations in MMIHS segregating in families with intestinal pseudo-obstruction, suggesting that ACTG2 is responsible for a spectrum of smooth muscle disease. ACTG2 encodes γ2 enteric actin and is the first gene to be clearly associated with MMIHS, suggesting an important role for contractile proteins in enteric smooth muscle disease.
Most studies describing the human gut microbiome in healthy and diseased states have emphasized the bacterial component, but the fungal microbiome (i.e., the mycobiome) is beginning to gain recognition as a fundamental part of our microbiome. To date, human gut mycobiome studies have primarily been disease centric or in small cohorts of healthy individuals. To contribute to existing knowledge of the human mycobiome, we investigated the gut mycobiome of the Human Microbiome Project (HMP) cohort by sequencing the Internal Transcribed Spacer 2 (ITS2) region as well as the 18S rRNA gene.
Three hundred seventeen HMP stool samples were analyzed by ITS2 sequencing. Fecal fungal diversity was significantly lower in comparison to bacterial diversity. Yeast dominated the samples, comprising eight of the top 15 most abundant genera. Specifically, fungal communities were characterized by a high prevalence of Saccharomyces, Malassezia, and Candida, with S. cerevisiae, M. restricta, and C. albicans operational taxonomic units (OTUs) present in 96.8, 88.3, and 80.8% of samples, respectively. There was a high degree of inter- and intra-volunteer variability in fungal communities. However, S. cerevisiae, M. restricta, and C. albicans OTUs were found in 92.2, 78.3, and 63.6% of volunteers, respectively, in all samples donated over an approximately 1-year period. Metagenomic and 18S rRNA gene sequencing data agreed with ITS2 results; however, ITS2 sequencing provided greater resolution of the relatively low abundance mycobiome constituents.
Compared to bacterial communities, the human gut mycobiome is low in diversity and dominated by yeast including Saccharomyces, Malassezia, and Candida. Both inter- and intra-volunteer variability in the HMP cohort were high, revealing that unlike bacterial communities, an individual's mycobiome is no more similar to itself over time than to another person's. Nonetheless, several fungal species persisted across a majority of samples, evidence that a core gut mycobiome may exist. ITS2 sequencing data provided greater resolution of the mycobiome membership compared to metagenomic and 18S rRNA gene sequencing data, suggesting that it is a more sensitive method for studying the mycobiome of stool samples.
The human X and Y chromosomes evolved from an ordinary pair of autosomes, but millions of years ago genetic decay ravaged the Y chromosome, and only three per cent of its ancestral genes survived. We reconstructed the evolution of the Y chromosome across eight mammals to identify biases in gene content and the selective pressures that preserved the surviving ancestral genes. Our findings indicate that survival was nonrandom, and in two cases, convergent across placental and marsupial mammals. We conclude that the gene content of the Y chromosome became specialized through selection to maintain the ancestral dosage of homologous X-Y gene pairs that function as broadly expressed regulators of transcription, translation and protein stability. We propose that beyond its roles in testis determination and spermatogenesis, the Y chromosome is essential for male viability, and has unappreciated roles in Turner's syndrome and in phenotypic differences between the sexes in health and disease.
Meningiomas account for one-third of all primary brain tumors. Although typically benign, about 20% of meningiomas are aggressive, and despite the rigor of the current histopathological classification system there remains considerable uncertainty in predicting tumor behavior. Here, we analyzed 160 tumors from all 3 World Health Organization (WHO) grades (I through III) using clinical, gene expression, and sequencing data. Unsupervised clustering analysis identified 3 molecular types (A, B, and C) that reliably predicted recurrence. These groups did not directly correlate with the WHO grading system, which classifies more than half of the tumors in the most aggressive molecular type as benign. Transcriptional and biochemical analyses revealed that aggressive meningiomas involve loss of the repressor function of the DREAM complex, which results in cell-cycle activation; only tumors in this category tend to recur after full resection. These findings should improve our ability to predict recurrence and develop targeted treatments for these clinically challenging tumors.