Unexplained fever (UF) is a common problem in children under 3 years old. Although virus infection is suspected to be the cause of most of these fevers, a comprehensive analysis of viruses in samples ...from children with fever and healthy controls is important for establishing a relationship between viruses and UF. We used unbiased, deep sequencing to analyze 176 nasopharyngeal swabs (NP) and plasma samples from children with UF and afebrile controls, generating an average of 4.6 million sequences per sample. An analysis pipeline was developed to detect viral sequences, which resulted in the identification of sequences from 25 viral genera. These genera included expected pathogens, such as adenoviruses, enteroviruses, and roseoloviruses, plus viruses with unknown pathogenicity. Viruses that were unexpected in NP and plasma samples, such as the astrovirus MLB-2, were also detected. Sequencing allowed identification of virus subtype for some viruses, including roseoloviruses. Highly sensitive PCR assays detected low levels of viruses that were not detected in approximately 5 million sequences, but greater sequencing depth improved sensitivity. On average NP and plasma samples from febrile children contained 1.5- to 5-fold more viral sequences, respectively, than samples from afebrile children. Samples from febrile children contained a broader range of viral genera and contained multiple viral genera more frequently than samples from children without fever. Differences between febrile and afebrile groups were most striking in the plasma samples, where detection of viral sequence may be associated with a disseminated infection. These data indicate that virus infection is associated with UF. Further studies are important in order to establish the range of viral pathogens associated with fever and to understand of the role of viral infection in fever. Ultimately these studies may improve the medical treatment of children with UF by helping avoid antibiotic therapy for children with viral infections.
BACKGROUND: Characterizing the biogeography of the microbiome of healthy humans is essential for understanding microbial associated diseases. Previous studies mainly focused on a single body habitat ...from a limited set of subjects. Here, we analyzed one of the largest microbiome datasets to date and generated a biogeographical map that annotates the biodiversity, spatial relationships, and temporal stability of 22 habitats from 279 healthy humans. RESULTS: We identified 929 genera from more than 24 million 16S rRNA gene sequences of 22 habitats, and we provide a baseline of inter-subject variation for healthy adults. The oral habitat has the most stable microbiota with the highest alpha diversity, while the skin and vaginal microbiota are less stable and show lower alpha diversity. The level of biodiversity in one habitat is independent of the biodiversity of other habitats in the same individual. The abundances of a given genus at a body site in which it dominates do not correlate with the abundances at body sites where it is not dominant. Additionally, we observed the human microbiota exhibit both cosmopolitan and endemic features. Finally, comparing datasets of different projects revealed a project-based clustering pattern, emphasizing the significance of standardization of metagenomic studies. CONCLUSIONS: The data presented here extend the definition of the human microbiome by providing a more complete and accurate picture of human microbiome biogeography, addressing questions best answered by a large dataset of subjects and body sites that are deeply sampled by sequencing.
Understanding the tissue-specific genetic controls of protein levels is essential to uncover mechanisms of post-transcriptional gene regulation. In this study, we generated a genomic atlas of protein ...levels in three tissues relevant to neurological disorders (brain, cerebrospinal fluid and plasma) by profiling thousands of proteins from participants with and without Alzheimer's disease. We identified 274, 127 and 32 protein quantitative trait loci (pQTLs) for cerebrospinal fluid, plasma and brain, respectively. cis-pQTLs were more likely to be tissue shared, but trans-pQTLs tended to be tissue specific. Between 48.0% and 76.6% of pQTLs did not co-localize with expression, splicing, DNA methylation or histone acetylation QTLs. Using Mendelian randomization, we nominated proteins implicated in neurological diseases, including Alzheimer's disease, Parkinson's disease and stroke. This first multi-tissue study will be instrumental to map signals from genome-wide association studies onto functional genes, to discover pathways and to identify drug targets for neurological diseases.
The Human Microbiome Project (HMP) was undertaken with the goal of defining microbial communities in and on the bodies of healthy individuals using high-throughput, metagenomic sequencing analysis. ...The viruses present in these microbial communities, the 'human virome', are an important aspect of the human microbiome that is particularly understudied in the absence of overt disease. We analyzed eukaryotic double-stranded DNA (dsDNA) viruses, together with dsDNA replicative intermediates of single-stranded DNA viruses, in metagenomic sequence data generated by the HMP. 706 samples from 102 subjects were studied, with each subject sampled at up to five major body habitats: nose, skin, mouth, vagina, and stool. Fifty-one individuals had samples taken at two or three time points 30 to 359 days apart from at least one of the body habitats.
We detected an average of 5.5 viral genera in each individual. At least 1 virus was detected in 92% of the individuals sampled. These viruses included herpesviruses, papillomaviruses, polyomaviruses, adenoviruses, anelloviruses, parvoviruses, and circoviruses. Each individual had a distinct viral profile, demonstrating the high interpersonal diversity of the virome. Some components of the virome were stable over time.
This study is the first to use high-throughput DNA sequencing to describe the diversity of eukaryotic dsDNA viruses in a large cohort of normal individuals who were sampled at multiple body sites. Our results show that the human virome is a complex component of the microbial flora. Some viruses establish long-term infections that may be associated with increased risk or possibly with protection from disease. A better understanding of the composition and dynamics of the virome may hold important keys to human health.
Alzheimer's disease (AD) is the most common form of dementia. This neurodegenerative disorder is associated with neuronal death and gliosis heavily impacting the cerebral cortex. AD has a substantial ...but heterogeneous genetic component, presenting both Mendelian and complex genetic architectures. Using bulk RNA-seq from the parietal lobes and deconvolution methods, we previously reported that brains exhibiting different AD genetic architecture exhibit different cellular proportions. Here, we sought to directly investigate AD brain changes in cell proportion and gene expression using single-cell resolution.
We generated unsorted single-nuclei RNA sequencing data from brain tissue. We leveraged the tissue donated from a carrier of a Mendelian genetic mutation, PSEN1 p.A79V, and two family members who suffer from sporadic AD, but do not carry any autosomal mutations. We evaluated alternative alignment approaches to maximize the titer of reads, genes, and cells with high quality. In addition, we employed distinct clustering strategies to determine the best approach to identify cell clusters that reveal neuronal and glial cell types and avoid artifacts such as sample and batch effects. We propose an approach to cluster cells that reduces biases and enable further analyses.
We identified distinct types of neurons, both excitatory and inhibitory, and glial cells, including astrocytes, oligodendrocytes, and microglia, among others. In particular, we identified a reduced proportion of excitatory neurons in the Mendelian mutation carrier, but a similar distribution of inhibitory neurons. Furthermore, we investigated whether single-nuclei RNA-seq from the human brains recapitulate the expression profile of disease-associated microglia (DAM) discovered in mouse models. We also determined that when analyzing human single-nuclei data, it is critical to control for biases introduced by donor-specific expression profiles.
We propose a collection of best practices to generate a highly detailed molecular cell atlas of highly informative frozen tissue stored in brain banks. Importantly, we have developed a new web application to make this unique single-nuclei molecular atlas publicly available.
Determining bacterial abundance variation is the first step in understanding bacterial similarity between individuals. Categorization of bacterial communities into groups or community classes is the ...subsequent step in describing microbial distribution based on abundance patterns. Here, we present an analysis of the groupings of bacterial communities in stool, nasal, skin, vaginal and oral habitats in a healthy cohort of 236 subjects from the Human Microbiome Project.
We identify distinct community group patterns in the anterior nares, four skin sites, and vagina at the genus level. We also confirm three enterotypes previously identified in stools. We identify two clusters with low silhouette values in most oral sites, in which bacterial communities are more homogeneous. Subjects sharing a community class in one habitat do not necessarily share a community class in another, except in the three vaginal sites and the symmetric habitats of the left and right retroauricular creases. Demographic factors, including gender, age, and ethnicity, significantly influence community composition in several habitats. Community classes in the vagina, retroauricular crease and stool are stable over approximately 200 days.
The community composition, association of demographic factors with community classes, and demonstration of community stability deepen our understanding of the variability and dynamics of human microbiomes. This also has significant implications for experimental designs that seek microbial correlations with clinical phenotypes.
The human gut harbors thousands of bacterial taxa. A profusion of metagenomic sequence data has been generated from human stool samples in the last few years, raising the question of whether more ...taxa remain to be identified. We assessed metagenomic data generated by the Human Microbiome Project Consortium to determine if novel taxa remain to be discovered in stool samples from healthy individuals. To do this, we established a rigorous bioinformatics pipeline that uses sequence data from multiple platforms (Illumina GAIIX and Roche 454 FLX Titanium) and approaches (whole-genome shotgun and 16S rDNA amplicons) to validate novel taxa. We applied this approach to stool samples from 11 healthy subjects collected as part of the Human Microbiome Project. We discovered several low-abundance, novel bacterial taxa, which span three major phyla in the bacterial tree of life. We determined that these taxa are present in a larger set of Human Microbiome Project subjects and are found in two sampling sites (Houston and St. Louis). We show that the number of false-positive novel sequences (primarily chimeric sequences) would have been two orders of magnitude higher than the true number of novel taxa without validation using multiple datasets, highlighting the importance of establishing rigorous standards for the identification of novel taxa in metagenomic data. The majority of novel sequences are related to the recently discovered genus Barnesiella, further encouraging efforts to characterize the members of this genus and to study their roles in the microbial communities of the gut. A better understanding of the effects of less-abundant bacteria is important as we seek to understand the complex gut microbiome in healthy individuals and link changes in the microbiome to disease.
The gut virome includes eukaryotic viruses and bacteriophages that can shape the gut bacterial community and elicit host responses. The virome can be implicated in diseases, such as irritable bowel ...syndrome (IBS), where gut bacteria play an important role in pathogenesis. We provide a comprehensive and longitudinal characterization of the virome, including DNA and RNA viruses and paired multi-omics data in a cohort of healthy subjects and patients with IBS.
We selected 2 consecutive stool samples per subject from a longitudinal study cohort and performed metagenomic sequencing on DNA and RNA viruses after enriching for viral-like particles. Viral sequence abundance was evaluated over time, as well as in the context of diet, bacterial composition and function, metabolite levels, colonic gene expression, host genetics, and IBS subsets.
We found that the gut virome was temporally stable and correlated with the colonic transcriptome. We identified IBS-subset–specific changes in phage populations; Microviridae, Myoviridae, and Podoviridae species were elevated in diarrhea-predominant IBS, and other Microviridae and Myoviridae species were elevated in constipation-predominant IBS compared to healthy controls. We identified correlations between subsets of the virome and bacterial composition (unclassifiable “dark matter” and phages) and diet (eukaryotic viruses).
We found that the gut virome is stable over time but varies among subsets of patients with IBS. It can be affected by diet and potentially influences host function via interactions with gut bacteria and/or altering host gene expression.
Display omitted
The gut virome is temporally stable, can be affected by diet, and potentially influences host function via interactions with gut bacteria and/or altering host gene expression.
Low frequency coding variants in TREM2 are associated with Alzheimer disease (AD) risk and cerebrospinal fluid (CSF) TREM2 protein levels are different between AD cases and controls. Similarly, TREM2 ...risk variant carriers also exhibit differential CSF TREM2 levels. TREM2 has three different alternative transcripts, but most of the functional studies only model the longest transcript. No studies have analyzed TREM2 expression levels or alternative splicing in brains from AD and cognitively normal individuals. We wanted to determine whether there was differential expression of TREM2 in sporadic-AD cases versus AD-TREM2 carriers vs sex- and aged-matched normal controls; and if this differential expression was due to a particular TREM2 transcript.
We analyzed RNA-Seq data from parietal lobe brain tissue from AD cases with TREM2 variants (n = 33), AD cases (n = 195) and healthy controls (n = 118), from three independent datasets using Kallisto and the R package tximport to determine the read count for each transcript and quantified transcript abundance as transcripts per million.
The three TREM2 transcripts were expressed in brain cortex in the three datasets. We demonstrate for the first time that the transcript that lacks the transmembrane domain and encodes a soluble form of TREM2 (sTREM2) has an expression level around 60% of the canonical transcript, suggesting that around 25% of the sTREM2 protein levels could be explained by this transcript. We did not observe a difference in the overall TREM2 expression level between cases and controls. However, the isoform which lacks the 5' exon, but includes the transmembrane domain, was significantly lower in TREM2- p.R62H carriers than in AD cases (p = 0.007).
Using bulk RNA-Seq data from three different cohorts, we were able to quantify the expression level of the three TREM2 transcripts, demonstrating: (1) all three transcripts of them are highly expressed in the human cortex, (2) that up to 25% of the sTREM2 may be due to the expression of a specific isoform and not TREM2 cleavage; and (3) that TREM2 risk variants do not affect expression levels, suggesting that the effect of the TREM2 variants on CSF levels occurs at post-transcriptional level.