Esteban Gonzalez Burchard and colleagues explore how making medical research more diverse would aid not only social justice but scientific quality and clinical effectiveness, too.
In epigenome-wide association studies (EWAS), different methylation profiles of distinct cell types may lead to false discoveries. We introduce ReFACTor, a method based on principal component ...analysis (PCA) and designed for the correction of cell type heterogeneity in EWAS. ReFACTor does not require knowledge of cell counts, and it provides improved estimates of cell type composition, resulting in improved power and control for false positives in EWAS. Corresponding software is available at http://www.cs.tau.ac.il/~heran/cozygene/software/refactor.html.
Background Asthma is an inflammatory condition often punctuated by episodic symptomatic worsening, and accordingly, patients with asthma might have waxing and waning adherence to controller therapy. ...Objective We sought to measure changes in inhaled corticosteroid (ICS) adherence over time and to estimate the effect of this changing pattern of use on asthma exacerbations. Methods ICS adherence was estimated from electronic prescription and fill information for 298 participants in the Study of Asthma Phenotypes and Pharmacogenomic Interactions by Race-Ethnicity. For each patient, we calculated a moving average of ICS adherence for each day of follow-up. Asthma exacerbations were defined as the need for oral corticosteroids, an asthma-related emergency department visit, or an asthma-related hospitalization. Proportional hazard models were used to assess the relationship between ICS medication adherence and asthma exacerbations. Results Adherence to ICS medications began to increase before the first asthma exacerbation and continued afterward. Adherence was associated with a reduction in exacerbations but was only statistically significant among patients whose adherence was greater than 75% of the prescribed dose (hazard ratio, 0.61; 95% CI, 0.41-0.90) when compared with patients whose adherence was 25% or less. This pattern was largely confined to patients whose asthma was not well controlled initially. An estimated 24% of asthma exacerbations were attributable to ICS medication nonadherence. Conclusions ICS adherence varies in the time period leading up to and after an asthma exacerbation, and nonadherence likely contributes to a large number of these exacerbations. High levels of adherence are likely required to prevent these events.
Populations of non-European descent are harmed because they are not studied as intensely, and clues that could reveal new aspects of disease biology are missed. Including diverse populations in ...clinical and biomedical research is a must, ethically and scientifically. Research infrastructure needs to be retooled accordingly.
Genomic imprinting is an important regulatory mechanism that silences one of the parental copies of a gene. To systematically characterize this phenomenon, we analyze tissue specificity of imprinting ...from allelic expression data in 1582 primary tissue samples from 178 individuals from the Genotype-Tissue Expression (GTEx) project. We characterize imprinting in 42 genes, including both novel and previously identified genes. Tissue specificity of imprinting is widespread, and gender-specific effects are revealed in a small number of genes in muscle with stronger imprinting in males. IGF2 shows maternal expression in the brain instead of the canonical paternal expression elsewhere. Imprinting appears to have only a subtle impact on tissue-specific expression levels, with genes lacking a systematic expression difference between tissues with imprinted and biallelic expression. In summary, our systematic characterization of imprinting in adult tissues highlights variation in imprinting between genes, individuals, and tissues.
Research in transformed immortalized cell lines indicates the cadherin-related family member 3 (CDHR3) protein serves as a receptor for human rhinovirus (HRV)–C. Similar experiments indicate that the ...CDHR3 coding variant rs6967330 increases CDHR3 protein surface expression.
We sought to determine whether CDHR3 is necessary for HRV-C infection of primary airway epithelial cells (AECs) and to identify molecular mechanisms by which CDHR3 variants confer risk for asthma exacerbations.
CDHR3 function and influence on HRV-C infection were investigated by using single-cell transcriptomics, CRISPR-Cas9 gene knockout, and genotype-specific donor experiments performed in primary AECs. Nasal airway epithelium cis–expression quantitative trait locus (eQTL) analysis of CDHR3 was performed, followed by association testing for asthma hospitalization in minority children.
CDHR3 lung expression is exclusive to ciliated AECs and associated with basal bodies during and after motile ciliogenesis. Knockout of CDHR3 in human AECs did not prevent ciliated cell differentiation but was associated with a decrease in transepithelial resistance and an 80% decrease in HRV-C infection of the mucociliary epithelium. AECs from subjects homozygous for the risk-associated rs6967330 single nucleotide polymorphism (SNP) exhibited greater HRV-C infection compared with cells homozygous for the nonrisk allele. AEC cis-eQTL analysis indicated that rs6967330 and other SNPs are eQTLs for CDHR3. Only the eQTL block containing the rs6967330 SNP showed a significant association with childhood asthma hospitalization.
Genetic deletion and genotype-specific studies in primary AECs indicate CDHR3 is critical to HRV-C infection of ciliated cells. The rs6967330 SNP confers risk of severe childhood asthma exacerbations, likely through increasing HRV-C infection levels and protein surface localization.
Display omitted
We used a deeply sequenced dataset of 910 individuals, all of African descent, to construct a set of DNA sequences that is present in these individuals but missing from the reference human genome. We ...aligned 1.19 trillion reads from the 910 individuals to the reference genome (GRCh38), collected all reads that failed to align, and assembled these reads into contiguous sequences (contigs). We then compared all contigs to one another to identify a set of unique sequences representing regions of the African pan-genome missing from the reference genome. Our analysis revealed 296,485,284 bp in 125,715 distinct contigs present in the populations of African descent, demonstrating that the African pan-genome contains ~10% more DNA than the current human reference genome. Although the functional significance of nearly all of this sequence is unknown, 387 of the novel contigs fall within 315 distinct protein-coding genes, and the rest appear to be intergenic.
The genetic control of gene expression is a core component of human physiology. For the past several years, transcriptome-wide association studies have leveraged large datasets of linked genotype and ...RNA sequencing information to create a powerful gene-based test of association that has been used in dozens of studies. While numerous discoveries have been made, the populations in the training data are overwhelmingly of European descent, and little is known about the generalizability of these models to other populations. Here, we test for cross-population generalizability of gene expression prediction models using a dataset of African American individuals with RNA-Seq data in whole blood. We find that the default models trained in large datasets such as GTEx and DGN fare poorly in African Americans, with a notable reduction in prediction accuracy when compared to European Americans. We replicate these limitations in cross-population generalizability using the five populations in the GEUVADIS dataset. Via realistic simulations of both populations and gene expression, we show that accurate cross-population generalizability of transcriptome prediction only arises when eQTL architecture is substantially shared across populations. In contrast, models with non-identical eQTLs showed patterns similar to real-world data. Therefore, generating RNA-Seq data in diverse populations is a critical step towards multi-ethnic utility of gene expression prediction.