Genome-wide association studies are widely used to investigate the genetic basis of diseases and traits, but they pose many computational challenges. We developed gdsfmt and SNPRelate (R packages for ...multi-core symmetric multiprocessing computer architectures) to accelerate two key computations on SNP data: principal component analysis (PCA) and relatedness analysis using identity-by-descent measures. The kernels of our algorithms are written in C/C++ and highly optimized. Benchmarks show the uniprocessor implementations of PCA and identity-by-descent are ∼8-50 times faster than the implementations provided in the popular EIGENSTRAT (v3.0) and PLINK (v1.07) programs, respectively, and can be sped up to 30-300-fold by using eight cores. SNPRelate can analyse tens of thousands of samples with millions of SNPs. For example, our package was used to perform PCA on 55 324 subjects from the 'Gene-Environment Association Studies' consortium studies.
The turnover of the intestinal epithelium is driven by multipotent LGR5
crypt-base columnar cells (CBCs) located at the bottom of crypt zones
. However, CBCs are lost following injury, such as ...irradiation
, but the intestinal epithelium is nevertheless able to recover
. Thus, a second population of quiescent '+4' cells, or reserve stem cells (RSCs), has previously been proposed to regenerate the damaged intestine
. Although CBCs and RSCs were thought to be mutually exclusive
, subsequent studies have found that LGR5
CBCs express RSC markers
and that RSCs were dispensable-whereas LGR5
cells were essential-for repair of the damaged intestine
. In addition, progenitors of absorptive enterocytes
, secretory cells
and slow cycling LGR5
cells
have been shown to contribute to regeneration whereas the transcriptional regulator YAP1, which is important for intestinal regeneration, was suggested to induce a pro-survival phenotype in LGR5
cells
. Thus, whether cellular plasticity or distinct cell populations are critical for intestinal regeneration remains unknown. Here we applied single-cell RNA sequencing to profile the regenerating mouse intestine and identified a distinct, damage-induced quiescent cell type that we term the revival stem cell (revSC). revSCs are marked by high clusterin expression and are extremely rare under homoeostatic conditions, yet give rise-in a temporal hierarchy-to all the major cell types of the intestine, including LGR5
CBCs. After intestinal damage by irradiation, targeted ablation of LGR5
CBCs, or treatment with dextran sodium sulfate, revSCs undergo a YAP1-dependent transient expansion, reconstitute the LGR5
CBC compartment and are required to regenerate a functional intestine. These studies thus define a unique stem cell that is mobilized by damage to revive the homoeostatic stem cell compartment and regenerate the intestinal epithelium.
GWASTools is an R/Bioconductor package for quality control and analysis of genome-wide association studies (GWAS). GWASTools brings the interactive capability and extensive statistical libraries of R ...to GWAS. Data are stored in NetCDF format to accommodate extremely large datasets that cannot fit within R's memory limits. The documentation includes instructions for converting data from multiple formats, including variants called from sequencing. GWASTools provides a convenient interface for linking genotypes and intensity data with sample and single nucleotide polymorphism annotation.
Population scale sweeps of viral pathogens, such as SARS-CoV-2, require high intensity testing for effective management. Here, we describe "Systematic Parallel Analysis of RNA coupled to Sequencing ...for Covid-19 screening" (C19-SPAR-Seq), a multiplexed, scalable, readily automated platform for SARS-CoV-2 detection that is capable of analyzing tens of thousands of patient samples in a single run. To address strict requirements for control of assay parameters and output demanded by clinical diagnostics, we employ a control-based Precision-Recall and Receiver Operator Characteristics (coPR) analysis to assign run-specific quality control metrics. C19-SPAR-Seq coupled to coPR on a trial cohort of several hundred patients performs with a specificity of 100% and sensitivity of 91% on samples with low viral loads, and a sensitivity of >95% on high viral loads associated with disease onset and peak transmissibility. This study establishes the feasibility of employing C19-SPAR-Seq for the large-scale monitoring of SARS-CoV-2 and other pathogens.
Abstract Molecular profiling of individual cancers is key to personalised medicine. While sequencing technologies have required stringent sample collection and handling, recent technical advances ...offer sequencing from tissues collected in routine practice and tissues already stored in archives. In this paper, we establish methods for whole-transcriptome RNA sequencing (RNA-seq) from formalin-fixed paraffin-embedded tissues. We obtain average RNA-seq reads of >100 million per sample using the Illumina HiSeq2000 platform. We find high concordance with results from matching fresh frozen samples (>0.8 Spearman correlation). For validation, we compared low- and high-grade bladder cancer transcriptomes in 49 tumour samples after transurethral resection of bladder tumour. We found 947 differentially expressed protein-coding genes. While high-grade lesions exhibited distinct intertumour transcriptome heterogeneity, the transcriptome of low-grade tumours was homogeneous. Patient summary In this report, we show that it is now possible to use universally available bladder cancer samples that have been fixed in formalin to perform high-quality transcriptome analysis. This ability will facilitate the development of transcriptome-wide tests based on gene expression correlated with clinical outcome.
There have been few genome-wide association studies (GWAS) of prostate cancer among diverse populations. To search for novel prostate cancer risk variants, we conducted GWAS of prostate cancer in ...Japanese and Latinos. In addition, we tested prostate cancer risk variants and developed genetic risk models of prostate cancer for Japanese and Latinos.
Our first-stage GWAS of prostate cancer included Japanese (cases/controls = 1,033/1,042) and Latino (cases/controls = 1,043/1,057) from the Multiethnic Cohort (MEC). Significant associations from stage I (P < 1.0 × 10(-4)) were examined in silico in GWAS of prostate cancer (stage II) in Japanese (cases/controls = 1,583/3,386) and Europeans (cases/controls = 1,854/1,894).
No novel stage I single-nucleotide polymorphism (SNP) outside of known risk regions reached genome-wide significance. For Japanese, in stage I, the most notable putative novel association was seen with 10 SNPs (P ≤ 8.0 × 10(-6)) at chromosome 2q33; however, this was not replicated in stage II. For Latinos, the most significant association was observed with rs17023900 at the known 3p12 risk locus (stage I: OR = 1.45; P = 7.01 × 10(-5) and stage II: OR = 1.58; P = 3.05 × 10(-7)). The majority of the established risk variants for prostate cancer, 79% and 88%, were positively associated with prostate cancer in Japanese and Latinos (stage I), respectively. The cumulative effects of these variants significantly influence prostate cancer risk (OR per allele = 1.10; P = 2.71 × 10(-25) and OR = 1.07; P = 1.02 × 10(-16) for Japanese and Latinos, respectively).
Our GWAS of prostate cancer did not identify novel genome-wide significant variants. However, our findings show that established risk variants for prostate cancer significantly contribute to risk among Japanese and Latinos.
Abstract
Alterations of the Hippo signaling pathway have been found in patients with sarcomatoid renal cell carcinoma, a highly aggressive kidney disease with rapid progression and poor prognosis. It ...is unclear whether deregulated Hippo pathway is a primary cause of sarcomatoid cancers. To investigate whether conditional (kidney-specific) Hippo knockout leads to renal cancer development and/or sarcomatoid dedifferentiation, we conditionally inactivated both Lats1 and Lats2 from the adult renal proximal tubule epithelial cells. Mosaic deletion of Lats1 and Lats2 in the adult proximal tubules results in cell-autonomous sarcomatoid renal cell carcinoma ((54%) as early as 8 weeks after gene deletion) and lung metastasis. RNA sequencing of tumors compared to control renal cortices revealed massive transcriptional changes, with 2,292 genes significantly upregulated (FC>2) and 2261 genes significantly downregulated (FCË‚0.5). To visualize the cellular onset and progression of renal defects, we conducted histologic examination at different time points after induction of Lats1/2 deletion. Mutant kidneys displayed tubular atrophy with glomerular cysts. Loss of proximal tubule epithelial markers, and de novo expression of the mesenchymal marker vimentin, indicated that kidney epithelial cells had undergone an epithelial-to-mesenchymal transition (EMT). These studies demonstrate that loss of Lats1/2 rapidly leads to aggressive, metastatic sarcomatoid renal cell carcinoma.
Citation Format: Antoine Reginensi, Sanjay Jain, Benjamin Humphreys, Jess Shen, Didier Hodzic, Jeff Wrana, Helen McNeill. Adult deletion of Lats1/2 leads to sarcomatoid renal cell carcinoma abstract. In: Proceedings of the AACR Special Conference on the Hippo Pathway: Signaling, Cancer, and Beyond; 2019 May 8-11; San Diego, CA. Philadelphia (PA): AACR; Mol Cancer Res 2020;18(8_Suppl):Abstract nr IA25.
We detected clonal mosaicism for large chromosomal anomalies (duplications, deletions and uniparental disomy) using SNP microarray data from over 50,000 subjects recruited for genome-wide association ...studies. This detection method requires a relatively high frequency of cells with the same abnormal karyotype (>5-10%; presumably of clonal origin) in the presence of normal cells. The frequency of detectable clonal mosaicism in peripheral blood is low (<0.5%) from birth until 50 years of age, after which it rapidly rises to 2-3% in the elderly. Many of the mosaic anomalies are characteristic of those found in hematological cancers and identify common deleted regions with genes previously associated with these cancers. Although only 3% of subjects with detectable clonal mosaicism had any record of hematological cancer before DNA sampling, those without a previous diagnosis have an estimated tenfold higher risk of a subsequent hematological cancer (95% confidence interval = 6-18).
Pervasive Developmental Disorders (PDD) are neurodevelopmental disorders characterized by impairments in social interaction, communication and behavior. Given the diversity and varying severity of ...PDD, diagnostic tools attempt to identify homogeneous subtypes within PDD. Identifying subtypes can lead to targeted etiology studies and to effective type-specific intervention. Cluster analysis can suggest coherent subsets in data; however, different methods and assumptions lead to different results. Several previous studies applied clustering to PDD data, varying in number and characteristics of the produced subtypes. Most studies used a relatively small dataset (fewer than 150 subjects), and all applied only a single clustering method. Here we study a relatively large dataset (358 PDD patients), using an ensemble of three clustering methods. The results are evaluated using several validation methods, and consolidated through an integration step. Four clusters are identified, analyzed and compared to subtypes previously defined by the widely used diagnostic tool DSM-IV.