Coated pits assemble by growth of a clathrin lattice, which is linked by adaptors to the underlying membrane. How does this process start? We used live-cell TIRF imaging with single-molecule EGFP ...sensitivity and high temporal resolution to detect arrival of the clathrin triskelions and AP2 adaptors that initiate coat assembly. Unbiased object identification and trajectory tracking, together with a statistical model, yield the arrival times and numbers of individual proteins, as well as experimentally confirmed estimates of the extent of substitution of endogenous by expressed, fluorescently tagged proteins. Pits initiate by coordinated arrival of clathrin and AP2, which is usually detected as two sequential steps, each of one triskelion with two adaptors. PI-4,5-P2 is essential for initiation. The accessory proteins FCHo1/2 are not; instead, they are required for sustained growth. This objective picture of coated pit initiation also shows that methods outlined here will be broadly useful for studies of dynamic assemblies in living cells.
Display omitted
► Single-molecule live-cell imaging shows individual proteins arriving at clathrin pits ► Pits start by two serial events, each involving the arrival of a clathrin and two AP2 ► PI-4,5-P2 is essential for initiation, whereas the accessory proteins FCho1/2 are not ► FCho1/2 are required to sustain the growth of the coat
Single-molecule studies reveal the first steps in the assembly of clathrin-coated pits, beginning with the coordinated arrival of a clathrin triskelion and two AP2 adaptor proteins. Whereas initial assembly requires PtdIns(4,5)P2, which mediates the association of AP2 with the membrane, sustained growth of the clathrin coat depends on the accessory proteins FCHo1 and FCHo2.
Post-sequencing quality control is a crucial component of RNA sequencing (RNA-seq) data generation and analysis, as sample quality can be affected by sample storage, extraction and sequencing ...protocols. RNA-seq is increasingly applied to cohorts ranging from hundreds to tens of thousands of samples in size, but existing tools do not readily scale to these sizes, and were not designed for a wide range of sample types and qualities. Here, we describe RNA-SeQC 2, an efficient reimplementation of RNA-SeQC (DeLuca et al., 2012) that adds multiple metrics designed to characterize sample quality across a wide range of RNA-seq protocols.
The command-line tool, documentation and C++ source code are available at the GitHub repository https://github.com/getzlab/rnaseqc. Code and data for reproducing the figures in this paper are available at https://github.com/getzlab/rnaseqc2-paper.
Supplementary data are available at Bioinformatics online.
Genetic studies of the transcriptome help bridge the gap between genetic variation and phenotypes. To maximize the potential of such studies, efficient methods to identify expression quantitative ...trait loci (eQTLs) and perform fine-mapping and genetic prediction of gene expression traits are needed. Current methods that leverage both total read counts and allele-specific expression to identify eQTLs are generally computationally intractable for large transcriptomic studies. Here, we describe a unified framework that addresses these needs and is scalable to thousands of samples. Using simulations and data from GTEx, we demonstrate its calibration and performance. For example, mixQTL shows a power gain equivalent to a 29% increase in sample size for genes with sufficient allele-specific read coverage. To showcase the potential of mixQTL, we apply it to 49 GTEx tissues and find 20% additional eQTLs (FDR < 0.05, per tissue) that are significantly more enriched among trait associated variants and candidate cis-regulatory elements comparing to the standard approach.
Coding variants represent many of the strongest associations between genotype and phenotype; however, they exhibit inter-individual differences in effect, termed 'variable penetrance'. Here, we study ...how cis-regulatory variation modifies the penetrance of coding variants. Using functional genomic and genetic data from the Genotype-Tissue Expression Project (GTEx), we observed that in the general population, purifying selection has depleted haplotype combinations predicted to increase pathogenic coding variant penetrance. Conversely, in cancer and autism patients, we observed an enrichment of penetrance increasing haplotype configurations for pathogenic variants in disease-implicated genes, providing evidence that regulatory haplotype configuration of coding variants affects disease risk. Finally, we experimentally validated this model by editing a Mendelian single-nucleotide polymorphism (SNP) using CRISPR/Cas9 on distinct expression haplotypes with the transcriptome as a phenotypic readout. Our results demonstrate that joint regulatory and coding variant effects are an important part of the genetic architecture of human traits and contribute to modified penetrance of disease-causing variants.
Numerous endocytic accessory proteins (EAPs) mediate assembly and maturation of clathrin-coated pits (CCPs) into cargo-containing vesicles. Analysis of EAP function through bulk measurement of cargo ...uptake has been hampered due to potential redundancy among EAPs and, as we show here, the plasticity and resilience of clathrin-mediated endocytosis (CME). Instead, EAP function is best studied by uncovering the correlation between variations in EAP association to individual CCPs and the resulting variations in maturation. However, most EAPs bind to CCPs in low numbers, making the measurement of EAP association via fused fluorescent reporters highly susceptible to detection errors. Here, we present a framework for unbiased measurement of EAP recruitment to CCPs and their direct effects on CCP dynamics. We identify dynamin and the EAP-binding α-adaptin appendage domain of the AP2 adaptor as switches in a regulated, multistep maturation process and provide direct evidence for a molecular checkpoint in CME.
Display omitted
•Highly sensitive detection of protein association with CCPs•CCPs undergo a multistep maturation process gated by a checkpoint•Early dynamin recruitment is required for CCP maturation•Compensatory mechanisms obscure EAP roles in CCP stabilization and maturation
Aguet et al. perform a high-sensitivity analysis of clathrin-coated pits (CCPs). They establish methods that circumvent many difficulties associated with low signal-to-noise ratios, enable quantitative tracking of CCP/accessory dynamics, and reveal that dynamin and the α-adaptin appendage domain of AP2 each gate different stages in the progression of clathrin-mediated endocytosis.
Single-nucleus RNA sequencing (sNuc-seq) profiles RNA from tissues that are preserved or cannot be dissociated, but it does not provide high throughput. Here, we develop DroNc-seq: massively parallel ...sNuc-seq with droplet technology. We profile 39,111 nuclei from mouse and human archived brain samples to demonstrate sensitive, efficient, and unbiased classification of cell types, paving the way for systematic charting of cell atlases.
Understanding gene function and regulation in homeostasis and disease requires knowledge of the cellular and tissue contexts in which genes are expressed. Here, we applied four single-nucleus RNA ...sequencing methods to eight diverse, archived, frozen tissue types from 16 donors and 25 samples, generating a cross-tissue atlas of 209,126 nuclei profiles, which we integrated across tissues, donors, and laboratory methods with a conditional variational autoencoder. Using the resulting cross-tissue atlas, we highlight shared and tissue-specific features of tissue-resident cell populations; identify cell types that might contribute to neuromuscular, metabolic, and immune components of monogenic diseases and the biological processes involved in their pathology; and determine cell types and gene modules that might underlie disease mechanisms for complex traits analyzed by genome-wide association studies.
We apply integrative approaches to expression quantitative loci (eQTLs) from 44 tissues from the Genotype-Tissue Expression project and genome-wide association study data. About 60% of known ...trait-associated loci are in linkage disequilibrium with a cis-eQTL, over half of which were not found in previous large-scale whole blood studies. Applying polygenic analyses to metabolic, cardiovascular, anthropometric, autoimmune, and neurodegenerative traits, we find that eQTLs are significantly enriched for trait associations in relevant pathogenic tissues and explain a substantial proportion of the heritability (40-80%). For most traits, tissue-shared eQTLs underlie a greater proportion of trait associations, although tissue-specific eQTLs have a greater contribution to some traits, such as blood pressure. By integrating information from biological pathways with eQTL target genes and applying a gene-based approach, we validate previously implicated causal genes and pathways, and propose new variant and gene associations for several complex traits, which we replicate in the UK BioBank and BioVU.
Current genomics methods are designed to handle tens to thousands of samples but will need to scale to millions to match the pace of data and hypothesis generation in biomedical science. Here, we ...show that high efficiency at low cost can be achieved by leveraging general-purpose libraries for computing using graphics processing units (GPUs), such as PyTorch and TensorFlow. We demonstrate > 200-fold decreases in runtime and ~ 5-10-fold reductions in cost relative to CPUs. We anticipate that the accessibility of these libraries will lead to a widespread adoption of GPUs in computational genomics.