Single-cell transcriptomics yields ever growing data sets containing RNA expression levels for thousands of genes from up to millions of cells. Common data analysis pipelines include a dimensionality ...reduction step for visualising the data in two dimensions, most frequently performed using t-distributed stochastic neighbour embedding (t-SNE). It excels at revealing local structure in high-dimensional data, but naive applications often suffer from severe shortcomings, e.g. the global structure of the data is not represented accurately. Here we describe how to circumvent such pitfalls, and develop a protocol for creating more faithful t-SNE visualisations. It includes PCA initialisation, a high learning rate, and multi-scale similarity kernels; for very large data sets, we additionally use exaggeration and downsampling-based initialisation. We use published single-cell RNA-seq data sets to demonstrate that this protocol yields superior results compared to the naive application of t-SNE.
Comparing the impact of the COVID-19 pandemic between countries or across time is difficult because the reported numbers of cases and deaths can be strongly affected by testing capacity and reporting ...policy. Excess mortality, defined as the increase in all-cause mortality relative to the expected mortality, is widely considered as a more objective indicator of the COVID-19 death toll. However, there has been no global, frequently updated repository of the all-cause mortality data across countries. To fill this gap, we have collected weekly, monthly, or quarterly all-cause mortality data from 103 countries and territories, openly available as the regularly updated World Mortality Dataset. We used this dataset to compute the excess mortality in each country during the COVID-19 pandemic. We found that in several worst-affected countries (Peru, Ecuador, Bolivia, Mexico) the excess mortality was above 50% of the expected annual mortality (Peru, Ecuador, Bolivia, Mexico) or above 400 excess deaths per 100,000 population (Peru, Bulgaria, North Macedonia, Serbia). At the same time, in several other countries (e.g. Australia and New Zealand) mortality during the pandemic was below the usual level, presumably due to social distancing measures decreasing the non-COVID infectious mortality. Furthermore, we found that while many countries have been reporting the COVID-19 deaths very accurately, some countries have been substantially underreporting their COVID-19 deaths (e.g. Nicaragua, Russia, Uzbekistan), by up to two orders of magnitude (Tajikistan). Our results highlight the importance of open and rapid all-cause mortality reporting for pandemic monitoring.
Data on excess deaths in Russia in 2020 paint a much bleaker picture of the Covid‐19 death toll than the official daily updated number, argues Dmitry Kobak
Data on excess deaths in Russia in 2020 ...paint a much bleaker picture of the Covid‐19 death toll than the official daily updated number, argues Dmitry Kobak.
Standard preprocessing of single-cell RNA-seq UMI data includes normalization by sequencing depth to remove this technical variability, and nonlinear transformation to stabilize the variance across ...genes with different expression levels. Instead, two recent papers propose to use statistical count models for these tasks: Hafemeister and Satija (Genome Biol 20:296, 2019) recommend using Pearson residuals from negative binomial regression, while Townes et al. (Genome Biol 20:295, 2019) recommend fitting a generalized PCA model. Here, we investigate the connection between these approaches theoretically and empirically, and compare their effects on downstream processing.
We show that the model of Hafemeister and Satija produces noisy parameter estimates because it is overspecified, which is why the original paper employs post hoc smoothing. When specified more parsimoniously, it has a simple analytic solution equivalent to the rank-one Poisson GLM-PCA of Townes et al. Further, our analysis indicates that per-gene overdispersion estimates in Hafemeister and Satija are biased, and that the data are in fact consistent with the overdispersion parameter being independent of gene expression. We then use negative control data without biological variability to estimate the technical overdispersion of UMI counts, and find that across several different experimental protocols, the data are close to Poisson and suggest very moderate overdispersion. Finally, we perform a benchmark to compare the performance of Pearson residuals, variance-stabilizing transformations, and GLM-PCA on scRNA-seq datasets with known ground truth.
We demonstrate that analytic Pearson residuals strongly outperform other methods for identifying biologically variable genes, and capture more of the biologically meaningful variation when used for dimensionality reduction.
Neurons in higher cortical areas, such as the prefrontal cortex, are often tuned to a variety of sensory and motor variables, and are therefore said to display mixed selectivity. This complexity of ...single neuron responses can obscure what information these areas represent and how it is represented. Here we demonstrate the advantages of a new dimensionality reduction technique, demixed principal component analysis (dPCA), that decomposes population activity into a few components. In addition to systematically capturing the majority of the variance of the data, dPCA also exposes the dependence of the neural representation on task parameters such as stimuli, decisions, or rewards. To illustrate our method we reanalyze population data from four datasets comprising different species, different cortical areas and different experimental tasks. In each case, dPCA provides a concise way of visualizing the data that summarizes the task-dependent features of the population response in a single figure.
Patch-seq: Past, Present, and Future Lipovsek, Marcela; Bardy, Cedric; Cadwell, Cathryn R ...
The Journal of neuroscience,
02/2021, Letnik:
41, Številka:
5
Journal Article
Recenzirano
Odprti dostop
Single-cell transcriptomic approaches are revolutionizing neuroscience. Integrating this wealth of data with morphology and physiology, for the comprehensive study of neuronal biology, requires ...multiplexing gene expression data with complementary techniques. To meet this need, multiple groups in parallel have developed "Patch-seq," a modification of whole-cell patch-clamp protocols that enables mRNA sequencing of cell contents after electrophysiological recordings from individual neurons and morphologic reconstruction of the same cells. In this review, we first outline the critical technical developments that enabled robust Patch-seq experimental efforts and analytical solutions to interpret the rich multimodal data generated. We then review recent applications of Patch-seq that address novel and long-standing questions in neuroscience. These include the following: (1) targeted study of specific neuronal populations based on their anatomic location, functional properties, lineage, or a combination of these factors; (2) the compilation and integration of multimodal cell type atlases; and (3) the investigation of the molecular basis of morphologic and functional diversity. Finally, we highlight potential opportunities for further technical development and lines of research that may benefit from implementing the Patch-seq technique. As a multimodal approach at the intersection of molecular neurobiology and physiology, Patch-seq is uniquely positioned to directly link gene expression to brain function.
Dopamine neurons encode the difference between actual and predicted reward, or reward prediction error (RPE). Although many models have been proposed to account for this computation, it has ...been difficult to test these models experimentally. Here we established an awake electrophysiological recording system, combined with rabies virus and optogenetic cell-type identification, to characterize the firing patterns of monosynaptic inputs to dopamine neurons while mice performed classical conditioning tasks. We found that each variable required to compute RPE, including actual and predicted reward, was distributed in input neurons in multiple brain areas. Further, many input neurons across brain areas signaled combinations of these variables. These results demonstrate that even simple arithmetic computations such as RPE are not localized in specific brain areas but, rather, distributed across multiple nodes in a brain-wide network. Our systematic method to examine both activity and connectivity revealed unexpected redundancy for a simple computation in the brain.
•Electrophysiological recording from monosynaptic inputs of dopamine neurons was performed•Rabies virus tracing was combined with optogenetic tagging in awake recording•Information required to compute reward prediction errors (RPEs) was distributed•There are mixed representations of variables and partially computed RPEs in input neurons
Tian et al. combined rabies virus and photo-tagging to record from monosynaptic inputs to dopamine neurons. Information for reward prediction error computations is distributed and already mixed in input neurons, suggesting highly redundant and distributed computation for a simple arithmetic.
Layer 4 (L4) of mammalian neocortex plays a crucial role in cortical information processing, yet a complete census of its cell types and connectivity remains elusive. Using whole-cell recordings with ...morphological recovery, we identified one major excitatory and seven inhibitory types of neurons in L4 of adult mouse visual cortex (V1). Nearly all excitatory neurons were pyramidal and all somatostatin-positive (SOM
) non-fast-spiking interneurons were Martinotti cells. In contrast, in somatosensory cortex (S1), excitatory neurons were mostly stellate and SOM
interneurons were non-Martinotti. These morphologically distinct SOM
interneurons corresponded to different transcriptomic cell types and were differentially integrated into the local circuit with only S1 neurons receiving local excitatory input. We propose that cell type specific circuit motifs, such as the Martinotti/pyramidal and non-Martinotti/stellate pairs, are used across the cortex as building blocks to assemble cortical circuits.
Neocortical feedback is critical for attention, prediction, and learning. To mechanically understand its function requires deciphering its cell-type wiring. Recent studies revealed that feedback ...between primary motor to primary somatosensory areas in mice is disinhibitory, targeting vasoactive intestinal peptide-expressing interneurons, in addition to pyramidal cells. It is unknown whether this circuit motif represents a general cortico-cortical feedback organizing principle. Here we show that in contrast to this wiring rule, feedback between higher-order lateromedial visual area to primary visual cortex preferentially activates somatostatin-expressing interneurons. Functionally, both feedback circuits temporally sharpen feed-forward excitation eliciting a transient increase-followed by a prolonged decrease-in pyramidal cell activity under sustained feed-forward input. However, under feed-forward transient input, the primary motor to primary somatosensory cortex feedback facilitates bursting while lateromedial area to primary visual cortex feedback increases time precision. Our findings argue for multiple cortico-cortical feedback motifs implementing different dynamic non-linear operations.