Recent technological advancements have enabled the profiling of a large number of genome-wide features in individual cells. However, single-cell data present unique challenges that require the ...development of specialized methods and software infrastructure to successfully derive biological insights. The Bioconductor project has rapidly grown to meet these demands, hosting community-developed open-source software distributed as R packages. Featuring state-of-the-art computational methods, standardized data infrastructure and interactive data visualization tools, we present an overview and online book (https://osca.bioconductor.org) of single-cell methods for prospective users.
The R/Bioconductor package beadarray allows raw data from Illumina experiments to be read and stored in convenient R classes. Users are free to choose between various methods of image processing, ...background correction and normalization in their analysis rather than using the defaults in Illumina's; proprietary software. The package also allows quality assessment to be carried out on the raw data. The data can then be summarized and stored in a format which can be used by other R/Bioconductor packages to perform downstream analyses. Summarized data processed by Illumina's; BeadStudio software can also be read and analysed in the same manner. Availability: The beadarray package is available from the Bioconductor web page at www.bioconductor.org. A user's; guide and example data sets are provided with the package. Contact: md392@cam.ac.uk
Halophilic archaea of the class
are the most salt-requiring prokaryotes within the domain
. In 1997, minimal standards for the description of new taxa in the order
were proposed. From then on, the ...taxonomy of the class
provides an excellent example of how changing concepts on prokaryote taxonomy and the development of new methods were implemented. The last decades have witnessed a rapid expansion of the number of described taxa within the class
coinciding with the era of genome sequencing development. The current members of the International Committee on Systematics of Prokaryotes Subcommittee on the Taxonomy of
propose these revisions to the recommended minimal standards and encourage the use of advanced technologies in the taxonomic description of members of the
. Most previously required and some recommended minimal standards for the description of new taxa in the class
were retained in the present revision, but changes have been proposed in line with the new methodologies. In addition to the 16S rRNA gene, the
gene is an important molecular marker for the identification of members of the
. Phylogenomic analysis based on concatenated conserved, single-copy marker genes is required to infer the taxonomic status of new taxa. The overall genome relatedness indexes have proven to be determinative in the classification of the taxa within the class
. Average nucleotide identity, digital DNA-DNA hybridization, and average amino acid identity values should be calculated for rigorous comparison among close relatives.
Biological experiments involving genomics or other high-throughput assays typically yield a data matrix that can be explored and analyzed using the R programming language with packages from the ...Bioconductor project. Improvements in the throughput of these assays have resulted in an explosion of data even from routine experiments, which poses a challenge to the existing computational infrastructure for statistical data analysis. For example, single-cell RNA sequencing (scRNA-seq) experiments frequently generate large matrices containing expression values for each gene in each cell, requiring sparse or file-backed representations for memory-efficient manipulation in R. These alternative representations are not easily compatible with high-performance C++ code used for computationally intensive tasks in existing R/Bioconductor packages. Here, we describe a C++ interface named beachmat, which enables agnostic data access from various matrix representations. This allows package developers to write efficient C++ code that is interoperable with dense, sparse and file-backed matrices, amongst others. We evaluated the performance of beachmat for accessing data from each matrix representation using both simulated and real scRNA-seq data, and defined a clear memory/speed trade-off to motivate the choice of an appropriate representation. We also demonstrate how beachmat can be incorporated into the code of other packages to drive analyses of a very large scRNA-seq data set.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
As a group, the halophilic archaea (class
Halobacteria
) are the most salt-requiring and salt-resistant microorganisms within the domain
Archaea
. Halophilic archaea flourish in thalassohaline and ...athalassohaline environments and require over 100–150 g/L NaCl for growth and structural stability. Natural hypersaline environments vary in salt concentration, chemical composition and pH, and occur in climates ranging from tropical to polar and even under-sea. Accordingly, their resident haloarchaeal species vary enormously, as do their individual population compositions and community structures. These diverse halophilic archaeal strains are precious resources for theoretical and applied research but assessing their taxonomic and metabolic novelty and diversity in natural environments has been technically difficult up until recently. Environmental DNA-based high-throughput sequencing technology has now matured sufficiently to allow inexpensive recovery of massive amounts of sequence data, revealing the distribution and community composition of halophilic archaea in different hypersaline environments. While cultivation of haloarchaea is slow and tedious, and only recovers a fraction of the natural diversity, it is the conventional means of describing new species, and provides strains for detailed study. As of the end of May 2020, the class
Halobacteria
contains 71 genera and 275 species, 49.8% of which were first isolated from the marine salt environment and 50.2% from the inland salt environment, indicating that both thalassohaline and athalassohaline environments contain diverse halophilic archaea. However, there remain taxa that have not yet been isolated in pure culture, such as the nanohaloarchaea, which are widespread in the salt environment and may be one of the hot spots in the field of halophilic archaea research in the future. In this review, we focus on the cultivation strategies that have been used to isolate extremely halophilic archaea and point out some of the pitfalls and challenges.
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Haloquadratum walsbyi commonly dominates the microbial flora of hypersaline waters. Its cells are extremely fragile squares requiring >14%(w/v) salt for growth, properties that should limit its ...dispersal and promote geographical isolation and divergence. To assess this, the genome sequences of two isolates recovered from sites at near maximum distance on Earth, were compared.
Both chromosomes are 3.1 MB in size, and 84% of each sequence was highly similar to the other (98.6% identity), comprising the core sequence. ORFs of this shared sequence were completely synteneic (conserved in genomic orientation and order), without inversion or rearrangement. Strain-specific insertions/deletions could be precisely mapped, often allowing the genetic events to be inferred. Many inferred deletions were associated with short direct repeats (4-20 bp). Deletion-coupled insertions are frequent, producing different sequences at identical positions. In cases where the inserted and deleted sequences are homologous, this leads to variant genes in a common synteneic background (as already described by others). Cas/CRISPR systems are present in C23(T) but have been lost in HBSQ001 except for a few spacer remnants. Numerous types of mobile genetic elements occur in both strains, most of which appear to be active, and with some specifically targetting others. Strain C23(T) carries two ∼6 kb plasmids that show similarity to halovirus His1 and to sequences nearby halovirus/plasmid gene clusters commonly found in haloarchaea.
Deletion-coupled insertions show that Hqr. walsbyi evolves by uptake and precise integration of foreign DNA, probably originating from close relatives. Change is also driven by mobile genetic elements but these do not by themselves explain the atypically low gene coding density found in this species. The remarkable genome conservation despite the presence of active systems for genome rearrangement implies both an efficient global dispersal system, and a high selective fitness for this species.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
NaCl-saturated brines such as saltern crystalliser ponds, inland salt lakes, deep-sea brines and liquids-of-deliquescence on halite are commonly regarded as a paradigm for the limit of life on Earth. ...There are, however, other habitats that are thermodynamically more extreme. Typically, NaCl-saturated environments contain all domains of life and perform complete biogeochemical cycling. Despite their reduced water activity, ∼0.755 at 5 M NaCl, some halophiles belonging to the Archaea and Bacteria exhibit optimum growth/metabolism in these brines. Furthermore, the recognised water-activity limit for microbial function, ∼0.585 for some strains of fungi, lies far below 0.755. Other biophysical constraints on the microbial biosphere (temperatures of >121°C; pH > 12; and high chaotropicity; e.g. ethanol at >18.9% w/v (24% v/v) and MgCl2 at >3.03 M) can prevent any cellular metabolism or ecosystem function. By contrast, NaCl-saturated environments contain biomass-dense, metabolically diverse, highly active and complex microbial ecosystems; and this underscores their moderate character. Here, we survey the evidence that NaCl-saturated brines are biologically permissive, fertile habitats that are thermodynamically mid-range rather than extreme. Indeed, were NaCl sufficiently soluble, some halophiles might grow at concentrations of up to 8 M. It may be that the finite solubility of NaCl has stabilised the genetic composition of halophile populations and limited the action of natural selection in driving halophile evolution towards greater xerophilicity. Further implications are considered for the origin(s) of life and other aspects of astrobiology.
TP53 deficiency is the most common alteration in cancer; however, this alone is typically insufficient to drive tumorigenesis. To identify genes promoting tumorigenesis in combination with TP53 ...deficiency, we perform genome-wide CRISPR-Cas9 knockout screens coupled with proliferation and transformation assays in isogenic cell lines. Loss of several known tumor suppressors enhances cellular proliferation and transformation. Loss of neddylation pathway genes promotes uncontrolled proliferation exclusively in TP53-deficient cells. Combined loss of CUL3 and TP53 activates an oncogenic transcriptional program governed by the nuclear factor κB (NF-κB), AP-1, and transforming growth factor β (TGF-β) pathways. This program maintains persistent cellular proliferation, induces partial epithelial to mesenchymal transition, and increases DNA damage, genomic instability, and chromosomal rearrangements. Our findings reveal CUL3 loss as a key event stimulating persistent proliferation in TP53-deficient cells. These findings may be clinically relevant, since TP53-CUL3-deficient cells are highly sensitive to ataxia telangiectasia mutated (ATM) inhibition, exposing a vulnerability that could be exploited for cancer treatment.
Display omitted
•Mixed-effect models with MEMcrispR applied to CRISPR screen analyses•Knockout of neddylation genes increases persistent proliferation in TP53−/− cells•TP53−/−,CUL3−/− cells exhibit persistent proliferation and partial EMT phenotype•TP53−/−,CUL3−/− cells show increased DNA damage and display sensitivity to ATM inhibition
Drainas et al. show that inactivation of genes in the neddylation pathway increases persistent proliferation in TP53-deficient cells. TP53- and CUL3-deficient cells induce an oncogenic transcriptional program, leading to partial EMT and heightened genomic instability. These cells show increased vulnerability to ATM inhibitors.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
Precise control of gene expression requires the coordinated action of multiple factors at cis-regulatory elements. We recently developed single-molecule footprinting to simultaneously resolve the ...occupancy of multiple proteins including transcription factors, RNA polymerase II and nucleosomes on single DNA molecules genome-wide. The technique combines the use of cytosine methyltransferases to footprint the genome with bisulfite sequencing to resolve transcription factor binding patterns at cis-regulatory elements. DNA footprinting is performed by incubating permeabilized nuclei with recombinant methyltransferases. Upon DNA extraction, whole-genome or targeted bisulfite libraries are prepared and loaded on Illumina sequencers. The protocol can be completed in 4-5 d in any laboratory with access to high-throughput sequencing. Analysis can be performed in 2 d using a dedicated R package and requires access to a high-performance computing system. Our method can be used to analyze how transcription factors cooperate and antagonize to regulate transcription.
Illumina whole-genome expression BeadArrays are a popular choice in gene profiling studies. Aside from the vendor-provided software tools for analyzing BeadArray expression data ...(GenomeStudio/BeadStudio), there exists a comprehensive set of open-source analysis tools in the Bioconductor project, many of which have been tailored to exploit the unique properties of this platform. In this article, we explore a number of these software packages and demonstrate how to perform a complete analysis of BeadArray data in various formats. The key steps of importing data, performing quality assessments, preprocessing, and annotation in the common setting of assessing differential expression in designed experiments will be covered.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK