After more than two years the COVID-19 pandemic, that is caused by infection with the respiratory SARS-CoV-2 virus, is still ongoing. The risk to develop severe COVID-19 upon SARS-CoV-2 infection is ...increased in individuals with a high age, high body mass index, and who are smoking. The SARS-CoV-2 virus infects cells of the upper respiratory tract by entering these cells upon binding to the Angiotensin-converting enzyme 2 (ACE2) receptor. ACE2 is expressed in various cell types in the lung but the expression is especially high in goblet and ciliated cells. Recently, it was shown that next to its full-length isoform, ACE2 also has a short isoform. The short isoform is unable to bind SARS-CoV-2 and does not facilitate viral entry. In the current study we investigated whether active cigarette smoking increases the expression of the long or the short ACE2 isoform. We showed that in active smokers the expression of the long, active isoform, but not the short isoform of ACE2 is higher compared to never smokers. Additionally, it was shown that the expression of especially the long, active isoform of ACE2 was associated with secretory, club and goblet epithelial cells. This study increases our understanding of why current smokers are more susceptible to SARS-CoV-2 infection, in addition to the already established increased risk to develop severe COVID-19.
Nasal gene expression profiling is a new approach to investigate the airway epithelium as a biomarker to study the activity and treatment responses of obstructive pulmonary diseases. We investigated ...to what extent gene expression profiling of nasal brushings is similar to that of bronchial brushings. We performed genome wide gene expression profiling on matched nasal and bronchial epithelial brushes from 77 respiratory healthy individuals. To investigate differences and similarities among regulatory modules, network analysis was performed on correlated, differentially expressed and smoking-related genes using Gaussian Graphical Models. Between nasal and bronchial brushes, 619 genes were correlated and 1692 genes were differentially expressed (false discovery rate <0.05, |Fold-change|>2). Network analysis of correlated genes showed pro-inflammatory pathways to be similar between the two locations. Focusing on smoking-related genes, cytochrome-P450 pathway related genes were found to be similar, supporting the concept of a detoxifying response to tobacco exposure throughout the airways. In contrast, cilia-related pathways were decreased in nasal compared to bronchial brushes when focusing on differentially expressed genes. Collectively, while there are substantial differences in gene expression between nasal and bronchial brushes, we also found similarities, especially in the response to the external factors such as smoking.
The success of many clinical, association, or population genetics studies critically relies on properly performed variant calling step. The variety of modern genomics protocols, techniques, and ...platforms makes our choices of methods and algorithms difficult and there is no “one size fits all” solution for study design and data analysis. In this review, we discuss considerations that need to be taken into account while designing the study and preparing for the experiments. We outline the variety of variant types that can be detected using sequencing approaches and highlight some specific requirements and basic principles of their detection. Finally, we cover interesting developments that enable variant calling for a broad range of applications in the genomics field. We conclude by discussing technological and algorithmic advances that have the potential to change the ways of calling DNA variants in the nearest future.
The transcription factor C/EBPβ is a master regulator of mammary gland development and tissue remodelling during lactation. The CEBPB-mRNA is translated into three distinct protein isoforms named ...C/EBPβ-LAP1, -LAP2 and -LIP that are functionally different. The smaller isoform LIP lacks the N-terminal transactivation domains and is considered to act as an inhibitor of the transactivating LAP1/2 isoforms by competitive binding for the same DNA recognition sequences. Aberrantly high expression of LIP is associated with mammary epithelial proliferation and is found in grade III, estrogen receptor (ER) and progesterone (PR) receptor-negative human breast cancer. Here, we show that reverting the high LIP/LAP ratios in triple-negative breast cancer (TNBC) cell lines into low LIP/LAP ratios by overexpression of LAP reduces migration and matrix invasion of these TNBC cells. In addition, in untransformed MCF10A human mammary epithelial cells overexpression of LIP stimulates migration. Knockout of CEBPB in TNBC cells where LIP expression prevails, resulted in strongly reduced migration that was accompanied by a downregulation of genes involved in cell migration, extracellular matrix production and cytoskeletal remodelling, many of which are epithelial to mesenchymal transition (EMT) marker genes. Together, this study suggests that the LIP/LAP ratio is involved in regulating breast cancer cell migration and invasion. This study together with studies from others shows that understanding the functions the C/EBPβ-isoforms in breast cancer development may reveal new avenues of treatment.
Since the completion of the rat reference genome in 2003, whole-genome sequencing data from more than 40 rat strains have become available. These data represent the broad range of strains that are ...used in rat research including commonly used substrains. Currently, this wealth of information cannot be used to its full extent, because the variety of different variant calling algorithms employed by different groups impairs comparison between strains. In addition, all rat whole genome sequencing studies to date used an outdated reference genome for analysis (RGSC3.4 released in 2004).
Here we present a comprehensive, multi-sample and uniformly called set of genetic variants in 40 rat strains, including 19 substrains. We reanalyzed all primary data using a recent version of the rat reference assembly (RGSC5.0 released in 2012) and identified over 12 million genomic variants (SNVs, indels and structural variants) among the 40 strains. 28,318 SNVs are specific to individual substrains, which may be explained by introgression from other unsequenced strains and ongoing evolution by genetic drift. Substrain SNVs may have a larger predicted functional impact compared to older shared SNVs.
In summary we present a comprehensive catalog of uniformly analyzed genetic variants among 40 widely used rat inbred strains based on the RGSC5.0 assembly. This represents a valuable resource, which will facilitate rat functional genomic research. In line with previous observations, our genome-wide analyses do not show evidence for contribution of multiple ancestral founder rat subspecies to the currently used rat inbred strains, as is the case for mouse. In addition, we find that the degree of substrain variation is highly variable between strains, which is of importance for the correct interpretation of experimental data from different labs.
Single cell Strand-seq is a unique tool for the discovery and phasing of genomic inversions. Conventional methods to discover inversions with Strand-seq data are blind to known inversion locations, ...limiting their statistical power for the detection of inversions smaller than 10 Kb. Moreover, the methods rely on manual inspection to separate false and true positives. Here we describe "InvertypeR", a method based on a Bayesian binomial model that genotypes inversions using fixed genomic coordinates. We validated InvertypeR by re-genotyping inversions reported for three trios by the Human Genome Structural Variation Consortium. Although 6.3% of the family inversion genotypes in the original study showed Mendelian discordance, this was reduced to 0.5% using InvertypeR. By applying InvertypeR to published inversion coordinates and predicted inversion hotspots (n = 3701), as well as coordinates from conventional inversion discovery, we furthermore genotyped 66 inversions not previously reported for the three trios. InvertypeR discovers, genotypes, and phases inversions without relying on manual inspection. For greater accessibility, results are presented as phased chromosome ideograms with inversions linked to Strand-seq data in the genome browser. InvertypeR increases the power of Strand-seq for studies on the role of inversions in phenotypic variation, genome instability, and human disease.
Chromothripsis represents a novel phenomenon in the structural variation landscape of cancer genomes. Here, we analyze the genomes of ten patients with congenital disease who were preselected to ...carry complex chromosomal rearrangements with more than two breakpoints. The rearrangements displayed unanticipated complexity resembling chromothripsis. We find that eight of them contain hallmarks of multiple clustered double-stranded DNA breaks (DSBs) on one or more chromosomes. In addition, nucleotide resolution analysis of 98 breakpoint junctions indicates that break repair involves nonhomologous or microhomology-mediated end joining. We observed that these eight rearrangements are balanced or contain sporadic deletions ranging in size between a few hundred base pairs and several megabases. The two remaining complex rearrangements did not display signs of DSBs and contain duplications, indicative of rearrangement processes involving template switching. Our work provides detailed insight into the characteristics of chromothripsis and supports a role for clustered DSBs driving some constitutional chromothripsis rearrangements.
Display omitted
► Constitutional complex chromosomal rearrangements display unanticipated complexity resembling chromothripsis ► Some chromothripsis rearrangements involve clustered double-stranded DNA breaks ► There exist distinct classes of chromothripsis rearrangements
Complex genomic rearrangements are associated with birth defects and cancer development. Kloosterman, Cuppen, and colleagues use whole-genome sequencing analysis to unravel the precise structure of ten complex genomic rearrangements in patients with congenital malformations. Eight of the rearrangements are caused by chromosome shattering and nonhomologous DNA repair, while two rearrangements involve replicative repair processes. These rearrangements represent two different instances of chromothripsis. The authors show that chromosome shattering and repair may frequently underlie complex genomic rearrangements causing developmental defects.
BACKGROUND: Structural rearrangements form a major class of somatic variation in cancer genomes. Local chromosome shattering, termed chromothripsis, is a mechanism proposed to be the cause of ...clustered chromosomal rearrangements and was recently described to occur in a small percentage of tumors. The significance of these clusters for tumor development or metastatic spread is largely unclear. RESULTS: We used genome-wide long mate-pair sequencing and SNP array profiling to reveal that chromothripsis is a widespread phenomenon in primary colorectal cancer and metastases. We find large and small chromothripsis events in nearly every colorectal tumor sample and show that several breakpoints of chromothripsis clusters and isolated rearrangements affect cancer genes, including NOTCH2, EXO1 and MLL3. We complemented the structural variation studies by sequencing the coding regions of a cancer exome in all colorectal tumor samples and found somatic mutations in 24 genes, including APC, KRAS, SMAD4 and PIK3CA. A pairwise comparison of somatic variations in primary and metastatic samples indicated that many chromothripsis clusters, isolated rearrangements and point mutations are exclusively present in either the primary tumor or the metastasis and may affect cancer genes in a lesion-specific manner. CONCLUSIONS: We conclude that chromothripsis is a prevalent mechanism driving structural rearrangements in colorectal cancer and show that a complex interplay between point mutations, simple copy number changes and chromothripsis events drive colorectal tumor development and metastasis.
Cellular metabolism is a tightly controlled process in which the cell adapts fluxes through metabolic pathways in response to changes in nutrient supply. Among the transcription factors that regulate ...gene expression and thereby cause changes in cellular metabolism is the basic leucine-zipper (bZIP) transcription factor CCAAT/enhancer-binding protein alpha (C/EBPα). Protein lysine acetylation is a key post-translational modification (PTM) that integrates cellular metabolic cues with other physiological processes. Here, we show that C/EBPα is acetylated by the lysine acetyl transferase (KAT) p300 and deacetylated by the lysine deacetylase (KDAC) sirtuin1 (SIRT1). SIRT1 is activated in times of energy demand by high levels of nicotinamide adenine dinucleotide (NAD+) and controls mitochondrial biogenesis and function. A hypoacetylated mutant of C/EBPα induces the transcription of mitochondrial genes and results in increased mitochondrial respiration. Our study identifies C/EBPα as a key mediator of SIRT1-controlled adaption of energy homeostasis to changes in nutrient supply.
Display omitted
•p300 acetylates C/EBPα on several lysines•SIRT1 deacetylates C/EBPα•Hypoacetylated C/EBPα increases mitochondrial function•C/EBPα is a mediator of SIRT1-controlled energy homeostasis
Zaini et al. show that the transcription factor C/EBPα is acetylated by p300 and deacetylated by the lysine deacetylase SIRT1. Hypoacetylated C/EBPα induces the transcription of mitochondrial genes and results in increased mitochondrial respiration. C/EBPα is a key mediator of SIRT1-controlled adaption of energy homeostasis to changes in nutrient supply.