Members of the ZFP36 family of RNA-binding proteins regulate gene expression post-transcriptionally by binding to AU-rich elements in the 3'UTR of mRNA and stimulating mRNA degradation. The proteins ...within this family target different transcripts in different tissues. In particular, ZFP36 targets myogenic transcripts and may have a role in adult muscle stem cell quiescence. Our study examined the requirement of ZFP36L1 and ZFP36L2 in adult muscle cell fate regulation.
We generated single and double conditional knockout mice in which Zfp36l1 and/or Zfp36l2 were deleted in Pax7-expressing cells. Immunostained muscle sections were used to analyse resting skeletal muscle, and a cardiotoxin-induced injury model was used to determine the regenerative capacity of muscle.
We show that ZFP36L1 and ZFP36L2 proteins are expressed in satellite cells. Mice lacking the two proteins in Pax7-expressing cells have reduced body weight and have reduced skeletal muscle mass. Furthermore, the number of satellite cells is reduced in adult skeletal muscle and the capacity of this muscle to regenerate following muscle injury is diminished.
ZFP36L1 and ZFP36L2 act redundantly in myogenesis. These findings add further intricacy to the regulation of the cell fate of Pax7-expressing cells in skeletal muscle by RNA-binding proteins.
Abstract
The Gene Ontology (GO) knowledgebase (http://geneontology.org) is a comprehensive resource concerning the functions of genes and gene products (proteins and noncoding RNAs). GO annotations ...cover genes from organisms across the tree of life as well as viruses, though most gene function knowledge currently derives from experiments carried out in a relatively small number of model organisms. Here, we provide an updated overview of the GO knowledgebase, as well as the efforts of the broad, international consortium of scientists that develops, maintains, and updates the GO knowledgebase. The GO knowledgebase consists of three components: (1) the GO—a computational knowledge structure describing the functional characteristics of genes; (2) GO annotations—evidence-supported statements asserting that a specific gene product has a particular functional characteristic; and (3) GO Causal Activity Models (GO-CAMs)—mechanistic models of molecular “pathways” (GO biological processes) created by linking multiple GO annotations using defined relations. Each of these components is continually expanded, revised, and updated in response to newly published discoveries and receives extensive QA checks, reviews, and user feedback. For each of these components, we provide a description of the current contents, recent developments to keep the knowledgebase up to date with new discoveries, and guidance on how users can best make use of the data that we provide. We conclude with future directions for the project.
Abstract
The Gene Ontology Consortium (GOC) provides the most comprehensive resource currently available for computable knowledge regarding the functions of genes and gene products. Here, we report ...the advances of the consortium over the past two years. The new GO-CAM annotation framework was notably improved, and we formalized the model with a computational schema to check and validate the rapidly increasing repository of 2838 GO-CAMs. In addition, we describe the impacts of several collaborations to refine GO and report a 10% increase in the number of GO annotations, a 25% increase in annotated gene products, and over 9,400 new scientific articles annotated. As the project matures, we continue our efforts to review older annotations in light of newer findings, and, to maintain consistency with other ontologies. As a result, 20 000 annotations derived from experimental data were reviewed, corresponding to 2.5% of experimental GO annotations. The website (http://geneontology.org) was redesigned for quick access to documentation, downloads and tools. To maintain an accurate resource and support traceability and reproducibility, we have made available a historical archive covering the past 15 years of GO data with a consistent format and file structure for both the ontology and annotations.
Abstract
The Complex Portal (www.ebi.ac.uk/complexportal) is a manually curated, encyclopaedic database that collates and summarizes information on stable, macromolecular complexes of known function. ...It captures complex composition, topology and function and links out to a large range of domain-specific resources that hold more detailed data, such as PDB or Reactome. We have made several significant improvements since our last update, including improving compliance to the FAIR data principles by providing complex-specific, stable identifiers that include versioning. Protein complexes are now available from 20 species for download in standards-compliant formats such as PSI-XML, MI-JSON and ComplexTAB or can be accessed via an improved REST API. A component-based JS front-end framework has been implemented to drive a new website and this has allowed the use of APIs from linked services to import and visualize information such as the 3D structure of protein complexes, its role in reactions and pathways and the co-expression of complex components in the tissues of multi-cellular organisms. A first draft of the complete complexome of Saccharomyces cerevisiae is now available to browse and download.
Abstract
The Complex Portal (www.ebi.ac.uk/complexportal) is a manually curated, encyclopaedic database of macromolecular complexes with known function from a range of model organisms. It summarizes ...complex composition, topology and function along with links to a large range of domain-specific resources (i.e. wwPDB, EMDB and Reactome). Since the last update in 2019, we have produced a first draft complexome for Escherichia coli, maintained and updated that of Saccharomyces cerevisiae, added over 40 coronavirus complexes and increased the human complexome to over 1100 complexes that include approximately 200 complexes that act as targets for viral proteins or are part of the immune system. The display of protein features in ComplexViewer has been improved and the participant table is now colour-coordinated with the nodes in ComplexViewer. Community collaboration has expanded, for example by contributing to an analysis of putative transcription cofactors and providing data accessible to semantic web tools through Wikidata which is now populated with manually curated Complex Portal content through a new bot. Our data license is now CC0 to encourage data reuse. Users are encouraged to get in touch, provide us with feedback and send curation requests through the ‘Support’ link.
Abstract
Motivation
The number of protein records in the UniProt Knowledgebase (UniProtKB: https://www.uniprot.org) continues to grow rapidly as a result of genome sequencing and the prediction of ...protein-coding genes. Providing functional annotation for these proteins presents a significant and continuing challenge.
Results
In response to this challenge, UniProt has developed a method of annotation, known as UniRule, based on expertly curated rules, which integrates related systems (RuleBase, HAMAP, PIRSR, PIRNR) developed by the members of the UniProt consortium. UniRule uses protein family signatures from InterPro, combined with taxonomic and other constraints, to select sets of reviewed proteins which have common functional properties supported by experimental evidence. This annotation is propagated to unreviewed records in UniProtKB that meet the same selection criteria, most of which do not have (and are never likely to have) experimentally verified functional annotation. Release 2020_01 of UniProtKB contains 6496 UniRule rules which provide annotation for 53 million proteins, accounting for 30% of the 178 million records in UniProtKB. UniRule provides scalable enrichment of annotation in UniProtKB.
Availability and implementation
UniRule rules are integrated into UniProtKB and can be viewed at https://www.uniprot.org/unirule/. UniRule rules and the code required to run the rules, are publicly available for researchers who wish to annotate their own sequences. The implementation used to run the rules is known as UniFIRE and is available at https://gitlab.ebi.ac.uk/uniprot-public/unifire.
Antisense-induced exon skipping can restore the open reading frame, and thus correct the dystrophin deficiency that causes Duchenne muscular dystrophy (DMD), a lethal muscle wasting condition. ...Successful proof-of-principle in preclinical models has led to human clinical trials. However, it is still not known what percentage of dystrophin-positive fibers and what level of expression is necessary for functional improvement. This study directly address these key questions in the mdx mouse model of DMD. To achieve a significant variation in dystrophin expression, we locally administered into tibialis anterior muscles various doses of a phosphorodiamidate morpholino oligomer (PMO) designed to skip the mutated exon 23 from the mRNA of murine dystrophin. We found a highly significant correlation between the number of dystrophin-positive fibers and resistance to contraction-induced injury, with a minimum of 20% of dystrophin-positive fibers required for meaningful improvement. Furthermore, our results also indicate that a relatively low level of dystrophin expression in muscle fibers may have significant clinical benefits. In contrast, improvements in muscle force were not correlated with either the number of positive fibers or total dystrophin levels, which highlight the need to conduct appropriate functional assessments in preclinical testing using the mdx mouse.
Phosphatases play an essential role in the regulation of protein phosphorylation. Less abundant than kinases, many phosphatases are components of one or more macromolecular complexes with different ...substrate specificities and specific functionalities. The expert scientific curation of phosphatase complexes for the UniProt and Complex Portal databases supports the whole scientific community by collating and organising small‐ and large‐scale experimental data from the scientific literature into context‐specific central resources, where the data can be freely accessed and used to further academic and translational research. In this review, we discuss how the diverse biological functions of phosphatase complexes are presented in UniProt and the Complex Portal, and how understanding the biological significance of phosphatase complexes in Caenorhabditis elegans offers insight into the mechanisms of substrate diversity in a variety of cellular and molecular processes.
The UniProt Knowledgebase (UniProtKB) and Complex Portal databases facilitate the understanding of how protein phosphorylation is regulated by providing structural and functional information on phosphatase‐containing macromolecular complexes across a wide range of species. Using the Caenorhabditis elegans model organism as a basis, this review addresses the challenges in annotating protein complex data and summarises how the two databases differ in their approach to presenting phosphatase‐containing complexes.
UniProt continues to support the ongoing process of making scientific data FAIR. Here we contribute to this process with a FAIRness assessment of our UniProtKB dataset followed by a critical ...reflection on the challenges and future directions of the adoption and validation of the FAIR principles and metrics.