The Universal Protein Resource (UniProt, http://www.uniprot.org ) consortium is an initiative of the SIB Swiss Institute of Bioinformatics (SIB), the European Bioinformatics Institute (EBI) and the ...Protein Information Resource (PIR) to provide the scientific community with a central resource for protein sequences and functional information. The UniProt consortium maintains the UniProt KnowledgeBase (UniProtKB), updated every 4 weeks, and several supplementary databases including the UniProt Reference Clusters (UniRef) and the UniProt Archive (UniParc).The Swiss-Prot section of the UniProt KnowledgeBase (UniProtKB/Swiss-Prot) contains publicly available expertly manually annotated protein sequences obtained from a broad spectrum of organisms. Plant protein entries are produced in the frame of the Plant Proteome Annotation Program (PPAP), with an emphasis on characterized proteins of Arabidopsis thaliana and Oryza sativa. High level annotations provided by UniProtKB/Swiss-Prot are widely used to predict annotation of newly available proteins through automatic pipelines.The purpose of this chapter is to present a guided tour of a UniProtKB/Swiss-Prot entry. We will also present some of the tools and databases that are linked to each entry.
Animal toxins are of interest to a wide range of scientists, due to their numerous applications in pharmacology, neurology, hematology, medicine, and drug research. This, and to a lesser extent the ...development of new performing tools in transcriptomics and proteomics, has led to an increase in toxin discovery. In this context, providing publicly available data on animal toxins has become essential. The UniProtKB/Swiss-Prot Tox-Prot program (http://www.uniprot.org/program/Toxins) plays a crucial role by providing such an access to venom protein sequences and functions from all venomous species. This program has up to now curated more than 5000 venom proteins to the high-quality standards of UniProtKB/Swiss-Prot (release 2012_02). Proteins targeted by these toxins are also available in the knowledgebase. This paper describes in details the type of information provided by UniProtKB/Swiss-Prot for toxins, as well as the structured format of the knowledgebase.
In contrast to vastly studied hypocotyl growth, little is known about diel regulation of leaf growth and its coordination with movements such as changes in leaf elevation angle (hyponasty). We ...developed a 3D live-leaf growth analysis system enabling simultaneous monitoring of growth and movements. Leaf growth is maximal several hours after dawn, requires light, and is regulated by daylength, suggesting coupling between growth and metabolism. We identify both blade and petiole positioning as important components of leaf movements in Arabidopsis thaliana and reveal a temporal delay between growth and movements. In hypocotyls, the combination of circadian expression of PHYTOCHROME INTERACTING FACTOR4 (PIF4) and PIF5 and their light-regulated protein stability drives rhythmic hypocotyl elongation with peak growth at dawn. We find that PIF4 and PIF5 are not essential to sustain rhythmic leaf growth but influence their amplitude. Furthermore, EARLY FLOWERING3, a member of the evening complex (EC), is required to maintain the correct phase between growth and movement. Our study shows that the mechanisms underlying rhythmic hypocotyl and leaf growth differ. Moreover, we reveal the temporal relationship between leaf elongation and movements and demonstrate the importance of the EC for the coordination of these phenotypic traits.
Efforts to precisely identify tumor human leukocyte antigen (HLA) bound peptides capable of mediating T cell-based tumor rejection still face important challenges. Recent studies suggest that ...non-canonical tumor-specific HLA peptides derived from annotated non-coding regions could elicit anti-tumor immune responses. However, sensitive and accurate mass spectrometry (MS)-based proteogenomics approaches are required to robustly identify these non-canonical peptides. We present an MS-based analytical approach that characterizes the non-canonical tumor HLA peptide repertoire, by incorporating whole exome sequencing, bulk and single-cell transcriptomics, ribosome profiling, and two MS/MS search tools in combination. This approach results in the accurate identification of hundreds of shared and tumor-specific non-canonical HLA peptides, including an immunogenic peptide derived from an open reading frame downstream of the melanoma stem cell marker gene ABCB5. These findings hold great promise for the discovery of previously unknown tumor antigens for cancer immunotherapy.
Myocardial infarction (MI) is a leading cause of death worldwide. Reperfusion is considered as an optimal therapy following cardiac ischemia. However, the promotion of a rapid elevation of O
levels ...in ischemic cells produces high amounts of reactive oxygen species (ROS) leading to myocardial tissue injury. This phenomenon is called ischemia reperfusion injury (IRI). We aimed at identifying new and effective compounds to treat MI and minimize IRI. We previously studied heart regeneration following myocardial injury in zebrafish and described each step of the regeneration process, from the day of injury until complete recovery, in terms of transcriptional responses. Here, we mined the data and performed a deep in silico analysis to identify drugs highly likely to induce cardiac regeneration. Fisetin was identified as the top candidate. We validated its effects in an in vitro model of MI/IRI in mammalian cardiac cells. Fisetin enhances viability of rat cardiomyocytes following hypoxia/starvation - reoxygenation. It inhibits apoptosis, decreases ROS generation and caspase activation and protects from DNA damage. Interestingly, fisetin also activates genes involved in cell proliferation. Fisetin is thus a highly promising candidate drug with clinical potential to protect from ischemic damage following MI and to overcome IRI.
Genetic variations affect behavior and cause disease but understanding how these variants drive complex traits is still an open question. A common approach is to link the genetic variants to ...intermediate molecular phenotypes such as the transcriptome using RNA-sequencing (RNA-seq). Paradoxically, these variants between the samples are usually ignored at the beginning of RNA-seq analyses of many model organisms. This can skew the transcriptome estimates that are used later for downstream analyses, such as expression quantitative trait locus (eQTL) detection. Here, we assessed the impact of reference-based analysis on the transcriptome and eQTLs in a widely-used mouse genetic population: the BXD panel of recombinant inbred lines. We highlight existing reference bias in the transcriptome data analysis and propose practical solutions which combine available genetic variants, genotypes, and genome reference sequence. The use of custom BXD line references improved downstream analysis compared to classical genome reference. These insights would likely benefit genetic studies with a transcriptomic component and demonstrate that genome references need to be reassessed and improved.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
The timing and duration of sleep results from the interaction between a homeostatic sleep–wake-driven process and a periodic circadian process, and involves changes in gene regulation and expression. ...Unraveling the contributions of both processes and their interaction to transcriptional and epigenomic regulatory dynamics requires sampling over time under conditions of unperturbed and perturbed sleep. We profiled mRNA expression and chromatin accessibility in the cerebral cortex of mice over a 3-d period, including a 6-h sleep deprivation (SD) on day 2. We used mathematical modeling to integrate time series of mRNA expression data with sleep–wake history, which established that a large proportion of rhythmic genes are governed by the homeostatic process with varying degrees of interaction with the circadian process, sometimes working in opposition. Remarkably, SD caused long-term effects on gene-expression dynamics, outlasting phenotypic recovery, most strikingly illustrated by a damped oscillation of most core clock genes, including Arntl/Bmal1, suggesting that enforced wakefulness directly impacts the molecular clock machinery. Chromatin accessibility proved highly plastic and dynamically affected by SD. Dynamics in distal regions, rather than promoters, correlated with mRNA expression, implying that changes in expression result from constitutively accessible promoters under the influence of enhancers or repressors. Serum response factor (SRF) was predicted as a transcriptional regulator driving immediate response, suggesting that SRF activity mirrors the build-up and release of sleep pressure. Our results demonstrate that a single, short SD has long-term aftereffects at the genomic regulatory level and highlights the importance of the sleep–wake distribution to diurnal rhythmicity and circadian processes.
Plant growth is strongly influenced by the presence of neighbors that compete for light resources. In response to vegetational shading shade‐intolerant plants such as Arabidopsis display a suite of ...developmental responses known as the shade‐avoidance syndrome (SAS). The phytochrome B (phyB) photoreceptor is the major light sensor to mediate this adaptive response. Control of the SAS occurs in part with phyB, which controls protein abundance of phytochrome‐interacting factors 4 and 5 (PIF4 and PIF5) directly. The shade‐avoidance response also requires rapid biosynthesis of auxin and its transport to promote elongation growth. The identification of genome‐wide PIF5‐binding sites during shade avoidance revealed that this bHLH transcription factor regulates the expression of a subset of previously identified SAS genes. Moreover our study suggests that PIF4 and PIF5 regulate elongation growth by controlling directly the expression of genes that code for auxin biosynthesis and auxin signaling components.
Haplotype estimation, or phasing, has gained significant traction in large-scale projects due to its valuable contributions to population genetics, variant analysis, and the creation of reference ...panels for imputation and phasing of new samples. To scale with the growing number of samples, haplotype estimation methods designed for population scale rely on highly optimized statistical models to phase genotype data, and usually ignore read-level information. Statistical methods excel in resolving common variants, however, they still struggle at rare variants due to the lack of statistical information. In this study we introduce SAPPHIRE, a new method that leverages whole-genome sequencing data to enhance the precision of haplotype calls produced by statistical phasing. SAPPHIRE achieves this by refining haplotype estimates through the realignment of sequencing reads, particularly targeting low-confidence phase calls. Our findings demonstrate that SAPPHIRE significantly enhances the accuracy of haplotypes obtained from state of the art methods and also provides the subset of phase calls that are validated by sequencing reads. Finally, we show that our method scales to large data sets by its successful application to the extensive 3.6 Petabytes of sequencing data of the last UK Biobank 200,031 sample release.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK