Abstract
Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in ...the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of interfaces to genomic data across the tree of life, including reference genome sequence, gene models, transcriptional data, genetic variation and comparative analysis. Data may be accessed via our website, online tools platform and programmatic interfaces, with updates made four times per year (in synchrony with Ensembl). Here, we provide an overview of Ensembl Genomes, with a focus on recent developments. These include the continued growth, more robust and reproducible sets of orthologues and paralogues, and enriched views of gene expression and gene function in plants. Finally, we report on our continued deeper integration with the Ensembl project, which forms a key part of our future strategy for dealing with the increasing quantity of available genome-scale data across the tree of life.
Abstract
The Ensembl project (https://www.ensembl.org) makes key genomic data sets available to the entire scientific community without restrictions. Ensembl seeks to be a fundamental resource ...driving scientific progress by creating, maintaining and updating reference genome annotation and comparative genomics resources. This year we describe our new and expanded gene, variant and comparative annotation capabilities, which led to a 50% increase in the number of vertebrate genomes we support. We have also doubled the number of available human variants and added regulatory regions for many mouse cell types and developmental stages. Our data sets and tools are available via the Ensembl website as well as a through a RESTful webservice, Perl application programming interface and as data files for download.
VectorBase is a National Institute of Allergy and Infectious Diseases supported Bioinformatics Resource Center (BRC) for invertebrate vectors of human pathogens. Now in its 11th year, VectorBase ...currently hosts the genomes of 35 organisms including a number of non-vectors for comparative analysis. Hosted data range from genome assemblies with annotated gene features, transcript and protein expression data to population genetics including variation and insecticide-resistance phenotypes. Here we describe improvements to our resource and the set of tools available for interrogating and accessing BRC data including the integration of Web Apollo to facilitate community annotation and providing Galaxy to support user-based workflows. VectorBase also actively supports our community through hands-on workshops and online tutorials. All information and data are freely available from our website at https://www.vectorbase.org/.
VectorBase (http://www.vectorbase.org) is a NIAID-supported bioinformatics resource for invertebrate vectors of human pathogens. It hosts data for nine genomes: mosquitoes (three Anopheles gambiae ...genomes, Aedes aegypti and Culex quinquefasciatus), tick (Ixodes scapularis), body louse (Pediculus humanus), kissing bug (Rhodnius prolixus) and tsetse fly (Glossina morsitans). Hosted data range from genomic features and expression data to population genetics and ontologies. We describe improvements and integration of new data that expand our taxonomic coverage. Releases are bi-monthly and include the delivery of preliminary data for emerging genomes. Frequent updates of the genome browser provide VectorBase users with increasing options for visualizing their own high-throughput data. One major development is a new population biology resource for storing genomic variations, insecticide resistance data and their associated metadata. It takes advantage of improved ontologies and controlled vocabularies. Combined, these new features ensure timely release of multiple types of data in the public domain while helping overcome the bottlenecks of bioinformatics and annotation by engaging with our user community.
A wide range of globally significant biomass fuels were burned during the fourth Fire Lab at Missoula Experiment (FLAME-4). A multi-channel photoacoustic absorption spectrometer (PAS) measured dry ...absorption at 405, 532, and 660 nm and thermally denuded (250 °C) absorption at 405 and 660 nm. Absorption coefficients were broken into contributions from black carbon (BC), brown carbon (BrC), and lensing following three different methodologies, with one extreme being a method that assumes the thermal denuder effectively removes organics and the other extreme being a method based on the assumption that black carbon (BC) has an Ångström exponent of unity. The methodologies employed provide ranges of potential importance of BrC to absorption but, on average, there was a difference of a factor of 2 in the ratio of the fraction of absorption attributable to BrC estimated by the two methods. BrC absorption at shorter visible wavelengths is of equal or greater importance to that of BC, with maximum contributions of up to 92 % of total aerosol absorption at 405 nm and up to 58 % of total absorption at 532 nm. Lensing is estimated to contribute a maximum of 30 % of total absorption, but typically contributes much less than this. Absorption enhancements and the estimated fraction of absorption from BrC show good correlation with the elemental-carbon-to-organic-carbon ratio (EC ∕ OC) of emitted aerosols and weaker correlation with the modified combustion efficiency (MCE). Previous studies have shown that BrC grows darker (larger imaginary refractive index) as the ratio of black to organic aerosol (OA) mass increases. This study is consistent with those findings but also demonstrates that the fraction of total absorption attributable to BrC shows the opposite trend: increasing as the organic fraction of aerosol emissions increases and the EC ∕ OC ratio decreases.
Chemical inhibitors of histone deacetylase (HDAC) activity are used as experimental tools to induce histone hyperacetylation and deregulate gene transcription, but it is not known whether the ...inhibition of HDACs plays any part in the normal physiological regulation of transcription. Using both in vitro and in vivo assays, we show that lactate, which accumulates when glycolysis exceeds the cell's aerobic metabolic capacity, is an endogenous HDAC inhibitor, deregulating transcription in an HDAC-dependent manner. Lactate is a relatively weak inhibitor (IC(50) 40 mM) compared to the established inhibitors trichostatin A and butyrate, but the genes deregulated overlap significantly with those affected by low concentrations of the more potent inhibitors. HDAC inhibition causes significant up and downregulation of genes, but genes that are associated with HDAC proteins are more likely to be upregulated and less likely to be downregulated than would be expected. Our results suggest that the primary effect of HDAC inhibition by endogenous short-chain fatty acids like lactate is to promote gene expression at genes associated with HDAC proteins. Therefore, we propose that lactate may be an important transcriptional regulator, linking the metabolic state of the cell to gene transcription.
Coastal development is spreading along the World's coasts. Sea levels are rising, so major future asset losses are expected. Planned retreat from the sea behind natural ecological defences is one ...adaptation option. To maintain it, land could be set aside for colonisation by coastal ecosystems, or buildings constructed on condition they are removed when sea level reaches a specified distance from the building.
Similarities among coastal issues in high-income countries encouraged us to produce a generalisable analytical framework for exploring planned retreat. We applied it to South East Queensland, Australia, where the option of planned retreat is disappearing because (1) State Government promotes population increase; (2) the need to provide places for naturally protective coastal ecosystems to occupy does not seem urgent, so houses are built there; (3) liability laws favour development; (4) planning ignores cumulative impacts, the path dependent nature of development and irreversible social–ecological threshold changes; (5) political pressure to build defences grows as the value of built assets increases.
To implement planned retreat, changes to coastal governance would be needed, for which we propose five guiding principles: (a) allocate authority and resources between levels of governance according to their effectiveness at each level; (b) strengthen development rules and incentives to relocate as an unwanted threshold is approached; (c) allow for uncertainties by enabling rules and incentives to be changed when circumstances change; (d) reassign public and private benefits, costs, risks, uncertainties and responsibilities from governments to beneficiaries of development; (e) institutionalise catastrophes as opportunities for change, not signals to rebuild.
Following from this research, one of our next the priorities is the psychology of social change, uncertainty, rights, obligations, incentives and trust. The other is to extend and deepen understanding of the responses of developers, bankers, insurers, house buyers, sellers and owners to changes in development rules and incentives.
Quantitative proteomic analysis with mass spectrometry holds great promise for simultaneously quantifying proteins in various biosamples, such as human plasma. Thus far, studies addressing the ...reproducible measurement of endogenous protein concentrations in human plasma have focussed on targeted analyses employing isotopically labelled standards. Non-targeted proteomics, on the other hand, has been less employed to this end, even though it has been instrumental in discovery proteomics, generating large datasets in multiple fields of research.
Using a non-targeted mass spectrometric assay (LCMSE), we quantified abundant plasma proteins (43 mg/mL-40 ug/mL range) in human blood plasma specimens from 30 healthy volunteers and one blood serum sample (ProteomeXchange: PXD000347). Quantitative results were obtained by label-free mass spectrometry using a single internal standard to estimate protein concentrations. This approach resulted in quantitative results for 59 proteins (cut off ≥11 samples quantified) of which 41 proteins were quantified in all 31 samples and 23 of these with an inter-assay variability of ≤ 20%. Results for 7 apolipoproteins were compared with those obtained using isotope-labelled standards, while 12 proteins were compared to routine immunoassays. Comparison of quantitative data obtained by LCMSE and immunoassays showed good to excellent correlations in relative protein abundance (r = 0.72-0.96) and comparable median concentrations for 8 out of 12 proteins tested. Plasma concentrations of 56 proteins determined by LCMSE were of similar accuracy as those reported by targeted studies and 7 apolipoproteins quantified by isotope-labelled standards, when compared to reference concentrations from literature.
This study shows that LCMSE offers good quantification of relative abundance as well as reasonable estimations of concentrations of abundant plasma proteins.
Advances in genomics have expedited the improvement of several agriculturally important crops but similar efforts in wheat (Triticum spp.) have been more challenging. This is largely owing to the ...size and complexity of the wheat genome
, and the lack of genome-assembly data for multiple wheat lines
. Here we generated ten chromosome pseudomolecule and five scaffold assemblies of hexaploid wheat to explore the genomic diversity among wheat lines from global breeding programs. Comparative analysis revealed extensive structural rearrangements, introgressions from wild relatives and differences in gene content resulting from complex breeding histories aimed at improving adaptation to diverse environments, grain yield and quality, and resistance to stresses
. We provide examples outlining the utility of these genomes, including a detailed multi-genome-derived nucleotide-binding leucine-rich repeat protein repertoire involved in disease resistance and the characterization of Sm1
, a gene associated with insect resistance. These genome assemblies will provide a basis for functional gene discovery and breeding to deliver the next generation of modern wheat cultivars.
Following the 2016 M7.8 Kaikōura earthquake, a time-varying seismic hazard model (KSHM) was developed to inform decision-making for the reinstatement of road and rail networks in the northern South ...Island. The source model is the sum of a gridded 100-year earthquake clustering model and an updated fault source model. The gridded model comprises long-term, medium-term and short-term components. The 100-year gridded model is constructed as the sum of 100 annual forecasts. A discounting method trades off expected earthquake occurrences of the distant future against those of the near future. The fault source model includes updates to account for newly revealed faults that ruptured in the Kaikōura earthquake and other recently obtained new information, and new time-varying probabilities of rupture for four fault segments. Two different characterisations of the Hikurangi subduction interface are incorporated via a logic tree, with weights determined by an expert panel. A suite of ground motion prediction equations contribute to a logic tree in order to account for epistemic uncertainties in source modelling for each of four tectonic region types. Here, we compare the resulting hazard estimates with the 2010 National Seismic hazard Model and recorded motions in past New Zealand and global earthquakes.