• An unsupervised sentence representation (embedding) method is proposed.
• Our method uses word embeddings and Information Theoretic principles behind TF–IDF.
• Word embeddings of a sentence contribute with their associated word information.
• The proposed method is modular and identifiable at sentence level.
• Results on well-known sentence similarity benchmarks are highly competitive.
Sentence representation at the semantic level is a challenging task for natural language processing and artificial intelligence. Despite advances in word embeddings (i.e., word vector representations), capturing sentence meaning remains an open question because of the complexity of semantic interactions among words. In this paper, we present an unsupervised embedding method that learns sentence representations from unlabeled text. The method models a sentence as a weighted series of word embeddings. The weights of the series are fitted using Shannon’s Mutual Information (MI) among words, sentences and the corpus. In fact, the Term Frequency–Inverse Document Frequency transform (TF–IDF) is a reliable estimate of such MI. Our method offers advantages over existing ones: identifiable modules, short-term training, online inference of (unseen) sentence representations, as well as independence from domain, external knowledge and linguistic annotation resources. Results showed that our model, despite its concreteness and low computational cost, was competitive with the state of the art in well-known Semantic Textual Similarity (STS) tasks.
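The idea of a sentence as a TF–IDF-weighted series of word embeddings can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a plain TF–IDF weighting (relative term frequency times log inverse sentence frequency), and the three-sentence corpus, the two-dimensional `word_vectors` and all function names are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter


def tfidf_weights(sentences):
    """Return one {word: tf-idf weight} dict per tokenized sentence."""
    n = len(sentences)
    # Sentence frequency: in how many sentences each word occurs.
    df = Counter(word for sent in sentences for word in set(sent))
    weights = []
    for sent in sentences:
        tf = Counter(sent)
        weights.append(
            {w: (tf[w] / len(sent)) * math.log(n / df[w]) for w in tf}
        )
    return weights


def sentence_embedding(word_weights, word_vectors, dim):
    """Weighted sum of word vectors, with TF-IDF values as the weights."""
    vec = [0.0] * dim
    for word, weight in word_weights.items():
        for i, x in enumerate(word_vectors[word]):
            vec[i] += weight * x
    return vec


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Hypothetical toy corpus and toy 2-d word vectors (assumptions, not real data).
corpus = [["the", "cat", "sat"], ["the", "dog", "sat"], ["the", "stock", "fell"]]
word_vectors = {
    "the": [0.3, 0.3], "cat": [1.0, 0.0], "dog": [0.9, 0.1],
    "sat": [0.5, 0.5], "stock": [0.0, 1.0], "fell": [0.1, 0.9],
}

weights = tfidf_weights(corpus)
embeddings = [sentence_embedding(w, word_vectors, dim=2) for w in weights]

sim_cat_dog = cosine(embeddings[0], embeddings[1])
sim_cat_stock = cosine(embeddings[0], embeddings[2])
print(sim_cat_dog > sim_cat_stock)
```

In this toy setting, a word that occurs in every sentence ("the") receives zero weight and drops out of the sum, so the sentence vector is dominated by its more informative words.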
Abstract
RegulonDB, first published 20 years ago, is a comprehensive electronic resource about regulation of transcription initiation of Escherichia coli K-12 with decades of knowledge from classic molecular biology experiments, and recently also from high-throughput genomic methodologies. We curated the literature to keep RegulonDB up to date, and initiated curation of ChIP and gSELEX experiments. We estimate that current knowledge describes between 10% and 30% of the expected total number of transcription factor-gene regulatory interactions in E. coli. RegulonDB provides datasets for interactions for which there is no evidence that they affect expression, as well as expression datasets. We developed a proof-of-concept pipeline to merge binding and expression evidence to identify regulatory interactions. These datasets can be visualized in the RegulonDB JBrowse. We developed the Microbial Conditions Ontology with a controlled vocabulary for the minimal properties needed to reproduce an experiment, which helps integrate data from high-throughput experiments and the classic literature. At a higher level of integration, we report Genetic Sensory-Response Units for 200 transcription factors, including their regulation at the metabolic level, and include summaries for 70 of them. Finally, we summarize our research with natural language processing strategies to enhance our biocuration work.
Lean management principles include customer focus, continuous improvement and quality through waste reduction for organizations, including IT, with benefits such as bringing people together and changing paradigms. In this context, this paper presents a literature review on the current characteristics of lean manufacturing applied to information technology departments or companies. The methodology includes the selection of documents from the Scopus and Web of Science databases, a bibliometric study and a literature review with the selected documents. The bibliometric study identified the document type, language, year of publication, authors, author network, author’s H-index, affiliation, keyword clusters, country or territory and field of knowledge. The VOSviewer software was used to create the maps of the author network and keyword clusters. The literature review presented the main topics of Lean IT, which is a relatively new field of knowledge yet to be fully explored and researched. The results showed that there is no typical pattern regarding the number of publications, and that the concept is still much more used in business than in academia. There is no core of well-defined authors. The Lean methodology is not limited to the so-called "factory floor": it has been adopted in several other areas, although 50% of the documents are concentrated in two areas, Computer Science and Engineering. Hopefully this paper can contribute as a source of data for future studies on lean management within IT.
The present study aimed to explore the motivations of Brazilian dog owners and their knowledge about the risks related to raw meat-based diets (RMBD) as well as to evaluate important enteropathogens such as Salmonella spp., C. perfringens, and C. difficile, in feces of dogs fed different diets. The majority of the pet owners (69.3%) reported having chosen this diet for their dogs because they considered it more "natural". A large number of owners declared that RMBD do not pose health risks for their animals (87.9%) or humans (98.8%), even though about one third of the respondents (34.8%) declared having at least one individual at high risk of infection in contact with RMBD-fed dogs. Stool samples from 46 RMBD-fed dogs and 192 dogs fed commercial dry feed were collected. The present study revealed that dogs fed raw meat diets were almost 30 times more likely to be positive for Salmonella spp. than dogs on a conventional diet. Some of the serovars detected were commonly associated with human salmonellosis, such as S. Typhimurium and S. Saintpaul, and were multidrug resistant. RMBD-fed dogs were more likely to be positive for C. perfringens type A (p = 0.008) and one C. perfringens type F was isolated from these animals. Two toxigenic strains (4.3%) of C. difficile were isolated only from raw meat-fed dogs, both of which were under antibiotic therapy. These toxigenic C. difficile isolates were classified as RT106/ST54 and RT600/ST149, previously associated with infection in dogs and humans. The present work revealed that the owners tend to ignore or are unaware of the risks associated with raw meat diets for dogs. Also, the higher fecal shedding of important enteropathogens in dogs fed RMBD suggests that this diet poses a risk for the animals and the people in contact with them.
Mg rechargeable batteries (MgRBs) represent a safe and high-energy battery technology but suffer from the lack of suitable cathode materials due to the slow solid-state diffusion of the highly polarizing divalent Mg ion. Previous methods improve performance at the cost of incompatibility with anode/electrolyte and a drastic decrease in volumetric energy density. Herein we report interlayer expansion as a general and effective atomic-level lattice engineering approach to transform inactive intercalation hosts into efficient Mg storage materials without introducing adverse side effects. As a proof of concept, we have combined theory, synthesis, electrochemical measurement, and kinetic analysis to improve Mg diffusion behavior in MoS2, which is a poor Mg transporting material in its pristine form. First-principles simulations suggest that expanded interlayer spacing allows for fast Mg diffusion because of weakened Mg–host interactions. Experimentally, the expansion was realized by inserting a controlled amount of poly(ethylene oxide) into the lattice of MoS2 to increase the interlayer distance from 0.62 nm to up to 1.45 nm. The expansion boosts Mg diffusivity by 2 orders of magnitude, effectively enabling the otherwise barely active MoS2 to approach its theoretical storage capacity as well as to achieve one of the highest rate capabilities among Mg-intercalation materials. The interlayer expansion approach can be applied to a wide range of host materials for the storage of various ions, leading to novel intercalation chemistry and opening up new opportunities for the development of advanced materials for next-generation energy storage.
• This is the first report of Enterocytozoon bieneusi in cattle in Brazil.
• E. bieneusi is common in Brazilian cattle: it was found on 9 of the 10 farms and in 17.5% of the cattle studied.
• A high genetic diversity of E. bieneusi was found in cattle, comprising 12 genotypes, of which 7 were detected for the first time.
• The study also showed that cattle should be considered as potential sources of human infective spores.
Microsporidia are widely recognized as important human pathogens with Enterocytozoon bieneusi as the most common species infecting humans and animals, including cattle. Although Brazil has the second largest cattle herd in the world and is the largest exporter of beef, there are no data on the presence or impact of E. bieneusi on this important population. To fill this knowledge gap, fecal specimens were collected from 452 cattle, ranging from pre-weaned calves to adult cattle, in the state of Rio de Janeiro. Host factors including age, gender, dairy/beef, body composition, and fecal consistency were included in the study. Using molecular methods, E. bieneusi was found in 79/452 (17.5%) fecal specimens. This represents the first report of this parasite in Brazilian cattle. A significantly higher prevalence was found in calves less than 2 months of age (27.6%) and those 3–8 months of age (28.8%) versus heifers (14.1%) and adults (1.4%) (P<0.05). Dairy cattle (26.2%) had a higher prevalence than beef cattle (9.7%) (P<0.001). No correlation was found between infection and gender, body composition, or fecal consistency. Molecular characterization of the internal transcribed spacer (ITS) revealed 12 genotypes: five previously reported in cattle (BEB4, BEB8, D, EbpA and I), and seven novel genotypes (BEB11–BEB17). A phylogenetic analysis showed that 6 genotypes (D, EbpA, BEB12, BEB13, BEB15, and BEB16) identified in 18 animals clustered within the designated zoonotic Group 1 while the other 6 genotypes (I, BEB4, BEB8, BEB11, BEB14, BEB17) identified in 61 animals clustered within Group 2. The identification of genotypes in Brazilian cattle that have previously been reported in humans highlights the potential risk of zoonotic transmission and suggests that the role of cattle in transmission of human infections requires further study.
Engineering drawings are commonly used across different industries such as oil and gas, mechanical engineering and others. Digitising these drawings is becoming increasingly important, mainly because legacy drawings and documents may provide a rich source of information for industries. Analysing these drawings often requires applying a set of digital image processing methods to detect and classify symbols and other components. Despite the recent significant advances in image processing, and in particular in deep neural networks, automatic analysis and processing of these engineering drawings is still far from being complete. This paper presents a general framework for complex engineering drawing digitisation. A thorough and critical review of relevant literature, methods and algorithms in machine learning and machine vision is presented. A real-life industrial scenario on how to contextualise the digitised information from a specific type of these drawings, namely piping and instrumentation diagrams, is discussed in detail. A discussion of how new trends in machine vision such as deep learning could be applied to this domain is presented with conclusions and suggestions for future research directions.
To reduce health impacts from air pollution, it is important to know the sources contributing to human exposure. This study systematically reviewed and analysed available source apportionment studies on particulate matter (of diameter of 10 and 2.5 microns, PM10 and PM2.5) performed in cities to estimate typical shares of the sources of pollution by country and by region. A database with city source apportionment records, estimated with the use of receptor models, was also developed and is available at the website of the World Health Organization.
Systematic Scopus and Google searches were performed to retrieve city studies of source apportionment for particulate matter. Six source categories were defined. Country and regional averages of source apportionment were estimated based on city population weighting.
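City population weighting of this kind can be sketched as follows. This is only an illustration of a population-weighted mean, not the study's actual procedure: the record layout, the function name `population_weighted_shares`, the two source categories and every number below are hypothetical.

```python
def population_weighted_shares(city_records):
    """Average source shares (%) across cities, weighting each city by population."""
    total_population = sum(r["population"] for r in city_records)
    categories = city_records[0]["shares"].keys()
    return {
        c: sum(r["population"] * r["shares"][c] for r in city_records)
        / total_population
        for c in categories
    }


# Made-up records for two cities in one hypothetical country.
records = [
    {"city": "A", "population": 1_000_000, "shares": {"traffic": 40.0, "other": 60.0}},
    {"city": "B", "population": 3_000_000, "shares": {"traffic": 20.0, "other": 80.0}},
]

country_average = population_weighted_shares(records)
print(country_average)
```

With these made-up inputs the larger city dominates: the traffic share comes out at 25%, below the unweighted mean of 30%.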
A total of 419 source apportionment records from studies conducted in cities of 51 countries were used to calculate regional averages of sources of ambient particulate matter. Based on the available information, globally 25% of urban ambient air pollution from PM2.5 is contributed by traffic, 15% by industrial activities, 20% by domestic fuel burning, 22% from unspecified sources of human origin, and 18% from natural dust and salt. The available source apportionment records exhibit, however, important heterogeneities in assessed source categories and incompleteness in certain countries/regions.
Traffic is one important contributor to ambient PM in cities. To reduce air pollution in cities and the substantial disease burden it causes, solutions to sustainably reduce ambient PM from traffic, industrial activities and biomass burning should urgently be sought. However, further efforts are required to improve data availability and evaluation, and possibly to combine these data with other types of information, with a view to increasing their usefulness for policy making.
• Typical shares of the ambient sources of PM2.5 by country and by region were estimated.
• Traffic has been identified as an important contributor to ambient air pollution in cities.
• A database for Source Apportionment studies as of August 2014 has been compiled.
Summary
Culture and culturally specific beliefs or practices may influence perceptions and decisions, potentially contributing to childhood obesity. The objective of this study is to identify the cultural factors (expressed through decisions, behaviors, individual experiences, perceptions, attitudes, or views) related to childhood and adolescent obesity in Mexico. Ten databases and one search engine were searched from 1995 onwards for qualitative studies. The Sunrise Enabler Model, described within the Cultural Care Theory, guided this review. Data on the sample, the phenomenon of interest, study design, and evaluation were extracted, and the Critical Appraisal Skills Programme tool was used to assess the quality of the included studies. Twenty‐four studies were included. Of these, 12 studies included children or adolescents, 12 included parents, eight included schoolteachers, four included school staff (other than teachers), four included food vendors, and one included policymakers. Cultural values, beliefs, lifeways (especially food and food customs), kinship, and social factors (particularly immediate and extended family) strongly influenced childhood and adolescent obesity‐related lifestyles in Mexico. Most cultural factors related to childhood obesity in Mexico identified in this review may be modifiable and amenable to practical interventions.