Evidence-based dietary information represented as unstructured text is a crucial information that needs to be accessed in order to help dietitians follow the new knowledge arrives daily with newly ...published scientific reports. Different named-entity recognition (NER) methods have been introduced previously to extract useful information from the biomedical literature. They are focused on, for example extracting gene mentions, proteins mentions, relationships between genes and proteins, chemical concepts and relationships between drugs and diseases. In this paper, we present a novel NER method, called drNER, for knowledge extraction of evidence-based dietary information. To the best of our knowledge this is the first attempt at extracting dietary concepts. DrNER is a rule-based NER that consists of two phases. The first one involves the detection and determination of the entities mention, and the second one involves the selection and extraction of the entities. We evaluate the method by using text corpora from heterogeneous sources, including text from several scientifically validated web sites and text from scientific publications. Evaluation of the method showed that drNER gives good results and can be used for knowledge extraction of evidence-based dietary recommendations.
Automatic food image recognition systems are alleviating the process of food-intake estimation and dietary assessment. However, due to the nature of food images, their recognition is a particularly ...challenging task, which is why traditional approaches in the field have achieved a low classification accuracy. Deep neural networks have outperformed such solutions, and we present a novel approach to the problem of food and drink image detection and recognition that uses a newly-defined deep convolutional neural network architecture, called NutriNet. This architecture was tuned on a recognition dataset containing 225,953 512 × 512 pixel images of 520 different food and drink items from a broad spectrum of food groups, on which we achieved a classification accuracy of 86 . 72 % , along with an accuracy of 94 . 47 % on a detection dataset containing 130 , 517 images. We also performed a real-world test on a dataset of self-acquired images, combined with images from Parkinson's disease patients, all taken using a smartphone camera, achieving a top-five accuracy of 55 % , which is an encouraging result for real-world images. Additionally, we tested NutriNet on the University of Milano-Bicocca 2016 (UNIMIB2016) food image dataset, on which we improved upon the provided baseline recognition result. An online training component was implemented to continually fine-tune the food and drink recognition model on new images. The model is being used in practice as part of a mobile app for the dietary assessment of Parkinson's disease patients.
Recently, food science has been garnering a lot of attention. There are many open research questions on food interactions, as one of the main environmental factors, with other health-related entities ...such as diseases, treatments, and drugs. In the last 2 decades, a large amount of work has been done in natural language processing and machine learning to enable biomedical information extraction. However, machine learning in food science domains remains inadequately resourced, which brings to attention the problem of developing methods for food information extraction. There are only few food semantic resources and few rule-based methods for food information extraction, which often depend on some external resources. However, an annotated corpus with food entities along with their normalization was published in 2019 by using several food semantic resources.
In this study, we investigated how the recently published bidirectional encoder representations from transformers (BERT) model, which provides state-of-the-art results in information extraction, can be fine-tuned for food information extraction.
We introduce FoodNER, which is a collection of corpus-based food named-entity recognition methods. It consists of 15 different models obtained by fine-tuning 3 pretrained BERT models on 5 groups of semantic resources: food versus nonfood entity, 2 subsets of Hansard food semantic tags, FoodOn semantic tags, and Systematized Nomenclature of Medicine Clinical Terms food semantic tags.
All BERT models provided very promising results with 93.30% to 94.31% macro F1 scores in the task of distinguishing food versus nonfood entity, which represents the new state-of-the-art technology in food information extraction. Considering the tasks where semantic tags are predicted, all BERT models obtained very promising results once again, with their macro F1 scores ranging from 73.39% to 78.96%.
FoodNER can be used to extract and annotate food entities in 5 different tasks: food versus nonfood entities and distinguishing food entities on the level of food groups by using the closest Hansard semantic tags, the parent Hansard semantic tags, the FoodOn semantic tags, or the Systematized Nomenclature of Medicine Clinical Terms semantic tags.
The COVID-19 pandemic affects all aspects of human life including their food consumption. The changes in the food production and supply processes introduce changes to the global dietary patterns.
To ...study the COVID-19 impact on food consumption process, we have analyzed two data sets that consist of food preparation recipes published before (69,444) and during the quarantine (10,009) period. Since working with large data sets is a time-consuming task, we have applied a recently proposed artificial intelligence approach called DietHub. The approach uses the recipe preparation description (i.e. text) and automatically provides a list of main ingredients annotated using the Hansard semantic tags. After extracting the semantic tags of the ingredients for every recipe, we have compared the food consumption patterns between the two data sets by comparing the relative frequency of the ingredients that compose the recipes.
Using the AI methodology, the changes in the food consumption patterns before and during the COVID-19 pandemic are obvious. The highest positive difference in the food consumption can be found in foods such as “Pulses/ plants producing pulses”, “Pancake/Tortilla/Outcake”, and “Soup/pottage”, which increase by 300%, 280%, and 100%, respectively. Conversely, the largest decrease in consumption can be food for food such as “Order Perciformes (type of fish)”, “Corn/cereals/grain”, and “Wine-making”, with a reduction of 50%, 40%, and 30%, respectively. This kind of analysis is valuable in times of crisis and emergencies, which is a very good example of the scientific support that regulators require in order to take quick and appropriate response.
•COVID-19 influence on food consumption patterns.•Utilization of AI methodology for food semantic annotation.•Insight into quarantine food consumption patterns.
The present study tested the combination of an established and a validated food-choice research method (the 'fake food buffet') with a new food-matching technology to automate the data collection and ...analysis.
The methodology combines fake-food image recognition using deep learning and food matching and standardization based on natural language processing. The former is specific because it uses a single deep learning network to perform both the segmentation and the classification at the pixel level of the image. To assess its performance, measures based on the standard pixel accuracy and Intersection over Union were applied. Food matching firstly describes each of the recognized food items in the image and then matches the food items with their compositional data, considering both their food names and their descriptors.
The final accuracy of the deep learning model trained on fake-food images acquired by 124 study participants and providing fifty-five food classes was 92·18 %, while the food matching was performed with a classification accuracy of 93 %.
The present findings are a step towards automating dietary assessment and food-choice research. The methodology outperforms other approaches in pixel accuracy, and since it is the first automatic solution for recognizing the images of fake foods, the results could be used as a baseline for possible future studies. As the approach enables a semi-automatic description of recognized food items (e.g. with respect to FoodEx2), these can be linked to any food composition database that applies the same classification and description system.
Incomparable and insufficiently detailed information on dietary intakes are common challenges associated with dietary assessment methods. Being a European Union country, Slovenia is expected to ...conduct national food consumption studies in line with harmonised EU Menu methodology. The present study aimed to describe the methodology and protocols in the Slovenian nationally representative dietary survey SI.Menu 2017/18, and to assess population dietary habits with respect to food consumption and energy and macronutrient intakes. While the study targeted various population groups, this report is focused on adults. A representative sample of participants was randomly selected from the Central Register of Population according to sex, age classes and place of residency, following a two-stage stratified sampling procedure. Information on food consumption was collected with two non-consecutive 24-h dietary recalls using a web-based Open Platform for Clinical Nutrition (OPEN) software. Data were complemented with a food propensity questionnaire to adjust for usual intake distribution. Altogether, 364 adults (18–64 years) and 416 elderlies (65–74 years) were included in the data analyses. Study results highlighted that observed dietary patterns notably differ from food-based dietary guidelines. Typical diets are unbalanced due to high amounts of consumed meat and meat products, foods high in sugar, fat and salt, and low intake of fruits and vegetables and milk and dairy products. Consequently, the energy proportion of carbohydrates, proteins, and to some extent, free sugars and total fats, as well as intake of dietary fibre and total water deviates from the reference values. Age and sex were significantly marked by differences in dietary intakes, with particularly unfavourable trends in adults and men. Study results call for adoption of prevention and public health intervention strategies to improve dietary patterns, taking into account population group differences. In addition, all developed protocols and tools will be useful for further data collection, supporting regular dietary monitoring systems and trend analyses.
Missing data are a common problem in most research fields and introduce an element of ambiguity into data analysis. They can arise due to different reasons: mishandling of samples, measurement error, ...deleted aberrant value or simply lack of analysis. The nutrition domain is no exception to the problem of missing data. This paper addresses the problem of missing data in food composition databases (FCDBs). Missing data in FCDBs results in incomplete FCDBs, which have limited usage, because any dietary assessment can be performed only on a complete dataset. Most often, this problem is resolved by calculating means/medians from excising data in the same database or borrowing data from other FCDBs. These solutions introduce significant error. We focus on missing data imputation techniques based on methods for substituting missing values with statistical prediction: Non-Negative Matrix Factorization (NMF), Multiple Imputations by Chained Equations (MICE), Nonparametric Missing Value Imputation using Random Forest (MissForest), and K-Nearest Neighbors (KNN), and compared them with commonly used approaches - fill-in with mean, fill-in with median. The data used was from national FCDBs collected by EuroFIR (European Food Information Resource Network). The results show that the state-of-the-art methods for imputation yield better results than the traditional approaches.
Display omitted
•Missing food composition data.•Statistical Methods.•Better quality food composition databases.
The European Food Safety Authority has developed a standardized food classification and description system called FoodEx2. It uses facets to describe food properties and aspects from various ...perspectives, making it easier to compare food consumption data from different sources and perform more detailed data analyses. However, both food composition data and food consumption data, which need to be linked, are lacking in FoodEx2 because the process of classification and description has to be manually performed-a process that is laborious and requires good knowledge of the system and also good knowledge of food (composition, processing, marketing, etc.). In this paper, we introduce a semi-automatic system for classifying and describing foods according to FoodEx2, which consists of three parts. The first involves a machine learning approach and classifies foods into four FoodEx2 categories, with two for single foods: raw (r) and derivatives (d), and two for composite foods: simple (s) and aggregated (c). The second uses a natural language processing approach and probability theory to describe foods. The third combines the result from the first and the second part by defining post-processing rules in order to improve the result for the classification part. We tested the system using a set of food items (from Slovenia) manually-coded according to FoodEx2. The new semi-automatic system obtained an accuracy of 89% for the classification part and 79% for the description part, or an overall result of 79% for the whole system.
We compared three interventions designed for reducing the consumption of sugar-sweetened beverages (SSBs) aimed at decreasing the risk of overweight and obesity among children. We included three ...experimental (n = 508) and one control school (n = 164) in Slovenia (672 children; 10-16 years) to evaluate interventions that influence behaviour change via environmental (E), communication (C), or combined (i.e., double) environmental and communication approaches (EC) compared to no intervention (NOI). Data of children from the 'intervention' and 'non-intervention' schools were compared before and after the interventions. The quantity of water consumed (average, mL/day) by children increased in the C and EC schools, while it decreased in the E and NOI schools. Children in the C and EC schools consumed less beverages with sugar (SSBs + fruit juices), and sweet beverages (beverages with: sugar, low-calorie and/or noncaloric sweeteners) but consumed more juices. The awareness about the health risks of SSB consumption improved among children of the 'combined intervention' EC school and was significantly different from the awareness among children of other schools (
= 0.03). A communication intervention in the school environment has more potential to reduce the intake of SSBs than a sole environmental intervention, but optimum results can be obtained when combined with environmental changes.
Vitamin D is involved in calcium and phosphorus metabolism, and is vital for numerous bodily functions. In the absence of sufficient UV-B light-induced skin biosynthesis, dietary intake becomes the ...most important source of vitamin D. In the absence of biosynthesis, the recommended dietary vitamin D intake is 10–20 µg/day. Major contributors to dietary vitamin D intake are the few foods naturally containing vitamin D (i.e., fish), enriched foods, and supplements. The present study aimed to estimate the vitamin D intake in Slovenia, to identify food groups that notably contribute to vitamin D intake, and to predict the effects of hypothetical mandatory milk fortification. This study was conducted using data collected by the national cross-sectional food consumption survey (SI.Menu) in adolescents (n = 468; 10–17 years), adults (n = 364; 18–64 years), and the elderly (n = 416; 65–74 years). Data collection was carried out between March 2017 and April 2018 using the EU Menu Methodology, which included two 24-h recalls, and a food propensity questionnaire. Very low vitamin D intakes were found; many did not even meet the threshold for very low vitamin D intake (2.5 µg/day). Mean daily vitamin D intake was 2.7, 2.9, and 2.5 µg in adolescents, adults, and the elderly, respectively. Daily energy intake was found to be a significant predictor of vitamin D intake in all population groups. In adolescents and adults, sex was also found to be a significant predictor, with higher vitamin D intake in males. The study results explained the previously reported high prevalence of vitamin D deficiency in Slovenia. An efficient policy approach is required to address the risk of vitamin D deficiency, particularly in vulnerable populations.