•Electric vehicles (EVs) charge with nonlinear profiles where power decreases over time.•Smart charging for EV fleets aims to maximize use of charging infrastructure.•Infrastructure capacity is ...wasted if decreasing power is not taken into account.•Machine learning models can be used to predict EV power draw.•Smart charging supported by such models use the infrastructure more effectively.
The ongoing electrification of mobility comes with the challenge of charging electric vehicles (EVs) sufficiently while charging infrastructure capacities are limited. Smart charging algorithms produce charge plans for individual EVs and aim to assign charging capacities fairly and efficiently between vehicles in a fleet. In practice, EV charging processes follow nonlinear charge profiles such as constant-current, constant-voltage (CCCV). Smart charging must consider charge profiles in order to avoid gaps between charge plans and actual EV power consumption. Generally valid models of charge profiles and their parameters for a diverse set of EVs are not publicly available. In this work we propose a data-driven approach for integrating a machine learning model to predict arbitrary charge profiles into a smart charging algorithm. We train machine learning models with a dataset consisting of charging processes from the workplace gathered in 2016–2018 from a heterogeneous EV fleet of 1001 EVs with 18 unique models. Each charging process includes the time series of charging power. After preprocessing, the dataset contains 10.595 charging processes leading to 1.2 million data points in total. We then compare different machine learning models for charge profile predictions finding that XGBoost yields the most accurate predictions with a mean absolute error (MAE) of 126W and a relative MAE of 0.06. Simulations show that smart charging with the integrated XGBoost model leads to a more effective infrastructure usage with up to 21% more energy charged compared to smart charging without considering charge profiles. Furthermore, an ablation study on regression model features shows the EV’s model is not a necessary attribute for accurate charge profile predictions. However, charging features are required including the number of phases used for charging.
Display omitted
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
In the field of pervasive computing, wearable devices have been widely used for recognizing human activities. One important area in this research is the recognition of activities of daily living ...where especially inertial sensors and interaction sensors (like RFID tags with scanners) are popular choices as data sources. Using interaction sensors, however, has one drawback: they may not differentiate between proper interaction and simple touching of an object. A positive signal from an interaction sensor is not necessarily caused by a performed activity e.g., when an object is only touched but no interaction occurred afterwards. There are, however, many scenarios like medicine intake that rely heavily on correctly recognized activities. In our work, we aim to address this limitation and present a multimodal egocentric-based activity recognition approach. Our solution relies on object detection that recognizes activity-critical objects in a frame. As it is infeasible to always expect a high quality camera view, we enrich the vision features with inertial sensor data that monitors the users' arm movement. This way we try to overcome the drawbacks of each respective sensor. We present our results of combining inertial and video features to recognize human activities on different types of scenarios where we achieve an F 1 -measure of up to 79.6%.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Arrow R-CNN for handwritten diagram recognition Schäfer, Bernhard; Keuper, Margret; Stuckenschmidt, Heiner
International journal on document analysis and recognition,
06/2021, Volume:
24, Issue:
1-2
Journal Article
Peer reviewed
Open access
We address the problem of offline handwritten diagram recognition. Recently, it has been shown that diagram symbols can be directly recognized with deep learning object detectors. However, object ...detectors are not able to recognize the diagram structure. We propose Arrow R-CNN, the first deep learning system for joint symbol and structure recognition in handwritten diagrams. Arrow R-CNN extends the Faster R-CNN object detector with an arrow head and tail keypoint predictor and a diagram-aware postprocessing method. We propose a network architecture and data augmentation methods targeted at small diagram datasets. Our diagram-aware postprocessing method addresses the insufficiencies of standard Faster R-CNN postprocessing. It reconstructs a diagram from a set of symbol detections and arrow keypoints. Arrow R-CNN improves state-of-the-art substantially: on a scanned flowchart dataset, we increase the rate of recognized diagrams from 37.7 to 78.6%.
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
The tremendous growth of educational institutions’ electronic data provides the opportunity to extract information that can be used to predict students’ overall success, predict students’ dropout ...rate, evaluate the performance of teachers and instructors, improve the learning material according to students’ needs, and much more. This paper aims to review the latest trends in predicting students’ performance in higher education. We provide a comprehensive background for understanding Educational Data Mining (EDM). We also explain the measures of determining academic success and highlight the strengths and weaknesses of the most common data mining (DM) tools and methods used nowadays. Moreover, we provide a rich literature review of the EDM work that has been published during the past 12 years (2007–2018) with focus on the prediction of academic performance in higher education. We analyze the most commonly used features and methods in predicting academic achievement, and highlight the benefits of the mostly used DM tools in EDM. The results of this paper could assist researchers and educational planners who are attempting to carry out EDM solutions in the domain of higher education as we highlight the type of features that the previous researches found to have significant impact on the prediction, as well as the benefits and drawbacks of the DM methods and tools used for predicting academic outcomes.
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Working with multimodal datasets is a challenging task as it requires annotations which often are time consuming and difficult to acquire. This includes in particular video recordings which often ...need to be watched as a whole before they can be labeled. Additionally, other modalities like acceleration data are often recorded alongside a video. For that purpose, we created an annotation tool that enables to annotate datasets of video and inertial sensor data. In contrast to most existing approaches, we focus on semi-supervised labeling support to infer labels for the whole dataset. This means, after labeling a small set of instances our system is able to provide labeling recommendations. We aim to rely on the acceleration data of a wrist-worn sensor to support the labeling of a video recording. For that purpose, we apply template matching to identify time intervals of certain activities. We test our approach on three datasets, one containing warehouse picking activities, one consisting of activities of daily living and one about meal preparations. Our results show that the presented method is able to give hints to annotators about possible label candidates.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
The tremendous growth in electronic educational data creates the need to have meaningful information extracted from it. Educational Data Mining (EDM) is an exciting research area that can reveal ...valuable knowledge from educational databases. This knowledge can be used for many purposes, including identifying dropouts or weak students who need special attention and discovering extraordinary students who can be offered lifetime opportunities. Although former studies in EDM used an extensive range of features for predicting students’ academic achievement (in terms of (i) achieved grades or (ii) passing and failing), those features are sometimes not obtainable for practical usage, and therefore, the prediction models are not feasible for employment. This study uses data mining (DM) algorithms to predict the academic performance of master’ s students by using a non-extensive data set and including only the features that are easy to collect at the beginning of a studying program. To perform this study, we have collected over 700 students' records from 2010 to 2018 from the Faculty of Business Informatics and Mathematics at the University of Mannheim in Germany. Those records include demographics and post-enrollment features such as semester grades. The empirical results show the following: (i) the most significant features for predicting students' academic achievements are the students’ grades in each semester (importance rate between 14 and 36%), followed by the distance from students’ accommodation to university (importance rate between 6 and 18%) and culture (importance rate between 7 and 17%). On the other hand, gender, age, the numbers of failed courses, and the number of registered and unregistered exams per semester are less significant for the predictions. (ii) As expected, predictions performed after the second semester is more accurate than those performed after the first semester. (iii) Unsurprisingly, models that predict two classes yield better results than those that predict three. (iv) Random Forest classifier performs the best in all prediction models (0.77–0.94 accuracy), and using oversampling methods to deal with imbalanced data can significantly improve the performance of DM methods. For future work, we recommend testing the predictive models on other master programs and a larger datasets. Furthermore, we recommend investigating other oversampling approaches.
Demand forecasting is an important task for retailers as it is required for various operational decisions. One key challenge is to forecast demand on special days that are subject to vastly different ...demand patterns than on regular days. We present the case of a bakery chain with an emphasis on special calendar days, for which we address the problem of forecasting the daily demand for different product categories at the store level. Such forecasts are an input for production and ordering decisions. We treat the forecasting problem as a supervised machine learning task and provide an evaluation of different methods, including artificial neural networks and gradient-boosted decision trees. In particular, we outline and discuss the possibility of formulating a classification instead of a regression problem. An empirical comparison with established approaches reveals the superiority of machine learning methods, while classification-based approaches outperform regression-based approaches. We also found that machine learning methods not only provide more accurate forecasts but are also more suitable for applications in a large-scale demand forecasting scenario that often occurs in the retail industry.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
Advances in wearable technologies have the ability to revolutionize and improve people's lives. The gains go beyond the personal sphere, encompassing business and, by extension, the global economy. ...The technologies are incorporated in electronic devices that collect data from consumers' bodies and their immediate environment. Human activities recognition, which involves the use of various body sensors and modalities either separately or simultaneously, is one of the most important areas of wearable technology development. In real-life scenarios, the number of sensors deployed is dictated by practical and financial considerations. In the research for this article, we reviewed our earlier efforts and have accordingly reduced the number of required sensors, limiting ourselves to first-person vision data for activities recognition. Nonetheless, our results beat state of the art by more than 4% of F1 score.
Retailers that offer perishable items are required to make hundreds of ordering decisions on a daily basis. For certain products, it is even necessary to make intraday decisions in order to increase ...the freshness of the goods while still serving the demand. We present a use case from the bakery domain where a part of the assortment has to be baked during the day as the delivered goods are not ready for sale. Hence, the operational performance depends on the decisions of the store personnel which can be optimized by a decision support system. Our approach to tackle this problem consists of two distinct phases: First, we forecast the hourly demand for each product. Second, the forecasts are input for a scheduling problem whose solution represents the baking plan that is provided to the store personnel. Based on our empirical evaluation, we conclude that forecasting accuracy has the biggest impact on the operational performance. More enhanced prediction methods noticeably outperform the reference methods. In particular, the machine learning based forecasting model significantly outperforms established time series models. If the computed schedules are executed as suggested, the customers can be served with freshly baked goods.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
The Semantic Web is commonly perceived as a web of partially interlinked machine readable data. This data is inherently distributed and resembles the structure of the web in terms of resources being ...provided by different parties at different physical locations. A number of infrastructures for storing and querying distributed semantic web data, primarily encoded in RDF have been developed but almost all the work on description logic reasoning as a basis for implementing inference in the Web Ontology Language OWL still assumes a centralized approach where the complete terminology has to be present on a single system and all inference steps are carried out on this system.
We propose a distributed reasoning method that preserves soundness and completeness of reasoning under the original OWL import semantics. The method is based on resolution methods for \documentclass12pt{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$\mathcal{ALCHIQ}$\end{document} ontologies that we modify to work in a distributed setting. Results show a promising runtime decrease compared to centralized reasoning and indicate that benefits from parallel computation trade off the overhead caused by communication between the local reasoners.