Applications of remote sensing (RS) data span several fields, including cartography, surveillance, land-use planning, archaeology, environmental studies, and resource management. However, the amount of RS data has grown considerably with the increasing number of aerial and satellite sensors. With this continuous growth, automated tools for interpreting and analyzing RS big data have become necessary, since manual interpretation is time-consuming and expensive. In this paper, a novel tool for interpreting and analyzing RS big data is described. The proposed system supports knowledge gathering for decision support in RS fields. It helps users make decisions in many RS-related fields by providing descriptive, predictive and prescriptive analytics. The paper outlines the design and development of a framework based on three steps: RS data acquisition, modeling, and analysis and interpretation. The performance of the proposed system is demonstrated through three models: clustering, decision tree and association rules. Results show that the proposed tool can provide efficient decision support (descriptive and predictive) that can be adapted to the requests of different RS users. An assessment of these results also indicates good performance of the developed tool.
Extraction, Transformation and Loading (ETL) is a notable subject in the optimization, management, improvement and acceleration of processes and operations in databases and data warehouses. Creating ETL processes is potentially one of the largest tasks in building a data warehouse, and it is a time-consuming and complicated procedure. Without optimization of these processes, implementing data warehouse projects is costly, complicated and time-consuming. The present paper combines parallelization methods with shared cache memory in distributed systems built on a data warehouse. According to the assessment conducted, the proposed method improved ETL execution time by 7.1% relative to the Kettle tool and by 7.9% relative to the Talend tool. Parallelization can therefore notably improve the ETL process, allowing the management and integration of big data to be carried out simply and at acceptable speed.
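The combination of parallel transformation and a shared cache can be illustrated with a minimal Python sketch. It is not the method evaluated in the abstract: the source rows, the `SharedLookupCache` class, and the chunking scheme are hypothetical, and threads on one machine stand in for a distributed system.

```python
from concurrent.futures import ThreadPoolExecutor
import threading

# Hypothetical source rows (customer name, amount); not taken from the paper.
SOURCE_ROWS = [("alice", 10.0), ("bob", 5.5), ("alice", 2.5), ("carol", 7.0)]


class SharedLookupCache:
    """Thread-safe cache that maps natural keys to surrogate keys."""

    def __init__(self):
        self._lock = threading.Lock()
        self._keys = {}

    def surrogate_key(self, natural_key):
        # The lock ensures two workers never assign different keys to one name.
        with self._lock:
            if natural_key not in self._keys:
                self._keys[natural_key] = len(self._keys) + 1
            return self._keys[natural_key]

    def size(self):
        with self._lock:
            return len(self._keys)


def transform(chunk, cache):
    """Transform step: replace names with surrogate keys, convert to cents."""
    return [(cache.surrogate_key(name), round(amount * 100)) for name, amount in chunk]


def run_etl(rows, workers=2, chunk_size=2):
    cache = SharedLookupCache()
    # Extract step: split the source into chunks that can be processed in parallel.
    chunks = [rows[i:i + chunk_size] for i in range(0, len(rows), chunk_size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        transformed = list(pool.map(lambda c: transform(c, cache), chunks))
    # Load step: flatten the chunks, in source order, into the target table.
    warehouse = [row for chunk in transformed for row in chunk]
    return warehouse, cache


warehouse, cache = run_etl(SOURCE_ROWS)
```

Every worker consults the same cache, so a key resolved by one worker is reused by the others instead of being looked up again. Which surrogate key a given name receives depends on thread scheduling, but each name always maps to a single key.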
In the telecommunications industry, information comes from different data sources, so many discrete, uncertain and heterogeneous data types cannot be loaded directly into a data warehouse, and there is no common ETL model. This paper investigates the ETL tools in the Teradata warehouse and designs a general ETL model for the telecom industry. Taking a telecom business analysis system as an example, the ETL model is verified and shown to improve data conversion efficiency and to generalize well.
A core issue of the decision-making process in the medical field is to support the execution of analytical (OLAP) similarity queries over images in data warehousing environments. In this paper, we focus on this issue. We propose imageDWE, a non-conventional data warehousing environment that enables the storage of intrinsic features taken from medical images in a data warehouse and supports OLAP similarity queries over them. To comply with this goal, we introduce the concept of perceptual layer, which is an abstraction used to represent an image dataset according to a given feature descriptor in order to enable similarity search. Based on this concept, we propose the imageDW, an extended data warehouse with dimension tables specifically designed to support one or more perceptual layers. We also detail how to build an imageDW and how to load image data into it. Furthermore, we show how to process OLAP similarity queries composed of a conventional predicate and a similarity search predicate that encompasses the specification of one or more perceptual layers. Moreover, we introduce an index technique to improve the OLAP query processing over images. We carried out performance tests over a data warehouse environment that consolidated medical images from exams of several modalities. The results demonstrated the feasibility and efficiency of our proposed imageDWE to manage images and to process OLAP similarity queries. The results also demonstrated that the use of the proposed index technique guaranteed a great improvement in query processing.
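A query that combines a conventional predicate with a similarity search predicate can be sketched in Python as follows. This is only an illustration under stated assumptions: the exam records, the `color_hist` layer name, the Euclidean distance, and the range-query form are hypothetical choices, not the imageDW schema or the index technique from the abstract.

```python
import math
from collections import Counter

# Hypothetical exam records. Each one holds conventional attributes plus one
# feature vector per perceptual layer (here a single made-up "color_hist" layer).
EXAMS = [
    {"hospital": "A", "modality": "MRI", "layers": {"color_hist": (0.1, 0.9)}},
    {"hospital": "A", "modality": "MRI", "layers": {"color_hist": (0.8, 0.2)}},
    {"hospital": "B", "modality": "MRI", "layers": {"color_hist": (0.15, 0.85)}},
    {"hospital": "B", "modality": "CT", "layers": {"color_hist": (0.1, 0.9)}},
]


def similarity_rollup(exams, modality, layer, query_vector, radius):
    """Count exams per hospital that match both predicates."""
    counts = Counter()
    for exam in exams:
        # Conventional predicate: filter on an ordinary dimension attribute.
        if exam["modality"] != modality:
            continue
        # Similarity predicate: range query in the chosen perceptual layer.
        if math.dist(exam["layers"][layer], query_vector) <= radius:
            counts[exam["hospital"]] += 1
    return dict(counts)


result = similarity_rollup(EXAMS, "MRI", "color_hist", (0.1, 0.9), 0.1)
```

The sketch scans every record, so its cost grows linearly with the number of images. An index over the feature vectors is meant to avoid exactly this kind of full scan.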
This article presents the multidimensional data model of two data marts that form part of a Decision Support System in the field of genomics, which is based on data warehouse and OLAP technologies. The first data mart concerns the "Analysis of information units"; it stores and supports queries on the information units (exon or intron) in the structure of a gene, together with their order and their start and end positions. The second data mart, called "Fractal analysis", stores and supports queries on genes, for example the number of information units and the gene length, along with additional measures obtained from the fractal analysis carried out in earlier research. Finally, the problems encountered during data loading and data modeling are presented, together with the proposed solutions and some interfaces of the developed tool.
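A data mart of information units like the first one described above could be laid out roughly as in the following Python sketch, which uses an in-memory SQLite database. The table names, column names, and sample gene are hypothetical; the abstract does not give the actual schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical star-style layout: one gene dimension and one fact table
# holding each information unit with its type, order, and positions.
conn.executescript("""
CREATE TABLE dim_gene (gene_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE fact_unit (
    gene_id    INTEGER REFERENCES dim_gene(gene_id),
    unit_type  TEXT CHECK (unit_type IN ('exon', 'intron')),
    unit_order INTEGER,
    start_pos  INTEGER,
    end_pos    INTEGER
);
""")

# Made-up sample data for a single gene.
conn.execute("INSERT INTO dim_gene VALUES (1, 'geneX')")
conn.executemany(
    "INSERT INTO fact_unit VALUES (?, ?, ?, ?, ?)",
    [
        (1, "exon", 1, 1, 100),
        (1, "intron", 2, 101, 250),
        (1, "exon", 3, 251, 300),
    ],
)

# OLAP-style aggregation: number of units and total length per gene and type.
rows = conn.execute("""
SELECT g.name, f.unit_type, COUNT(*), SUM(f.end_pos - f.start_pos + 1)
FROM fact_unit AS f JOIN dim_gene AS g USING (gene_id)
GROUP BY g.name, f.unit_type
ORDER BY f.unit_type
""").fetchall()
```

Keeping one fact row per information unit means that measures such as the number of units per gene can be derived by aggregation rather than stored separately.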
EXTRACTING PROCESS AND MAPPING MANAGEMENT FOR HETEROGENNOUS SYSTEMS
Hagara, Igor; Tanuška, Pavol; Duchovičová, Soňa
Vedecké práce Materiálovotechnologickej fakulty Slovenskej technickej univerzity v Bratislave so sídlom v Trnave, 12/2013, Volume 21, Issue 33
Journal Article, Peer reviewed, Open access
Many papers describe three common methods of selecting data from primary systems. This paper defines how to select the correct method, or combination of methods, to minimize the impact on the production system and its normal operation. Before using any method, it is necessary to know the primary system and its database structures, so that the existing data structure setup can be used optimally and the ETL process designed well. Database structures are usually categorized into groups that characterize their quality. This classification helps to find the ideal method for each group and thus to design an ETL process with minimal impact on the data warehouse and the production system.