Feature selection in the data with different types of feature values, i.e., the heterogeneous or mixed data, is especially of practical importance because such types of data sets widely exist in real ...world. The key issue for feature selection in mixed data is how to properly deal with different types of the features or attributes in the data set. Motivated by the fuzzy rough set theory which allows different fuzzy relations to be defined for different types of attributes to measure the similarity between objects and in view of the effectiveness of entropy to measure information uncertainty, we propose in this paper a fuzzy rough set-based information entropy for feature selection in a mixed data set. It is proved that the newly-defined entropy meets the common requirement of monotonicity and can equivalently characterize the existing attribute reductions in the fuzzy rough set theory. Then, a feature selection algorithm is formulated based on the proposed entropy and a filter-wrapper method is suggested to select the best feature subset in terms of classification accuracy. An extensive numerical experiment is further conducted to assess the performance of the feature selection method and the results are satisfactory.
•A novel fuzzy rough set-based information entropy is constructed for mixed data.•The proposed entropy can equivalently characterize the existing attribute reductions in the fuzzy rough set theory.•A feature selection algorithm is formulated based on the proposed entropy.•A filter-wrapper method is suggested to select a best feature subset.
•Proposing a novel integration of Z numbers and Best Worst Method.•The method results in lower inconsistency.•The uncertainty of the real word decisions is considered in the proposed method.
Best ...Worst Method (BWM) has recently been proposed as a method for Multi Criteria Decision Making (MCDM). Studies show that BWM compared with other methods such as Analytic Hierarchy Process (AHP), leads to lower inconsistency of the results while reducing the number of required pairwise comparisons. MCDM methods such as BWM require accurate information. However, it often happens in practice that a level of uncertainty accompanies the information. The main aim of this paper is to address this problem and provide an integration of BWM and Z-numbers, namely ZBWM. Providing BWM with Z-numbers enables the BWM method to handle the uncertainty of information of a multi-criteria decision. Additionally, the capabilities of the proposed method in the process of utilizing the linguistic information dealing with big data are highlighted. The proposed method is examined to address a supplier development problem. By experimental results, we show that ZBWM results lower inconsistency when compared with BWM. A Z-number contains subjectivity in its fuzzy part, which can be addressed in future applications of ZBWM.
Abstract
Psosets and trellises are generalizations of posets and lattices respectively. In fact, these notions are introduced independently by E. Fried and H. L. Skala. It is well known that a graph ...can be used to describe a partial order. In this paper, the concepts of modular, weakly distributive and normal trellises are introduced. We have proved that a trellis satisfying sheering property is modular. It is also proved that every strongly connected trellis is nonmodular. It is shown that every separating element of a trellis is modular and every modular element is weakly separating. Well-known result of M.H. Stone namely “Every maximal ideal of a distributive lattice is prime” is generalized to trellises. It is proved that a relatively complemented trellis is a lattice if and only if it is normal.
Attribute reduction is one of the biggest challenges encountered in computational intelligence, data mining, pattern recognition, and machine learning. Effective in feature selection as the rough set ...theory is, it can only handle symbolic attributes. In order to overcome this drawback, the fuzzy rough set model is proposed, which is an extended model of rough sets and is able to deal with imprecision and uncertainty in both symbolic and numerical attributes. The existing attribute selection algorithms based on the fuzzy rough set model mainly take the angle of "attribute set," which means they define the object function representing the predictive ability for an attribute subset with regard to the domain of discourse, rather than following the view of an "object pair." Algorithms from the viewpoint of the object pair can ignore the object pairs that are already discerned by the selected attribute subsets and, thus, need only to deal with part of object pairs instead of the whole object pairs from the discourse, which makes such algorithms more efficient in attribute selection. In this paper, we propose the concept of reduced maximal discernibility pairs, which directly adopts the perspective of the object pair in the framework of the fuzzy rough set model. Then, we develop two attribute selection algorithms, named as reduced maximal discernibility pairs selection and weighted reduced maximal discernibility pair selection, based on the reduced maximal discernibility pairs. Experiment results show that the proposed algorithms are effective and efficient in attribute selection.
Feature selection aims to select a feature subset from an original feature set based on a certain evaluation criterion. Since feature selection can achieve efficient feature reduction, it has become ...a key method for data preprocessing in many data mining tasks. Recently, many feature selection strategies have been developed since in most cases it is infeasible to obtain an optimal/reduced feature subset by using exhaustive search. Among these strategies, fuzzy rough set theory has proved to be an ideal candidate for dealing with uncertain information. This article provides a comprehensive review on the fuzzy rough set theory and two fuzzy rough set theory based feature selection methods, that is, fuzzy rough set based feature selection methods and fuzzy rough neural network based feature selection methods. We review the publications related to the fuzzy rough theory and its applications in feature selection. In addition, the challenges in the two types of feature selection methods are also discussed.
This article is categorized under:
Technologies > Machine Learning
A common framework of fuzzy rough set theory based feature selection approaches.
By introducing the new concepts of fuzzy β-covering and fuzzy β-neighborhood, we define two new types of fuzzy covering rough set models which can be regarded as bridges linking covering rough set ...theory and fuzzy rough set theory. We show the properties of the two models, and reveal the relationships between the two models and some others. Moreover, we present the matrix representations of the newly defined lower and upper approximation operators so that the calculation of lower and upper approximations of subsets can be converted into operations on matrices. Finally, we generalize the models and their matrix representations to L-fuzzy covering rough sets which are defined over fuzzy lattices.
Wind speed forecasting is still a challenge due to the stochastic and highly varying characteristics of wind. In this paper, a graph deep learning model is proposed to learn the powerful ...spatio-temporal features from the wind speed and wind direction data in neighboring wind farms. The underlying wind farms are modeled by an undirected graph, where each node corresponds to a wind site. For each node, temporal features are extracted using a long short-term memory Network. A scalable graph convolutional deep learning architecture (GCDLA), motivated by the localized first-order approximation of spectral graph convolutions, leverages the extracted temporal features to forecast the wind-speed time series of the whole graph nodes. The proposed GCDLA captures spatial wind features as well as deep temporal features of the wind data at each wind site. To further improve the prediction accuracy and capture robust latent representations, the rough set theory is incorporated with the proposed graph deep network by introducing upper and lower bound parameter approximations in the model. Simulation results show the advantages of capturing deep spatial and temporal interval features in the proposed framework compared to the state-of-the-art deep learning models as well as shallow architectures in the recent literature.