Peer-reviewed, open access
  • Random forest swarm optimiz...
    Asadi, Shahrokh; Roshan, SeyedEhsan; Kattan, Michael W.

    Journal of biomedical informatics, March 2021, Volume 115
    Journal Article

    • By combining multi-objective particle swarm optimization and Random Forest, a new approach is proposed to predict heart disease.
    • The main goal is to produce diverse and accurate classifiers and determine the (near) optimal number of classifiers.
    • The results indicate that the proposed algorithm outperforms the other techniques in terms of accuracy and statistical tests.

    Heart disease has been one of the leading causes of death worldwide in recent years. Among diagnostic methods for heart disease, angiography is one of the most common, but it is costly and has side effects. Given the difficulty of heart disease prediction, data mining can play an important role in predicting heart disease accurately. In this paper, by combining multi-objective particle swarm optimization (MOPSO) and Random Forest, a new approach is proposed to predict heart disease. The main goal is to produce diverse and accurate decision trees and determine the (near) optimal number of them simultaneously. In this method, an evolutionary multi-objective approach is used instead of the commonly used approach, i.e., bootstrap, feature selection in the Random Forest, and random number selection of training sets. As a result, different training sets with different samples and features are generated for training each tree. Also, the solutions obtained in the Pareto-optimal fronts determine the required number of training sets to build the random forest. In this way, the random forest's performance can be enhanced, and consequently the prediction accuracy is improved. The proposed method's effectiveness is investigated by comparing its performance over six heart datasets with individual and ensemble classifiers. The results suggest that the proposed method with the (near) optimal number of classifiers outperforms the random forest algorithm with different classifiers.
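
    The abstract states that the solutions on the Pareto-optimal front determine how many training sets, and therefore trees, the forest contains. The following is a minimal sketch of that selection step only, not the authors' implementation. It assumes each candidate training set has already been scored on two objectives to maximize, here called accuracy and diversity; the candidate labels and scores are invented for illustration, and the MOPSO search that would generate the candidates is omitted.

```python
from typing import List, Tuple

# Hypothetical candidate: (label, accuracy, diversity); both objectives are maximized.
Candidate = Tuple[str, float, float]


def dominates(a: Candidate, b: Candidate) -> bool:
    """Return True if a is no worse than b on both objectives and strictly better on one."""
    no_worse = a[1] >= b[1] and a[2] >= b[2]
    strictly_better = a[1] > b[1] or a[2] > b[2]
    return no_worse and strictly_better


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep only the candidates that no other candidate dominates."""
    return [
        c for c in candidates
        if not any(dominates(other, c) for other in candidates)
    ]


# Illustrative scores only (assumed, not taken from the paper).
candidates = [
    ("t1", 0.80, 0.30),
    ("t2", 0.75, 0.50),
    ("t3", 0.70, 0.40),  # dominated by t2 on both objectives
    ("t4", 0.85, 0.10),
    ("t5", 0.60, 0.60),
]

front = pareto_front(candidates)
n_trees = len(front)  # one tree would be trained per non-dominated training set
print([label for label, _, _ in front], n_trees)
```

    In this sketch the size of the forest is not a preset parameter: it falls out of how many candidates survive the dominance check.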