UNI-MB - logo
UMNIK - logo
 
E-viri
Recenzirano Odprti dostop
  • Variable Clustering in High...
    Yengo, Loïc; Jacques, Julien; Biernacki, Christophe; Canouil, Mickael

    The R journal, 2016, Letnik: 8, Številka: 1
    Journal Article

    Dimension reduction is one of the biggest challenge in high-dimensional regression models. We recently introduced a new methodology based on variable clustering as a means to reduce dimensionality. We introduce here an R package that implements two enhancements regarding the latter methodology. First, an improvement in computational time for estimating the parameters is presented. As a second enhancement, users of our method are now allowed to constrain the model to identify variables with weak or no effect on the response. An overview of the package functionalities as well as examples to run an analysis are described. Numerical experiments on simulated and real data were performed to illustrate the gain of computational time and the good predictive performance of our method compared to standard dimension reduction approaches.