UNI-MB - logo
UMNIK - logo
 
(UM)
  • Artificially generated near-infrared spectral data for classification purposes
    Sem, Vilma ; Kolar, Jana ; Lusa, Lara
    Near-Infrared Spectroscopy has became a widely used analytical technique in different research fields due to its non-destructiveness and low-cost. The spectra are rich in information but extremely ... complex, therefore their analysis necessitates the use of advanced statistical methods. The empirical properties of the statistical methods can be assessed using artificially generated data that resemble real Near-Infrared Spectroscopy. In this paper we propose a new data generation approach (ABS) that takes into account the theoretical knowledge about the near-infrared absorption of the functional groups. The proposed method is compared to real data and to a simpler data generation method, which simulates the data from a multivariate normal distribution whose parameters are estimated from real data (MVNorig). The comparison between real data and the data generation approaches is based on a class-imbalanced classification problem using linear discriminant analysis, classification trees and support vector machines. Both simulation approaches generated spectra with a good resemblance to real data, MVNorig performing slightly better than ABS; using real and simulated data we would have reached similar conclusions about the class-imbalance problem in classification. Both methods can be used to artificially generate near-infrared spectra. The method based on multivariate normal distribution can be used when a large number of real data spectra are available, while the appropriateness of the results of the ABS method depend on the exactness of functional group near-infrared absorption knowledge.
    Source: Chemometrics and Intelligent Laboratory Systems. - ISSN 0169-7439 (Vol. 172, Jan. 2018, str. 100-108)
    Type of material - article, component part
    Publish date - 2018
    Language - english
    COBISS.SI-ID - 33505753