UNI-MB - logo
UMNIK - logo
 
E-viri
Celotno besedilo
Recenzirano Odprti dostop
  • Controlling extrapolations ...
    Navarro Pérez, Rodrigo; Schunck, Nicolas

    Physics letters. B, 10/2022, Letnik: 833
    Journal Article

    Predictions of nuclear properties far from measured data are inherently inaccurate because of uncertainties in our knowledge of nuclear forces and in our treatment of quantum many-body effects in strongly-interacting systems. While the model bias can be directly calculated when experimental data is available, only an estimate can be made in the absence of such measurements. Current approaches to compute the estimated bias quickly lose predictive power when their input variables are taken far from the training region, resulting in uncontrolled uncertainties in applications such as nucleosynthesis simulations. In this letter, we present a novel technique to identify the input variables of machine learning algorithms that can provide robust estimates of model bias. Our process is based on selecting input variables, or features, based on their probability distribution functions across the entire nuclear chart. We illustrate our approach on the problem of quantifying the model bias in nuclear binding energies calculated with Density Functional Theory (DFT). We prove that building model biases with only Z and N as features leads to highly unreliable extrapolations. Conversely, we show that proper feature selection can systematically improve theoretical predictions without increasing uncertainties.