DIKUL - logo
E-viri
Celotno besedilo
Recenzirano Odprti dostop
  • Ensemble learning-based cro...
    Brandt, Patric; Beyer, Florian; Borrmann, Peter; Möller, Markus; Gerighausen, Heike

    GIScience & remote sensing/GIScience and remote sensing, 12/2024, Letnik: 61, Številka: 1
    Journal Article

    Detailed and accurate statistics on crop productivity are key to inform decision-making related to sustainable food production and supply ensuring global food security. However, annual and high-resolution crop yield data provided by official agricultural statistics are generally lacking. Earth observation (EO) imagery, geodata on meteorological and soil conditions, as well as advances in machine learning (ML) provide huge opportunities for model-based crop yield estimation in terms of covering large spatial scales with unprecedented granularity. This study proposes a novel yield estimation approach that is bottom-up scalable from parcel to administrative levels by leveraging ML-ensembles, comprising of six regression estimators (base estimators), and multi-source geodata, including EO imagery. To ensure the approach’s robustness, two ensemble learning techniques are investigated, namely meta-learning through model stacking and majority voting. ML-ensembles were evaluated multi-annually and crop-specifically for three major winter crops, namely winter wheat (WW), winter barley (WB), and winter rapeseed (WR) in two German federal states, covering 140,000 to 155,000 parcels per year. ML-ensembles were evaluated at the parcel and district level for two German federal states against official yield reports, ranging from 2019 to 2022, based on metrics such as coefficient of determination (Formula: see text) and normalized root mean square error (Formula: see text). Overall, the most robustly performing ensemble learning technique was majority voting yielding Formula: see text and Formula: see text values of 0.74, 13.4% for WW, 0.68, 16.9% for WB, and 0.66, 14.1% for WR, respectively, through cross-validation at parcel level. At the district level, majority voting reached Formula: see text and Formula: see text ranges of 0.79–0.89, 7.2–8.1% for WW, 0.80–0.84, 6.0–9.9% for WB, and 0.60–0.78, 6.1–10.4% for WR, respectively. Capitalizing on ensemble learning-based majority voting, examples of unprecedented high-resolution crop yield maps at Formula: see text spatial resolution are presented. Implementing a scalable yield estimation approach, as proposed in this study, into crop yield reporting frameworks of public authorities mandated to provide official agricultural statistics would increase the spatial resolution of annually reported yields, eventually covering the entire cropland available. Such unprecedented data products delivered through map services may improve decision-making support for a variety of stakeholders across different spatial scales, ranging from parcel to higher administrative levels.