Although a key driver of Earth's climate system, global land-atmosphere energy fluxes are poorly constrained. Here we use machine learning to merge energy flux measurements from FLUXNET eddy ...covariance towers with remote sensing and meteorological data to estimate global gridded net radiation, latent and sensible heat and their uncertainties. The resulting FLUXCOM database comprises 147 products in two setups: (1) 0.0833° resolution using MODIS remote sensing data (RS) and (2) 0.5° resolution using remote sensing and meteorological data (RS + METEO). Within each setup we use a full factorial design across machine learning methods, forcing datasets and energy balance closure corrections. For RS and RS + METEO setups respectively, we estimate 2001-2013 global (±1 s.d.) net radiation as 75.49 ± 1.39 W m
and 77.52 ± 2.43 W m
, sensible heat as 32.39 ± 4.17 W m
and 35.58 ± 4.75 W m
, and latent heat flux as 39.14 ± 6.60 W m
and 39.49 ± 4.51 W m
(as evapotranspiration, 75.6 ± 9.8 × 10
km
yr
and 76 ± 6.8 × 10
km
yr
). FLUXCOM products are suitable to quantify global land-atmosphere interactions and benchmark land surface model simulations.
FLUXNET comprises globally distributed eddy-covariance-based estimates of carbon fluxes between the biosphere and the atmosphere. Since eddy covariance flux towers have a relatively small footprint ...and are distributed unevenly across the world, upscaling the observations is necessary to obtain global-scale estimates of biosphere-atmosphere exchange. Based on cross-consistency checks with atmospheric inversions, sun-induced fluorescence (SIF) and dynamic global vegetation models (DGVMs), here we provide a systematic assessment of the latest upscaling efforts for gross primary production (GPP) and net ecosystem exchange (NEE) of the FLUXCOM initiative, where different machine learning methods, forcing data sets and sets of predictor variables were employed.
Large interannual variations in the measured growth rate of atmospheric carbon dioxide (CO2) originate primarily from fluctuations in carbon uptake by land ecosystems13. It remains uncertain, ...however, to what extent temperature and water availability control the carbon balance of land ecosystems across spatial and temporal scales314. Here we use empirical models based on eddy covariance data15 and process-based models16,17 to investigate the effect of changes in temperature and water availability on gross primary productivity (GPP), terrestrial ecosystem respiration (TER) and net ecosystem exchange (NEE) at local and global scales. We find that water availability is the dominant driver of the local interannual variability in GPP and TER. To a lesser extent this is true also for NEE at the local scale, but when integrated globally, temporal NEE variability is mostly driven by temperature fluctuations. We suggest that this apparent paradox can be explained by two compensatory water effects. Temporal water-driven GPP and TER variations compensate locally, dampening water-driven NEE variability. Spatial water availability anomalies also compensate, leaving a dominant temperature signal in the year-to-year fluctuations of the land carbon sink. These findings help to reconcile seemingly contradictory reports regarding the importance of temperature and water in controlling the interannual variability of the terrestrial carbon balance36,9,11,12,14. Our study indicates that spatial climate covariation drives the global carbon cycle response.
Interactions between the biosphere and the atmosphere can be well characterized by fluxes between the two. In particular, carbon and energy fluxes play a major role in understanding biogeochemical ...processes on an ecosystem level or global scale. However, the fluxes can only be measured at individual sites, e.g., by eddy covariance towers, and an upscaling of these local observations is required to analyze global patterns. Previous work focused on upscaling monthly, 8-day, or daily average values, and global maps for each flux have been provided accordingly. In this paper, we raise the upscaling of carbon and energy fluxes between land and atmosphere to the next level by increasing the temporal resolution to subdaily timescales. We provide continuous half-hourly fluxes for the period from 2001 to 2014 at 0.5° spatial resolution, which allows for analyzing diurnal cycles globally. The data set contains four fluxes: gross primary production (GPP), net ecosystem exchange (NEE), latent heat (LE), and sensible heat (H). We propose two prediction approaches for the diurnal cycles based on large-scale regression models and compare them in extensive cross-validation experiments using different sets of predictor variables. We analyze the results for a set of FLUXNET tower sites showing the suitability of our approaches for this upscaling task. Finally, we have selected one approach to calculate the global half-hourly data products based on predictor variables from remote sensing and meteorology at daily resolution as well as half-hourly potential radiation. In addition, we provide a derived product that only contains monthly average diurnal cycles, which is a lightweight version in terms of data storage that still allows studying the important characteristics of diurnal patterns globally. We recommend to primarily use these monthly average diurnal cycles, because they are less affected by the impacts of day-to-day variation, observation noise, and short-term fluctuations on subdaily timescales compared to the full half-hourly flux products. The global half-hourly data products are available at https://doi.org/10.17871/BACI.224.
Wind turbines remove kinetic energy from the atmospheric flow, which reduces wind speeds and limits generation rates of large wind farms. These interactions can be approximated using a vertical ...kinetic energy (VKE) flux method, which predicts that the maximum power generation potential is 26% of the instantaneous downward transport of kinetic energy using the preturbine climatology. We compare the energy flux method to the Weather Research and Forecasting (WRF) regional atmospheric model equipped with a wind turbine parameterization over a 10⁵ km² region in the central United States. The WRF simulations yield a maximum generation of 1.1 Wₑ·m−2, whereas the VKE method predicts the time series while underestimating the maximum generation rate by about 50%. Because VKE derives the generation limit from the preturbine climatology, potential changes in the vertical kinetic energy flux from the free atmosphere are not considered. Such changes are important at night when WRF estimates are about twice the VKE value because wind turbines interact with the decoupled nocturnal low-level jet in this region. Daytime estimates agree better to 20% because the wind turbines induce comparatively small changes to the downward kinetic energy flux. This combination of downward transport limits and wind speed reductions explains why large-scale wind power generation in windy regions is limited to about 1 Wₑ·m−2, with VKE capturing this combination in a comparatively simple way.
Forests play a crucial role in the global carbon (C) cycle by storing and sequestering a substantial amount of C in the terrestrial biosphere. Due to temporal dynamics in climate and vegetation ...activity, there are significant regional variations in carbon dioxide (CO2) fluxes between the biosphere and atmosphere in forests that are affecting the global C cycle. Current forest CO2 flux dynamics are controlled by instantaneous climate, soil, and vegetation conditions, which carry legacy effects from disturbances and extreme climate events. Our level of understanding from the legacies of these processes on net CO2 fluxes is still limited due to their complexities and their long-term effects. Here, we combined remote sensing, climate, and eddy-covariance flux data to study net ecosystem CO2 exchange (NEE) at 185 forest sites globally. Instead of commonly used non-dynamic statistical methods, we employed a type of recurrent neural network (RNN), called Long Short-Term Memory network (LSTM) that captures information from the vegetation and climate's temporal dynamics. The resulting data-driven model integrates interannual and seasonal variations of climate and vegetation by using Landsat and climate data at each site. The presented LSTM algorithm was able to effectively describe the overall seasonal variability (Nash-Sutcliffe efficiency, NSE = 0.66) and across-site (NSE = 0.42) variations in NEE, while it had less success in predicting specific seasonal and interannual anomalies (NSE = 0.07). This analysis demonstrated that an LSTM approach with embedded climate and vegetation memory effects outperformed a non-dynamic statistical model (i.e. Random Forest) for estimating NEE. Additionally, it is shown that the vegetation mean seasonal cycle embeds most of the information content to realistically explain the spatial and seasonal variations in NEE. These findings show the relevance of capturing memory effects from both climate and vegetation in quantifying spatio-temporal variations in forest NEE.
Understanding the global carbon (C) cycle is of crucial importance to map current and future climate dynamics relative to global environmental change. A full characterization of C cycling requires ...detailed information on spatiotemporal patterns of surface–atmosphere fluxes. However, relevant C cycle observations are highly variable in their coverage and reporting standards. Especially problematic is the lack of integration of the carbon dioxide (CO2) exchange of the ocean, inland freshwaters and the land surface with the atmosphere. Here we adopt a data-driven approach to synthesize a wide range of observation-based spatially explicit surface–atmosphere CO2 fluxes from 2001 to 2010, to identify the state of today's observational opportunities and data limitations. The considered fluxes include net exchange of open oceans, continental shelves, estuaries, rivers, and lakes, as well as CO2 fluxes related to net ecosystem productivity, fire emissions, loss of tropical aboveground C, harvested wood and crops, as well as fossil fuel and cement emissions. Spatially explicit CO2 fluxes are obtained through geostatistical and/or remote-sensing-based upscaling, thereby minimizing biophysical or biogeochemical assumptions encoded in process-based models. We estimate a bottom-up net C exchange (NCE) between the surface (land, ocean, and coastal areas) and the atmosphere. Though we provide also global estimates, the primary goal of this study is to identify key uncertainties and observational shortcomings that need to be prioritized in the expansion of in situ observatories. Uncertainties for NCE and its components are derived using resampling. In many regions, our NCE estimates agree well with independent estimates from other sources such as process-based models and atmospheric inversions. This holds for Europe (mean ± 1 SD: 0.8 ± 0.1 PgC yr−1, positive numbers are sources to the atmosphere), Russia (0.1 ± 0.4 PgC yr−1), East Asia (1.6 ± 0.3 PgC yr−1), South Asia (0.3 ± 0.1 PgC yr−1), Australia (0.2 ± 0.3 PgC yr−1), and most of the Ocean regions. Our NCE estimates give a likely too large CO2 sink in tropical areas such as the Amazon, Congo, and Indonesia. Overall, and because of the overestimated CO2 uptake in tropical lands, our global bottom-up NCE amounts to a net sink of −5.4 ± 2.0 PgC yr−1. By contrast, the accurately measured mean atmospheric growth rate of CO2 over 2001–2010 indicates that the true value of NCE is a net CO2 source of 4.3 ± 0.1 PgC yr−1. This mismatch of nearly 10 PgC yr−1 highlights observational gaps and limitations of data-driven models in tropical lands, but also in North America. Our uncertainty assessment provides the basis for setting priority regions where to increase carbon observations in the future. High on the priority list are tropical land regions, which suffer from a lack of in situ observations. Second, extensive pCO2 data are missing in the Southern Ocean. Third, we lack observations that could enable seasonal estimates of shelf, estuary, and inland water–atmosphere C exchange. Our consistent derivation of data uncertainties could serve as prior knowledge in multicriteria optimization such as the Carbon Cycle Data Assimilation System (CCDAS) and atmospheric inversions, without over- or under-stating bottom-up data credibility. In the future, NCE estimates of carbon sinks could be aggregated at national scale to compare with the official national inventories of CO2 fluxes in the land use, land use change, and forestry sector, upon which future emission reductions are proposed.
Data analysis and model-data comparisons in the environmental sciences require diagnostic measures that quantify time series dynamics and structure, and are robust to noise in observational data. ...This paper investigates the temporal dynamics of environmental time series using measures quantifying their information content and complexity. The measures are used to classify natural processes on one hand, and to compare models with observations on the other. The present analysis focuses on the global carbon cycle as an area of research in which model-data integration and comparisons are key to improving our understanding of natural phenomena. We investigate the dynamics of observed and simulated time series of Gross Primary Productivity (GPP), a key variable in terrestrial ecosystems that quantifies ecosystem carbon uptake. However, the dynamics, patterns and magnitudes of GPP time series, both observed and simulated, vary substantially on different temporal and spatial scales. We demonstrate here that information content and complexity, or Information Theory Quantifiers (ITQ) for short, serve as robust and efficient data-analytical and model benchmarking tools for evaluating the temporal structure and dynamical properties of simulated or observed time series at various spatial scales. At continental scale, we compare GPP time series simulated with two models and an observations-based product. This analysis reveals qualitative differences between model evaluation based on ITQ compared to traditional model performance metrics, indicating that good model performance in terms of absolute or relative error does not imply that the dynamics of the observations is captured well. Furthermore, we show, using an ensemble of site-scale measurements obtained from the FLUXNET archive in the Mediterranean, that model-data or model-model mismatches as indicated by ITQ can be attributed to and interpreted as differences in the temporal structure of the respective ecological time series. At global scale, our understanding of C fluxes relies on the use of consistently applied land models. Here, we use ITQ to evaluate model structure: The measures are largely insensitive to climatic scenarios, land use and atmospheric gas concentrations used to drive them, but clearly separate the structure of 13 different land models taken from the CMIP5 archive and an observations-based product. In conclusion, diagnostic measures of this kind provide data-analytical tools that distinguish different types of natural processes based solely on their dynamics, and are thus highly suitable for environmental science applications such as model structural diagnostics.
Today, many processes at the Earth's surface are constantly monitored by multiple data streams. These observations have become central to advancing our understanding of vegetation dynamics in ...response to climate or land use change. Another set of important applications is monitoring effects of extreme climatic events, other disturbances such as fires, or abrupt land transitions. One important methodological question is how to reliably detect anomalies in an automated and generic way within multivariate data streams, which typically vary seasonally and are interconnected across variables. Although many algorithms have been proposed for detecting anomalies in multivariate data, only a few have been investigated in the context of Earth system science applications. In this study, we systematically combine and compare feature extraction and anomaly detection algorithms for detecting anomalous events. Our aim is to identify suitable workflows for automatically detecting anomalous patterns in multivariate Earth system data streams. We rely on artificial data that mimic typical properties and anomalies in multivariate spatiotemporal Earth observations like sudden changes in basic characteristics of time series such as the sample mean, the variance, changes in the cycle amplitude, and trends. This artificial experiment is needed as there is no gold standard for the identification of anomalies in real Earth observations. Our results show that a well-chosen feature extraction step (e.g., subtracting seasonal cycles, or dimensionality reduction) is more important than the choice of a particular anomaly detection algorithm. Nevertheless, we identify three detection algorithms (k-nearest neighbors mean distance, kernel density estimation, a recurrence approach) and their combinations (ensembles) that outperform other multivariate approaches as well as univariate extreme-event detection methods. Our results therefore provide an effective workflow to automatically detect anomalies in Earth system science data.
Tropical ecosystems experience particularly fast transformations largely as a consequence of land use and climate change. Consequences for ecosystem functioning and services are hard to predict and ...require analyzing multiple data sets simultaneously. Today, we are equipped with a wide range of spatio-temporal observation-based data streams that monitor the rapid transformations of tropical ecosystems in terms of state variables (e.g., biomass, leaf area, soil moisture) but also in terms of ecosystem processes (e.g., gross primary production, evapotranspiration, runoff). However, the underexplored joint potential of such data streams, combined with deficient access to data and processing, constrain our understanding of ecosystem functioning, despite the importance of tropical ecosystems in the regional-to-global carbon and water cycling. Our objectives are: 1. To facilitate access to regional “Analysis Ready Data Cubes” and enable efficient processing 2. To contribute to the understanding of ecosystem functioning and atmosphere-biosphere interactions. 3. To get a dynamic perspective of environmental conditions for biodiversity. To achieve our objectives, we developed a regional variant of an “Earth System Data Lab” (RegESDL) tailored to address the challenges of northern South America. The study region extensively covers natural ecosystems such as rainforest and savannas, and includes strong topographic gradients (0–6,500 masl). Currently, environmental threats such as deforestation and ecosystem degradation continue to increase. In this contribution, we show the value of the approach for characterizing ecosystem functioning through the efficient implementation of time series and dimensionality reduction analysis at pixel level. Specifically, we present an analysis of seasonality as it is manifested in multiple indicators of ecosystem primary production. We demonstrate that the RegESDL has the ability to underscore contrasting patterns of ecosystem seasonality and therefore has the potential to contribute to the characterization of ecosystem function. These results illustrate the potential of the RegESDL to explore complex land-surface processes and the need for further exploration. The paper concludes with some suggestions for developing future big-data infrastructures and its applications in the tropics.