Constructing transferable and interpretable machine learning models for black carbon concentrations

E-viri

Recenzirano Odprti dostop

Constructing transferable and interpretable machine learning models for black carbon concentrations

Fung, Pak Lun; Savadkoohi, Marjan; Zaidan, Martha Arbayani; Niemi, Jarkko V.; Timonen, Hilkka; Pandolfi, Marco; Alastuey, Andrés; Querol, Xavier; Hussein, Tareq; Petäjä, Tuukka

Environment international, February 2024, 2024-Feb, 2024-02-00, 20240201, 2024-02-01, Letnik: 184

Journal Article

•We evaluated multiple machine learning models to estimate black carbon concentration.•BC correlates well with accumulation mode and nitrogen dioxide at the studied sites.•The model trained in Barcelona shows good accuracy in other European cities.•The model trained at urban background works well at traffic sites.•We calculated the static and dynamic relative importance to explain black-box models. Black carbon (BC) has received increasing attention from researchers due to its adverse health effects. However, in-situ BC measurements are often not included as a regulated variable in air quality monitoring networks. Machine learning (ML) models have been studied extensively to serve as virtual sensors to complement the reference instruments. This study evaluates and compares three white-box (WB) and four black-box (BB) ML models to estimate BC concentrations, with the focus to show their transferability and interpretability. We train the models with the long-term air pollutant and weather measurements in Barcelona urban background site, and test them in other European urban and traffic sites. Despite the difference in geographical locations and measurement sites, BC correlates the strongest with particle number concentration of accumulation mode (PNacc, r = 0.73–0.85) and nitrogen dioxide (NO2, r = 0.68–0.85) and the weakest with meteorological parameters. Due to its similarity of correlation behaviour, the ML models trained in Barcelona performs prominently at the traffic site in Helsinki (R2 = 0.80–0.86; mean absolute error MAE = 3.90–4.73 %) and at the urban background site in Dresden (R2 = 0.79–0.84; MAE = 4.23–4.82 %). WB models appear to explain less variability of BC than BB models, long short-term memory (LSTM) model of which outperforms the rest of the models. In terms of interpretability, we adopt several methods for individual model to quantify and normalize the relative importance of each input feature. The overall static relative importance commonly used for WB models demonstrate varying results from the dynamic values utilized to show local contribution used for BB models. PNacc and NO2 on average have the strongest absolute static contribution; however, they simultaneously impact the estimation positively and negatively at different sites. This comprehensive analysis demonstrates that the possibility of these interpretable air pollutant ML models to be transfered across space and time.

Išči dalje

Avtor

Dostop do baze podatkov JCR je dovoljen samo uporabnikom iz Slovenije. Vaš trenutni IP-naslov ni na seznamu dovoljenih za dostop, zato je potrebna avtentikacija z ustreznim računom AAI.

Leto	Faktor vpliva		Izdaja		Kategorija		Razvrstitev
Leto	JCR	SNIP	JCR	SNIP	JCR	SNIP	JCR	SNIP

Povezave do osebnih bibliografij avtorjev	Povezave do podatkov o raziskovalcih v sistemu SICRIS

Vir: Osebne bibliografije in: SICRIS

Naloži sliko

Vnos na polico

Dodajanje gradiva na polico je uspelo.

Dodajanje gradiva na polico je spodletelo.

Dodajanje gradiva na polico ni bilo potrebno.

Trajna povezava

E-pošta

Faktor vpliva

Izberite knjižnično izkaznico:

Baze podatkov, v katerih je revija indeksirana

Citiranje

Tema