SORA

Advancing, promoting and sharing knowledge of health through excellence in teaching, clinical practice and research into the prevention and treatment of illness

Development of Europe-Wide Models for Particle Elemental Composition Using Supervised Linear Regression and Random Forest.

Chen, J; de Hoogh, K; Gulliver, J; Hoffmann, B; Hertel, O; Ketzel, M; Weinmayr, G; Bauwelinck, M; van Donkelaar, A; Hvidtfeldt, UA; et al. Chen, J; de Hoogh, K; Gulliver, J; Hoffmann, B; Hertel, O; Ketzel, M; Weinmayr, G; Bauwelinck, M; van Donkelaar, A; Hvidtfeldt, UA; Atkinson, R; Janssen, NAH; Martin, RV; Samoli, E; Andersen, ZJ; Oftedal, BM; Stafoggia, M; Bellander, T; Strak, M; Wolf, K; Vienneau, D; Brunekreef, B; Hoek, G (2020) Development of Europe-Wide Models for Particle Elemental Composition Using Supervised Linear Regression and Random Forest. Environ Sci Technol, 54 (24). pp. 15561-16260. ISSN 1520-5851 https://doi.org/10.1021/acs.est.0c06595
SGUL Authors: Atkinson, Richard William

[img]
Preview
PDF Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (5MB) | Preview

Abstract

We developed Europe-wide models of long-term exposure to eight elements (copper, iron, potassium, nickel, sulfur, silicon, vanadium, and zinc) in particulate matter with diameter <2.5 μm (PM2.5) using standardized measurements for one-year periods between October 2008 and April 2011 in 19 study areas across Europe, with supervised linear regression (SLR) and random forest (RF) algorithms. Potential predictor variables were obtained from satellites, chemical transport models, land-use, traffic, and industrial point source databases to represent different sources. Overall model performance across Europe was moderate to good for all elements with hold-out-validation R-squared ranging from 0.41 to 0.90. RF consistently outperformed SLR. Models explained within-area variation much less than the overall variation, with similar performance for RF and SLR. Maps proved a useful additional model evaluation tool. Models differed substantially between elements regarding major predictor variables, broadly reflecting known sources. Agreement between the two algorithm predictions was generally high at the overall European level and varied substantially at the national level. Applying the two models in epidemiological studies could lead to different associations with health. If both between- and within-area exposure variability are exploited, RF may be preferred. If only within-area variability is used, both methods should be interpreted equally.

Item Type: Article
Additional Information: Copyright © 2020 American Chemical Society This is an open access article published under a Creative Commons Non-Commercial No Derivative Works (CC-BY-NC-ND) Attribution License, which permits copying and redistribution of the article, and creation of adaptations, all for non-commercial purposes.
Keywords: MD Multidisciplinary, Environmental Sciences
SGUL Research Institute / Research Centre: Academic Structure > Population Health Research Institute (INPH)
Journal or Publication Title: Environ Sci Technol
ISSN: 1520-5851
Language: eng
Dates:
DateEvent
15 December 2020Published
25 November 2020Published Online
13 November 2020Accepted
Publisher License: Creative Commons: Attribution-Noncommercial-No Derivative Works 4.0
Projects:
Project IDFunderFunder ID
R-82811201Environmental Protection Agencyhttp://dx.doi.org/10.13039/501100001589
201606010329China Scholarship Councilhttp://dx.doi.org/10.13039/501100004543
PubMed ID: 33237771
Go to PubMed abstract
URI: https://openaccess.sgul.ac.uk/id/eprint/112666
Publisher's version: https://doi.org/10.1021/acs.est.0c06595

Actions (login required)

Edit Item Edit Item