Annual Mean PM2.5 Components (EC, NH4, NO3, OC, SO4) 50m Urban and 1km Non-Urban Area Grids for Contiguous U.S., 2000-2019, v1

Overview

Particulate matter with aerodynamic diameter less than 2.5 μm (PM2.5) increases mortality and morbidity. PM2.5 is composed of a mixture of chemical components that vary across space and time. Due to limited hyperlocal data availability, less is known about health risks of PM2.5 components, their US-wide exposure disparities, or which species are driving the biggest intra-urban changes in PM2.5 mass. The first national super-learned models were developed across the US for hyperlocal estimation of annual mean elemental carbon (EC), ammonium (NH4+), nitrate (NO3-), organic carbon (OC), and sulfate (SO42-) concentrations across 3,535 urban areas at a 50-m spatial resolution, and at a 1-km resolution for non-urban areas from 2000 to 2019. Using ensembles of machine learning models and ~82 billion predictions across 20 years, hyperlocal super-learned PM2.5 components are now available for further research. Remarkable spatiotemporal intra-urban and inter-urban variabilities were found in PM2.5 components. It is anticipated for this work to be a critical milestone for conducting new studies on individual and combined health risks of PM2.5 components, environmental justice analysis, or understanding fine-scale spatiotemporal variabilities of PM2.5 composition. Urban planners and regulators may also use these predictions for selecting locations of new daycares, schools, nursing homes, or air-quality monitors.

Methods

This work aimed to predict annual mean levels of several major PM2.5 components that contribute substantially to PM2.5 mass (EC, NH4+, NO3-, OC, and SO42-) across 20 years from 2000-2019 using machine-learning models (ML), combined using either a generalized additive model (GAM) ensemble geographically-weighted-averaging (GAM-ENWA) or super-learning (SL) in the contiguous United States. Since majority of the US population (~80%) lives in urban areas, models were developed for urban areas at 50 m spatial resolution across 3,535 urban areas, and in non-urban areas at 1 km spatial resolution. Authors divided monitoring data into training and test sets, developed multiple ML models using the training set, predicted for each grid using each ML model, and ensembled the predictions of the ML models. The best model was chosen based on the performance in the test set.

Download

Download the Annual Mean PM2.5 Components (EC, NH4, NO3, OC, SO4) 50m Urban and 1km Non-Urban Area Grids for Contiguous U.S., 2000-2019 v1. The files in the table below are in RDS format. The urban file sizes range from approximately 2.5 to 3.8 GB and the non-urban file sizes are about 150 MB.

  Elemental Carbon (EC) Ammonium (NH4+) Nitrate (NO3-) Organic Carbon (OC) Sulfate (SO42-)
2000

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2001

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2002

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2003

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2004

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2005

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2006

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2007

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2008

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2009

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2010

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2011

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2012

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2013

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2014

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2015

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2016

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2017

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2018

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

2019

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Urban

Non-Urban

Citation

Amini, H.1, 2*, M. Danesh-Yazdi1, Q. Di3, W. Requia4, Y. Wei1, Y. AbuAwad5, L. Shi6, M. Franklin7, C.-M. Kang1, J. M. Wolfson1, P. James8,1, R. Habre9, Q. Zhu6, J. S. Apte10,11, Z. J. Andersen2, X. Xing12, C. Hultquist12,13, I. Kloog14, F. Dominici1,15, P. Koutrakis1, J. Schwartz1. 2022. Annual Mean PM2.5 Components (EC, NH4, NO3, OC, SO4) 50m Urban and 1km Non-Urban Area Grids for Contiguous U.S., 2000-2019 v1. (Preliminary Release). Palisades, NY: NASA Socioeconomic Data and Applications Center (SEDAC). https://doi.org/10.7927/7wj3-en73. Accessed DAY MONTH YEAR.

1Harvard T.H. Chan School of Public Health, Boston, MA, United States
2Department of Public Health, University of Copenhagen, Copenhagen, Denmark
3Vanke School of Public Health, Tsinghua University, Beijing, China
4Fundação Getúlio Vargas, Brasilia, Brazil
5PERFORM Centre, Concordia University, Montreal, Quebec, Canada
6Gangarosa Department of Environmental Health, Rollins School of Public Health, Emory University, Atlanta, GA, United States
7Department of Statistical Sciences, University of Toronto, Toronto, Canada
8Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, MA, United States
9Keck School of Medicine, University of Southern California, Los Angeles, CA, United States
10Department of Civil and Environmental Engineering, University of California, Berkeley, CA, United States
11School of Public Health, University of California, Berkeley, CA, United States
12The Center for International Earth Science Information Network (CIESIN), Columbia University, Palisades, NY, United States
13School of Earth and Environment, University of Canterbury, Christchurch, New Zealand
14Icahn School of Medicine at Mount Sinai, NY, United States
15Harvard Data Science Initiative, Cambridge, MA, United States

Acknowledgement

This project was supported by Cyprus Harvard Endowment Program for the Environment and Public Health, US Environment Protection Agency grant RD-8358720, National Institute of Health grant P30 ES000002 and R01 ES032418-01.

Disclaimer

This is a preliminary open data release, pending peer review of the associated journal article. Following the peer review process, data curation will be completed by the NASA Socioeconomic Data and Applications Center (SEDAC) and the data will be disseminated through the SEDAC catalog.