Skip to content

Data, code, and methodology supporting the BuzzFeed News' analysis of COVID-19 ZIP codes and demographic trends.

Notifications You must be signed in to change notification settings

BuzzFeedNews/2020-05-covid-city-zip-codes

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

US City COVID-19 ZIP Code Analysis

This repository contains data and code supporting a BuzzFeed News article about city-level ZIP code demographics and COVID-19 cases, published May 7, 2020. See below for details.

Data

ZIP code–level data

The analysis uses ZIP–code level case counts (as of May 4, 2020 for each city except for Detroit, which is as of May 7, 2020) for the following five cities, stored in the data/raw directory:

City ZIP shapefiles

The data/raw directory also includes ZIP-code level shapefiles for each of the five cities. Those geospatial files come from each city's open data portals and are used to filter for the appropriate ZIP codes and create the maps that are included in the article.

Census data

The demographic data used in the analysis comes from the 5-year ACS estimates for 2018 at the ZCTA level. The data file included in this repository has been pre-processed from seven different data files that are not included here, in order to reduce the size of the raw data in this repository.

CBSA and county data

The data/county-data directory contains several datasets relevant to the metro-area calculations described below. The datasets are:

Analysis

Analyze ZIP code–level COVID-19 case / demographic correlations

The city-demographic-factors-analysis.ipynb notebook loads the data for each city, calculates correlations between various demographic factors and per capita case counts (at a ZIP code level), and graphically explores some of those correlations. It also outputs the GeoJSON and CSV files used to create the maps and scatterplots in the story.

Calculate population, cases, and deaths in largest US metro areas

The calculate-metro-area-proportions.ipynb notebook uses the county-level data described above to calculate the proportion of population, COVID-19 cases, and COVID-19 deaths in the United States' 15 largest metro areas, relative to US totals.

Licensing

All code in this repository is available under the MIT License. Files in the output/ directory are available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.

Contact

If you have any questions about this repository you can reach out to John Templon at [email protected].

Looking for more from BuzzFeed News? Click here for a list of our open-sourced projects, data, and code.

About

Data, code, and methodology supporting the BuzzFeed News' analysis of COVID-19 ZIP codes and demographic trends.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published