Where Do These Data Come From?

Data for this site is compiled from a number of sources on a daily basis:

  1. County and state COVID-19 data - the New York Times’ COVID-19 Project
  2. State health disparities data - the Missouri Department of Health and Senior Services
  3. St. Louis area health disparities data - the City of St. Louis and St. Louis County
  4. St. Louis area zip code data:
  5. St. Louis area hospitalization data - the St. Louis Metropolitan Pandemic Task Force
  6. Kansas City area zip code data:
  7. Demographic data - the U.S. Census Bureau’s American Community Survey 5-year estimates (2014-2018)

See the Technical Notes page for additional details.

Why Do Numbers Not Match?

Due to lags in reporting, both at the public health department level and at the New York Times itself, case and mortality numbers may lag behind other figures reported in the media. At the same time, because the New York Times obtains data from multiple sources, some case and mortality numbers may actually be higher than what is reported by the state.

What Are Probable Cases?

On May 6th, the New York Times began including probable cases - those without a confirmatory test result but where symptoms indicate a COVID-19 infection. Not all counties report probable cases, however.

What are Per Capita Rates?

In order to compare rates across neighborhoods, cities, states, or countries, use utilize per capita statistics. When we calculate a “rate per capita” or a “per capita rate,” we take our rate and multiply it by a set number of individuals. This helps us ensure that our data are expressed relative to the size of the population they describe. For example, we typically calculate infant mortality rate by multiplying a given jurisdiction’s rate by 1,000. In the United States, we typically calculate per capita crime rates by taking a city’s or state’s rate, and multiplying it by 100,000.

Calculating rates is particularly important for mapping these quantities. This is sometimes called “normalization” by cartographers. By ensuring that we adjust the values mapped for the number of people in a given area, we ensure that our map is not just reflecting variations in population between jurisdictions.

What are Log Plots?

The website Datawrapper has a great explainer on how to read log plots. The key takeaway from their explanation is that “Log scales show relative values instead of absolute ones.” Instead of showing absolute change, they show us the rate at which change happens. These are useful for comparing how COVID-19, for example, spreads between multiple counties. The steeper the line for a particular jurisdiction, the faster the rate of growth.

What are Rolling Averages?

A rolling average is a technique for calculating an average for values for different “windows.” All of the plots on this site use a seven day rolling average, meaning that the the value of a given day’s average is calculated from the prior seven days’ values. This has the effect of “smoothing” out large daily variations, for example, in the number of new COVID cases or hospitalizations reported. Doing so gives us a better sense of the trend - if cases climbing or falling, for example.

Why Do You Treat Joplin and Kansas City as Counties?

Since both cities have their own health departments, and straddle multiple counties, both the State of Missouri and the New York Times have been releasing data on infections and deaths exclusive of the counties they lie in. This has been true for Kansas City since the beginning of the pandemic, and true for Joplin since June 25th. I have adjusted the populations for both the cities themselves and the surrounding counties to provide accurate per capita rates, and have also created maps that show Joplin and Kansas City as their own entities.

What is a Metropolitan Statistical Area?

A Metropolitan Statistical Area, or a “MSA,” is a region defined by the U.S. Office of Management and Budget with the U.S. Census Bureau:

The general concept of a metropolitan or micropolitan statistical area is that of a core area containing a substantial population nucleus, together with adjacent communities having a high degree of economic and social integration with that core.

You can read more about MSAs on the Census Bureau’s website.

This site uses these pre-existing definitions for all data involving “metro” areas, with the county-equivalents for Joplin and Kansas City added to their respective metros. There are a total of eight MSAs in Missouri, several of which include counties in adjacent states:

  1. Cape Girardeau, MO-IL Metro Area
  2. Columbia, MO Metro Area
  3. Jefferson City, MO Metro Area
  4. Joplin, MO Metro Area
  5. Kansas City, MO-KS Metro Area
  6. Springfield, MO Metro Area
  7. St. Joseph, MO-KS Metro Area
  8. St. Louis, MO-IL Metro Area


Can I Reuse Your Data or Graphics?

Absolutely! All code, data, website content, and images are available under a CC-BY 4.0 license. This means that you can use these data and graphics as long as you tell your audience where they came from. Graphics should not be altered to remove the caption that indicates who made them and where the underlying data came from.

How Do I Download Your Data?

The easiest way to download them is to go to the GitHub repository for this project and click on the green Code button. The menu that appears will give you an option to download a .zip file that contains all of the clean data and most of the raw data (in the data/ folder).

How Do I Download Your Graphics?

Low resolution version of the graphics can be downloaded one at a time from the website by right clicking on them. If you want to download them all at once, or download the high resolution versions, follow the same process for downloading the data from GitHub (see above). The images will be in the results/ folder.




Copyright © 2020-2021 Christopher Prener, Ph.D. All code, data, website content, and images are available under a Creative Commons Attribution 4.0 International License.