class: center, middle, inverse, title-slide # tidystl ## Mapping St. Louis health and social data with R ### Chris Prener ### April 18th, 2019 --- # "Hi, I'm Chris" .pull-left[ * Assistant Professor of Sociology at Saint Louis University * I teach statistics, GIS, and classes on urban and health issues * I'm interested in how neighborhoods shape our lives, and how we in turn shape neighborhoods... * ...which means I spend a lot of time using bulk administrative data released by cities. ] <img src="assets/prenerLogo.png" width="40%" style="display: block; margin: auto 0 auto auto;" /> --- # St. Louis Offers A Lot to Researchers <img src="assets/forclose.png" width="100%" style="display: block; margin: auto;" /> --- # But... <img src="assets/kondo.png" width="100%" style="display: block; margin: auto;" /> --- # Barriers to Joyful Data * There is no single API access point for data from the City * Closed formats (`.pdf`, Access Databases) are the norm * Untidy data abounds --- # Untidy Data Abounds <img src="assets/stlmpd_01.png" width="100%" style="display: block; margin: auto;" /> --- # Untidy Data Abounds <img src="assets/stlmpd_02.png" width="100%" style="display: block; margin: auto;" /> --- # Untidy Data Abounds <img src="assets/stlmpd_03.png" width="100%" style="display: block; margin: auto;" /> --- # Untidy Data Abounds <img src="assets/stlmpd_04.png" width="100%" style="display: block; margin: auto;" /> --- # Barriers to Joyful Data * There is no single API access point for data from the City * Closed formats (`.pdf`, Access Databases) are the norm * Untidy data abounds * Answering questions with the City's data also requires specialized GIS skills --- # Barriers to Joyful Data * There is no single API access point for data from the City * Closed formats (`.pdf`, Access Databases) are the norm * Untidy data abounds * Answering questions with the City's data also requires specialized GIS skills * Geocoding is expensive * for example, geocoding the 1.4 million Citizens' Service Bureau incidents would cost ~$5,600 --- # The `tidystl` Ecosystem <img src="assets/tidystl_01.png" width="100%" style="display: block; margin: auto;" /> --- # The `tidystl` Ecosystem <img src="assets/tidystl_02.png" width="100%" style="display: block; margin: auto;" /> --- # The `tidystl` Ecosystem <img src="assets/tidystl_03.png" width="100%" style="display: block; margin: auto;" /> --- # The `tidystl` Ecosystem <img src="assets/tidystl_04.png" width="100%" style="display: block; margin: auto;" /> --- # The `tidystl` Ecosystem <img src="assets/tidystl_05.png" width="100%" style="display: block; margin: auto;" /> --- class: inverse, center, middle # Illustrations --- # Address Normalization <img src="assets/normalize.png" width="100%" style="display: block; margin: auto;" /> --- # Open Source, Composite Geocoding <img src="assets/compositeGeocoder.png" width="100%" style="display: block; margin: auto;" />