Once identification numbers have been added, this function creates a subset of unique addresses that will be parsed and then re-applied to the primary data set.

pm_prep(.data, var, type)

Arguments

.data

A source tibble that has already had identification numbers added using pm_identify.

var

A character variable containing the address data to be parsed.

type

The type of addresses to be parsed, either "street" or "intersection".

Value

A tibble with one observation per unique address in the source data frame. The tibble will have two variables, pm.uid and pm.address, which are used as the basis for exploratory parsing.
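
The shape of the returned object can be illustrated with a minimal base R sketch. This is not the package's implementation; the `identified` data frame, its `name` column, and the hand-assigned pm.uid values are hypothetical stand-ins for data that has already been passed through pm_identify, and a plain data frame is used in place of a tibble to keep the example self-contained.

```r
# Hypothetical stand-in for data already processed by pm_identify():
# rows with the same address share the same pm.uid.
identified <- data.frame(
  name    = c("A", "B", "C", "D"),
  address = c("123 Main St", "456 Oak Ave", "123 Main St", "789 Elm Blvd"),
  pm.uid  = c(1L, 2L, 1L, 3L),
  stringsAsFactors = FALSE
)

# Keep one row per unique address, with only the two variables described above.
prepped <- unique(data.frame(
  pm.uid     = identified$pm.uid,
  pm.address = identified$address,
  stringsAsFactors = FALSE
))
rownames(prepped) <- NULL

# Anything derived from the subset can later be re-applied to every
# original row by joining on pm.uid.
rejoined <- merge(identified, prepped, by = "pm.uid")
```

Here `prepped` has one row for each distinct address, while `rejoined` has one row for each row of the original data.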

Details

Creation of the subset depends on whether the addresses are street addresses (with house numbers) or intersections. Any addresses identified as "short", "full", or "partial" will be returned when type = "street". Addresses identified as "unknown" will also be returned when type = "street", but the accuracy of the parser may be limited for them. Addresses identified as intersections will be returned when type = "intersection".
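
A typical call might look like the sketch below. It assumes the postmastr package is installed, that it ships an example data set named `sushi1` with an `address` column, and that `var` accepts a quoted column name; these details are recalled from the package's example materials rather than stated on this page, so treat them as assumptions. The code is skipped when the package is not available.

```r
# Placeholder so the object exists even when postmastr is not installed.
sushi1_min <- NULL

if (requireNamespace("postmastr", quietly = TRUE)) {
  # Assumed example data shipped with postmastr (not confirmed on this page).
  sushi1 <- postmastr::sushi1

  # Add identification numbers first, as pm_prep() requires.
  sushi1_id <- postmastr::pm_identify(sushi1, var = "address")

  # Build the subset of unique street addresses to be parsed.
  sushi1_min <- postmastr::pm_prep(sushi1_id, var = "address", type = "street")
}
```

Passing type = "intersection" instead would return only the addresses identified as intersections.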