Once identification numbers have been been added, this function creates a subset that will be parsed and then re-applied to the primary data set.
pm_prep(.data, var, type)
.data | A source tibble that has already had identification numbers added using pm_identify. |
---|---|
var | A character variable containing address data to be parsed |
type | The type of addresses to be parsed, one of either |
A tibble with one observation per unique address in the source data
frame. The tibble will have two variables, pm.uid
and pm.address
,
which is used as the basis for exploratory parsing.
Creation of the subset data is dependent on whether the data are
house numbers or intersections. Any addresses identified as "short"
,
"full"
, or "partial"
will be returned when type = "street"
.
Addresses that are identified as "unknown"
will also be included in "street"
,
but the accuracy of the parser may be limited. Addresses identified as intersections
will be returned when type = "intersection"
.