my_address_file.csv is a file in the current working directory with an address column named
address, then the DeGAUSS command:
docker run --rm -v $PWD:/tmp ghcr.io/degauss-org/geocoder:3.2.1 my_address_file.csv
my_address_file_geocoder_3.2.1_score_threshold_0.5.csv with added columns:
matched_zip: matched address componets (e.g.,
matched_streetis the street the geocoder matched with the input address); can be used to investigate input address misspellings, typos, etc.
precision: The method/precision of the geocode. The value will be one of:
range: interpolated based on address ranges from street segments
street: center of the matched street
intersection: intersection of two streets
zip: centroid of the matched zip code
city: centroid of the matched city
score: The percentage of text match between the given address and the geocoded result, expressed as a number between 0 and 1. A higher score indicates a closer match. Note that each score is relative within a precision method (i.e. a
rangeis not the same as a
lon: geocoded coordinates for matched address
geocode_result: A character string summarizing the geocoding result. The value will be one of
geocoded: the address was geocoded with a
imprecise_geocode: the address was geocoded, but results were suppressed because the
scorewas less than
po_box: the address was not geocoded because it is a PO Box
cincy_inst_foster_addr: the address was not geocoded because it is a known institutional address, not a residential address
non_address_text: the address was not geocoded because it was blank or listed as “foreign”, “verify”, or “unknown”
cityare returned with a missing
lonbecause they are likely too inaccurate and/or too imprecise to be used for further analysis.
lonare also returned as missing if the
scoreis less than
0.5(regardless of the precision).
docker run --rm -v $PWD:/tmp degauss/geocoder:3.2.0 my_address_file.csv 0.6).
addressand an optional identifier column (e.g.,
id). Fewer columns will increase geocoding speed.
32709) and not “plus four” (i.e.
3333 Burnet Ave Cincinnati 45229 OH)
geocoder.dbis a SQL database prepared following the instructions here using 2021 TIGER/Line Street Range Address files from the Census
For detailed documentation on DeGAUSS, including general usage and installation, please see the DeGAUSS homepage.