my_address_file_geocoded.csv is a file in the current working directory with coordinate columns named
end_date, then the DeGAUSS command:
docker run --rm -v $PWD:/tmp ghcr.io/degauss-org/st_census_tract:0.2.1 my_address_file_geocoded.csv
my_address_file_geocoded_st_census_tract_0.2.1.csv with added columns:
census_tract_vintage: decennial census year
census_tract_id: census tract FIPS identifier
Block Group identifiers are defined as the concatenation of the state, county, tract, and block group fips identifiers (commonly called GISJOIN or GEOID in census data). All census tract identifiers are 11 digits and all census block group identifiers are 12 digits, with the exception of some 1990, 1980, and 1970 tracts that are 9 digits, resulting in 10 digit block group identifiers.
Input data must have columns called
lon containing the latitude and longitude, respecitvely, as well as
end_date specifying a date range over which tract-level geomarkers will be assessed. The date range will be used to assign a census tract vintage, ranging from 1970 to 2020 by decade. If you do not have temporal data and wish to use the 2010 tract or block group boundaries, you can utilize the census_block_group DeGAUSS container.
After the vintage is assigned, the latitude and longitude will be overlayed within a tract to assign a census tract identifier from the appropriate decade.
If the date range spans two census decades, the result will contain one row per decade. For example,
where a 2010 tract identifier is assigned to the first row, and a 2020 tract identifier is assigned to the second row.
For detailed documentation on DeGAUSS, including general usage and installation, please see the DeGAUSS homepage.