This is an example of the workflow a PAC3 study site might use to add geomarkers to their data with DeGAUSS.
In steps 2 through 6:
See the Installing Docker webpage.
Note about Docker Settings:
After installing Docker, but before running containers, go to Docker Settings > Advanced and set the memory to more than 4000 MB (or 4 GiB).
If you are using a Windows computer, also set CPUs to 1.
Click Apply and wait for Docker to restart.
The input file must be a CSV file with one column called
address containing all address components. Other columns
may be present and will be returned in the output file, but should be
kept to a minimum to reduce file size.
An example input CSV file (called my_address_file.csv)
might look like:
| id | address |
|---|---|
| 13100070229 | 1922 CATALINA AV CINCINNATI, OH 45237 |
| 54000600136 | 5358 LILIBET CT DELHI TOWNSHIP, OH 45238 |
| 11200020024 | 630 GREENWOOD AV CINCINNATI, OH 45229 |
Refer to the DeGAUSS geocoding webpage for more information about the input file and address string formatting.
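Before geocoding, it can help to confirm that the file meets the input requirement described above. The following is a minimal sketch, not part of DeGAUSS: the function name `check_input_csv` is hypothetical, and treating an empty address as an error is an assumption made for illustration rather than a documented rule of the geocoder.

```python
import csv
import io


def check_input_csv(handle):
    """Return (ok, message) for an input CSV opened in text mode.

    Hypothetical helper: checks only for a column named `address`
    and (as an assumption) flags rows where that column is empty.
    """
    reader = csv.DictReader(handle)
    columns = reader.fieldnames or []
    if "address" not in columns:
        return False, f"missing 'address' column; found {columns}"
    # Count rows whose address is missing or whitespace only.
    blank = sum(1 for row in reader if not (row["address"] or "").strip())
    if blank:
        return False, f"{blank} row(s) have an empty address"
    return True, "ok"


# In-memory sample mirroring the example table above. Addresses that
# contain commas must be quoted so they stay in a single column.
sample = io.StringIO(
    "id,address\n"
    '13100070229,"1922 CATALINA AV CINCINNATI, OH 45237"\n'
    '54000600136,"5358 LILIBET CT DELHI TOWNSHIP, OH 45238"\n'
)
print(check_input_csv(sample))
```

For a file on disk, the same function could be called as `check_input_csv(open("my_address_file.csv", newline=""))`.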
After navigating to your working directory, use the ghcr.io/degauss-org/geocoder
container to geocode your addresses.
macOS example call:
docker run --rm -v "$PWD":/tmp ghcr.io/degauss-org/geocoder:3.0.2 my_address_file.csv
Windows (CMD) example call:
docker run --rm -v "%cd%":/tmp ghcr.io/degauss-org/geocoder:3.0.2 my_address_file.csv
Replace my_address_file.csv with the name of the CSV
file to be geocoded and run the call in the shell.
Note for Windows Users:
In this and all following docker calls in this example, replace "$PWD" with "%cd%". Refer to the DeGAUSS Troubleshooting page for more information.
See here for more information on the anatomy of a DeGAUSS command.
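The macOS and Windows calls differ only in how the shell expands the working directory. As a sketch of one way to sidestep that difference, a small script can resolve the directory itself and hand Docker the same arguments on either platform. The helper names `degauss_command` and `run_degauss` are hypothetical and not part of DeGAUSS; the image name, the `/tmp` mount point, and the `--rm` and `-v` flags are taken from the example calls above.

```python
import os
import subprocess


def degauss_command(image, *args, workdir=None):
    """Build the `docker run` argument list for a container call.

    Hypothetical helper. The working directory is resolved in Python,
    so no shell-specific "$PWD" or "%cd%" expansion is needed.
    """
    workdir = os.path.abspath(workdir or os.getcwd())
    # Mount the working directory at /tmp, as in the example calls.
    return ["docker", "run", "--rm", "-v", f"{workdir}:/tmp", image, *args]


def run_degauss(image, *args, workdir=None):
    """Run the container; requires Docker to be installed and running."""
    subprocess.run(degauss_command(image, *args, workdir=workdir), check=True)


# Show the command that would be run, without starting Docker.
cmd = degauss_command("ghcr.io/degauss-org/geocoder:3.0.2", "my_address_file.csv")
print(" ".join(cmd))
```

Calling `run_degauss("ghcr.io/degauss-org/geocoder:3.0.2", "my_address_file.csv")` from the working directory would then start the geocoder in the same way as the shell calls above.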
The output file is written to the same directory and,
in our example, will be called
my_address_file_geocoded_v3.0.2.csv.
Example output:
| id | address | start_date | end_date | matched_street | matched_zip | matched_city | matched_state | lat | lon | score | precision | geocode_result |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 54000600136 | 5358 LILIBET CT DELHI TOWNSHIP OH 45238 | 2015-05-05 | 2015-05-06 | Lilibet Ct | 45238 | Delhi Hills | OH | 39.11552 | -84.61902 | 0.754 | range | geocoded |
| 13100070229 | 1922 CATALINA AV CINCINNATI OH 45237 | 2010-06-07 | 2010-06-08 | Catalina Ave | 45237 | Cincinnati | OH | 39.17112 | -84.46176 | 0.922 | range | geocoded |
| 11200020024 | 630 GREENWOOD AV CINCINNATI OH 45229 | 2019-07-08 | 2019-07-09 | Greenwood Ave | 45229 | Cincinnati | OH | 39.15321 | -84.49236 | 0.922 | range | geocoded |
For more information on interpreting geocoder output, see here.
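A quick tally of the `geocode_result` and `precision` columns gives a first impression of how the geocoding went before moving on. The sketch below assumes only that the output contains those two columns, as in the example output above; the function name `summarize_geocoding` is hypothetical, and the sample rows are a column subset of that example.

```python
import csv
import io
from collections import Counter


def summarize_geocoding(handle):
    """Count rows per (geocode_result, precision) pair.

    Hypothetical helper; it relies only on the two column names
    shown in the example geocoder output.
    """
    reader = csv.DictReader(handle)
    return Counter((row["geocode_result"], row["precision"]) for row in reader)


# Column subset of the example output above.
sample = io.StringIO(
    "id,lat,lon,score,precision,geocode_result\n"
    "54000600136,39.11552,-84.61902,0.754,range,geocoded\n"
    "13100070229,39.17112,-84.46176,0.922,range,geocoded\n"
    "11200020024,39.15321,-84.49236,0.922,range,geocoded\n"
)
for (result, precision), count in summarize_geocoding(sample).items():
    print(result, precision, count)
```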
macOS example call:
docker run --rm -v "$PWD":/tmp ghcr.io/degauss-org/dep_index:0.1 my_address_file_geocoded_v3.0.2.csv
Windows (CMD) example call:
docker run --rm -v "%cd%":/tmp ghcr.io/degauss-org/dep_index:0.1 my_address_file_geocoded_v3.0.2.csv
Replace my_address_file_geocoded_v3.0.2.csv with the
name of the geocoded CSV file created in Step 3 and run.
The output file is written to the same directory
and, in our example, will be called
my_address_file_geocoded_v3.0.2_dep_index_v0.1.csv.
Example output:
| id | address | matched_street | matched_zip | matched_city | matched_state | lat | lon | score | precision | geocode_result | fips_tract_id | fraction_assisted_income | fraction_high_school_edu | median_income | fraction_no_health_ins | fraction_poverty | fraction_vacant_housing | dep_index |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 54000600136 | 5358 LILIBET CT DELHI TOWNSHIP OH 45238 | Lilibet Ct | 45238 | Delhi Hills | OH | 39.11552 | -84.61902 | 0.754 | range | geocoded | 39061021303 | 0.0380034 | 0.9396114 | 83385 | 0.0236515 | 0.0250104 | 0.0128779 | 0.2087159 |
| 13100070229 | 1922 CATALINA AV CINCINNATI OH 45237 | Catalina Ave | 45237 | Cincinnati | OH | 39.17112 | -84.46176 | 0.922 | range | geocoded | 39061006300 | 0.1149033 | 0.8787645 | 38395 | 0.0391429 | 0.1641705 | 0.1284085 | 0.3569748 |
| 11200020024 | 630 GREENWOOD AV CINCINNATI OH 45229 | Greenwood Ave | 45229 | Cincinnati | OH | 39.15321 | -84.49236 | 0.922 | range | geocoded | 39061006800 | 0.3517316 | 0.8051400 | 19783 | 0.0579212 | 0.3901274 | 0.2309613 | 0.5527528 |
More information on the deprivation index
More information on the dep_index container
macOS example call:
docker run --rm -v "$PWD":/tmp ghcr.io/degauss-org/drivetime:1.3.0 my_address_file_geocoded_v3.0.2_dep_index_v0.1.csv cchmc
Windows (CMD) example call:
docker run --rm -v "%cd%":/tmp ghcr.io/degauss-org/drivetime:1.3.0 my_address_file_geocoded_v3.0.2_dep_index_v0.1.csv cchmc
Replace
my_address_file_geocoded_v3.0.2_dep_index_v0.1.csv with the
name of the CSV file created in Step 4, and replace cchmc
with the abbreviation for your care center from this list:
| center_name | abbreviation |
|---|---|
| Children’s Hospital of Philadelphia | chop |
| Riley Hospital for Children, Indiana University | riley |
| Seattle Children’s Hospital | seattle |
| Children’s Mercy Hospital | mercy |
| Emory University | emory |
| Johns Hopkins University | jhu |
| Cleveland Clinic | cc |
| Levine Children’s | levine |
| St. Louis Children’s Hospital | stl |
| Oregon Health and Science University | ohsu |
| University of Michigan Health System | umich |
| Children’s Hospital of Alabama | al |
| Cincinnati Children’s Hospital Medical Center - Main Campus | cchmc |
| Cincinnati Children’s Hospital Medical Center - Liberty Campus | liberty |
| Nationwide Children’s Hospital | nat |
| University of California, Los Angeles | ucla |
| Boston Children’s Hospital | bch |
| Medical College of Wisconsin | mcw |
| St. Jude’s Children’s Hospital | stj |
| Martha Eliot Health Center | mehc |
| Northwestern / Ann & Lurie Children’s Northwestern | nwu |
| Lurie Children’s Outpatient Center in Northbrook | lcclp |
| Lurie Children’s Outpatient Center in Lincoln Park | lcclp |
| Lurie Children’s Outpatient Center in Uptown | lccu |
| Dr. Lio’s and Dr. Aggarwal’s clinics | lac |
| Recruited from Eczema Expo 2018 | expo |
| University of California San Francisco Benioff Children’s Hospital | ucsf |
| Nicklaus Children’s Hospital | nicklaus |
| Medical University of South Carolina Children’s Hospital | musc |
| Children’s National Medical Center | cnmc |
| Children’s Hospital of Pittsburgh of UPMC | upmc |
| Methodist LeBonheur Children’s Hospital | methodist |
| Texas Children’s Hospital | texas |
| Arkansas Children’s Hospital | arkansas |
| Primary Children’s Medical Center | primary |
| Children’s Healthcare of Atlanta | atlanta |
| Children’s Medical Center of Dallas | dallas |
| Lucile Packard Children’s Hospital Stanford | packard |
| Toronto Hospital for Sick Children | toronto |
| Cook Children’s Medical Center | cook |
| Children’s Hospital & Medical Center - Omaha | omaha |
| Children’s Hospital Colorado | colorado |
| Arnold Palmer Hospital for Children | palmer |
| Children’s Hospital & Clinics of Minnesota | minn |
| University of Virginia Hospital | uva |
| Joe DiMaggio Children’s Hospital | dimaggio |
| Cohen Children’s Medical Center of New York at Northwell Health | cohen |
| Dell Children’s Medical Center of Central Texas | dell |
| A.I. duPont Hospital for Children | dupont |
| Rainbow Babies and Children’s Hospital | rainbow |
| UNC Hospitals Children’s Specialty Clinic | unc |
| Barbara Bush Children’s Hospital at Maine Medical | maine |
| Children’s Hospital of New Orleans | chnola |
| Rady Children’s Hospital | rady |
| Children’s Hospital Los Angeles | chla |
| Monroe Carell Jr. Children’s Hospital at Vanderbilt | vandy |
The output file is written to the same directory and,
in our example, will be called
my_address_file_geocoded_v3.0.2_dep_index_v0.1_drivetime_1.3.0_cchmc.csv.
Example output:
| id | address | matched_street | matched_zip | matched_city | matched_state | lat | lon | score | precision | geocode_result | fips_tract_id | fraction_assisted_income | fraction_high_school_edu | median_income | fraction_no_health_ins | fraction_poverty | fraction_vacant_housing | dep_index | drive_time | distance |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 54000600136 | 5358 LILIBET CT DELHI TOWNSHIP OH 45238 | Lilibet Ct | 45238 | Delhi Hills | OH | 39.11552 | -84.61902 | 0.754 | range | geocoded | 39061021303 | 0.0380034 | 0.9396114 | 83385 | 0.0236515 | 0.0250104 | 0.0128779 | 0.2087159 | 30 | 10219.326 |
| 13100070229 | 1922 CATALINA AV CINCINNATI OH 45237 | Catalina Ave | 45237 | Cincinnati | OH | 39.17112 | -84.46176 | 0.922 | range | geocoded | 39061006300 | 0.1149033 | 0.8787645 | 38395 | 0.0391429 | 0.1641705 | 0.1284085 | 0.3569748 | 18 | 5004.925 |
| 11200020024 | 630 GREENWOOD AV CINCINNATI OH 45229 | Greenwood Ave | 45229 | Cincinnati | OH | 39.15321 | -84.49236 | 0.922 | range | geocoded | 39061006800 | 0.3517316 | 0.8051400 | 19783 | 0.0579212 | 0.3901274 | 0.2309613 | 0.5527528 | 6 | 1755.939 |
More information on drivetime
Before sharing your data, remove the following columns:
- address
- matched_street
- matched_city
- matched_zip
- matched_state
- lat
- lon
- fips_tract_id
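The column removal described above could be scripted so that it is applied the same way every time. This is a minimal sketch using Python's standard `csv` module; the function name `drop_columns_before_sharing` is hypothetical, and the set of column names is copied from the list above.

```python
import csv
import io

# Columns to remove before sharing, as listed above.
COLUMNS_TO_REMOVE = {
    "address", "matched_street", "matched_city", "matched_zip",
    "matched_state", "lat", "lon", "fips_tract_id",
}


def drop_columns_before_sharing(in_handle, out_handle):
    """Copy a CSV, omitting COLUMNS_TO_REMOVE; return the kept columns."""
    reader = csv.DictReader(in_handle)
    kept = [c for c in (reader.fieldnames or []) if c not in COLUMNS_TO_REMOVE]
    # extrasaction="ignore" makes the writer skip the removed columns.
    writer = csv.DictWriter(
        out_handle, fieldnames=kept, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    for row in reader:
        writer.writerow(row)
    return kept


# Column subset of the example drivetime output above.
source = io.StringIO(
    "id,address,lat,lon,fips_tract_id,dep_index,drive_time,distance\n"
    "54000600136,5358 LILIBET CT DELHI TOWNSHIP OH 45238,"
    "39.11552,-84.61902,39061021303,0.2087159,30,10219.326\n"
)
shared = io.StringIO()
print(drop_columns_before_sharing(source, shared))
print(shared.getvalue())
```

For files on disk, both handles can be opened with `newline=""`, which the `csv` module documentation recommends.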