You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
At present, addresses are ingested into a storage table (currently user_data.address_table for normalized addresses and user_data.std_address_table for standardized addresses) and then geocoded in place. During geocoding, if the rating of the best candidate result is above some threshold value (currently 22), no geocoded candidate result is accepted and the value '-1' is inserted into the rating column. Subsequent runs of the geocoder (using the included geocoding functions) will only geocode rows where the rating is NULL, which essentially means only newly added distinct rows will be geocoded.
Even with the rating thresholding, the normalization or standardization rules can still misinterpret the address and locate it far from the correct location. If users don't have an easy method for flagging these results or just purging their results and imputing a rating of '-1', the address table will become polluted and it will become a chore to work with.
Possible Solutions:
(greater prevention) facilitate use the restrict_region parameter in geocoding
(post-hoc cleanup) develop functionality to allow us to select the records geocoded in a specific geocoding run (perhaps the filename and a timestamp of the start-time) and provide a boundary polygon where only records geocoded to locations within the polygon (from a geocoding run) are retained.
The text was updated successfully, but these errors were encountered:
At present, addresses are ingested into a storage table (currently
user_data.address_table
for normalized addresses anduser_data.std_address_table
for standardized addresses) and then geocoded in place. During geocoding, if therating
of the best candidate result is above some threshold value (currently 22), no geocoded candidate result is accepted and the value '-1' is inserted into therating
column. Subsequent runs of the geocoder (using the included geocoding functions) will only geocode rows where therating
isNULL
, which essentially means only newly added distinct rows will be geocoded.Even with the
rating
thresholding, the normalization or standardization rules can still misinterpret the address and locate it far from the correct location. If users don't have an easy method for flagging these results or just purging their results and imputing arating
of '-1', the address table will become polluted and it will become a chore to work with.Possible Solutions:
restrict_region
parameter in geocodingThe text was updated successfully, but these errors were encountered: