Al
|
6388a79bf0
|
[addresses] strip "-", etc. in addr:housenumber
|
2016-12-21 01:53:23 -05:00 |
|
Al
|
c33db4f04d
|
[addresses] normalize existing sub-building components
|
2016-12-21 01:28:43 -05:00 |
|
Al
|
cc4098fb05
|
[openaddresses] abbreviate states as well in OpenAddresses when full version is specified
|
2016-12-20 17:24:12 -05:00 |
|
Al
|
9e44fcb2bb
|
[addresses] abbreviating neighborhoods/city_districts
|
2016-12-20 03:01:34 -05:00 |
|
Al
|
53723bbf3d
|
[fix] passing argument through to normalized_place_name
|
2016-12-20 02:21:38 -05:00 |
|
Al
|
6d02fbb9b8
|
[addresses] switch for phrases that come from components so they only get stripped if they contain another phrase a la Washington, D.C. Consolidating always_use_full_names and random_key options
|
2016-12-20 01:42:40 -05:00 |
|
Al
|
f35fd97735
|
[boundaries] add abbreviated state names to valid component names
|
2016-12-19 00:51:05 -05:00 |
|
Al
|
d02a18a5a8
|
[fix] all_names, use values instead of name keys
|
2016-12-18 17:29:15 -05:00 |
|
Al
|
e9c7bc43e3
|
[fix] check fixed list of keys in all_names as well
|
2016-12-18 17:26:43 -05:00 |
|
Al
|
2727572822
|
[addresses] using the name key disttribution in AddressComponents.all_names. Returning names and valid components from the new function instead of the full gazetteer (can be build later)
|
2016-12-18 17:22:13 -05:00 |
|
Al
|
d308473686
|
[addresses] separating boundary phrase gazetteer construction into its own method
|
2016-12-18 15:47:20 -05:00 |
|
Al
|
846b88cde5
|
[addresses] let the place config take care of adding/removing neighborhoods rather than doing it as part of the add_neighborhoods method
|
2016-12-14 03:15:07 -05:00 |
|
Al
|
5946ead37f
|
[addresses] using the defined component from the neighborhoods index for city_district (they're fairly rare, just NYC boroughs basically)
|
2016-12-14 03:10:07 -05:00 |
|
Al
|
5846943b70
|
[addresses] removing place_type override requirement from the neighborhoods index (NYC boroughs, etc.)
|
2016-12-14 02:16:57 -05:00 |
|
Al
|
40cd86c3be
|
[addresses] only add city relacement if a city is not found first
|
2016-12-13 16:10:52 -05:00 |
|
Al
|
d158751d92
|
[addresses] same rules for state_district apply to state, no alt_names etc. unless a city is present
|
2016-12-12 05:31:32 -05:00 |
|
Al
|
da4fe37fb4
|
[addresses] option to add city points, no random keys for state_district if city or replacement is not present
|
2016-12-11 16:24:16 -05:00 |
|
Al
|
dfc88a47b2
|
[fix] typo
|
2016-12-11 02:46:03 -05:00 |
|
Al
|
e8abf44c16
|
[neighborhoods] check if there's no defined place-type before classifying a polygon as city_district
|
2016-12-11 02:44:02 -05:00 |
|
Al
|
fffc81a17a
|
[fix] default value
|
2016-12-10 18:14:25 -05:00 |
|
Al
|
91982528c6
|
[fix] normalize place names after adding admin boundaries as well
|
2016-12-10 18:07:41 -05:00 |
|
Al
|
34d3ae7e9e
|
[addresses] fixing normalized_place_name so it deals with things like Washington DC where Washington DC may actually be one of the OSM names
|
2016-12-10 17:52:38 -05:00 |
|
Al
|
4550f00f03
|
[fix] var name
|
2016-12-10 15:18:09 -05:00 |
|
Al
|
72771741c3
|
[fix] order
|
2016-12-10 15:16:35 -05:00 |
|
Al
|
8595d8da05
|
[addresses] don't add components to the trie that have the same normalized name as the given component
|
2016-12-10 15:12:40 -05:00 |
|
Al
|
ffc584f679
|
[states] adding all forms of the state abbreviation to the trie when doing place name normalization to handle the D.C./DC case
|
2016-12-10 13:45:22 -05:00 |
|
Al
|
5098599ed6
|
[addresses] remove Quattroshapes/GeoNames cities as they may have problematic names, and in any case we have point-based cities from OSM now
|
2016-12-10 02:08:40 -05:00 |
|
Al
|
18c5fd0855
|
[fix] check for non-None city
|
2016-12-10 01:23:06 -05:00 |
|
Al
|
dc022f8652
|
[osm] adding normalized_place_name to Quattroshapes city
|
2016-12-10 01:20:40 -05:00 |
|
Al
|
c7b1818695
|
[fix] imports
|
2016-12-09 19:53:17 -05:00 |
|
Al
|
675552d254
|
[addresses] using normalized tokens when stripping off compound place names for things like D.C.
|
2016-12-09 17:52:57 -05:00 |
|
Al
|
f9945103ba
|
[addresses] if suburb/city_district is already listed, and we're finding the closest city by point rather than by boundary, use the closest actual city, not something smaller like a village/hamlet
|
2016-12-08 02:39:27 -05:00 |
|
Al
|
7436d9693a
|
[names] adding new name_affixes call to replace both prefixes/suffixes in one call, using in GeoPlanet training and the generic AddressComponents normalizations
|
2016-12-07 05:49:16 -05:00 |
|
Al
|
e13787a6f6
|
[fix] var name again
|
2016-12-05 18:49:23 -05:00 |
|
Al
|
e1c6eff5e2
|
[fix] var
|
2016-12-05 18:46:49 -05:00 |
|
Al
|
da36b71829
|
[addresses] adding new places index in OSM and OpenAddresses training data
|
2016-12-05 18:36:17 -05:00 |
|
Al
|
628fecea59
|
[addresses] adding point-based city/equivalent reverse geocoding for places that don't have as many defined polygons in OSM
|
2016-12-05 18:30:46 -05:00 |
|
Al
|
ef243fbb18
|
[fix] var name
|
2016-11-25 13:41:07 -08:00 |
|
Al
|
cdbc102821
|
[boundaries] in addition to population, check if a city has an unambiguous Wikipedia
|
2016-11-25 13:36:49 -08:00 |
|
Al
|
3dc2a922fb
|
[addresses/languages] if there's only one default language and we don't have a road name or a unicode script to disambiguate, assume the default (e.g. English in the US unless there's a Spanish/French road name). Can affect things like state abbreviations
|
2016-11-22 18:27:54 -05:00 |
|
Al
|
de9bf29af0
|
[addresses] allowing osm_components argument to AddressComponents.expanded
|
2016-11-19 01:38:02 -05:00 |
|
Al
|
ca89a6ca2e
|
[fix] args
|
2016-11-18 18:09:48 -05:00 |
|
Al
|
4e30a23313
|
[addresses] Adding toponym abbreviation to the input admin components as well as those obtained through reverse geocoding. Also was doing two random tests before abbreviating toponyms, reducing their frequency in the training data, now correctly using a single test.
|
2016-11-17 19:53:09 -05:00 |
|
Al
|
15b66f541c
|
[fix] refactor to use ComponentDependencies class
|
2016-11-15 17:07:10 -05:00 |
|
Al
|
653b2d09c0
|
[addresses] moving component dependency graphs to a new module
|
2016-11-14 16:45:15 -05:00 |
|
Al
|
495b27470e
|
[addresses] refactoring address component dependency graphs
|
2016-11-12 18:09:36 -05:00 |
|
Al
|
e9106698d2
|
[fix] convert newlines
|
2016-10-27 12:01:48 -04:00 |
|
Al
|
d51a1d6196
|
[addresses] doing hyphenation for existing components in component expansion (i.e. OSM training data)
|
2016-10-21 22:02:19 -04:00 |
|
Al
|
00ebdfed7f
|
[osm] adding alt_place_names to the shared formatting class AddressComponents and making them classmethods
|
2016-10-20 20:41:22 -04:00 |
|
Al
|
51afc2619b
|
[fix] only replace whitespace between words, not for instance whitespace around an existing hyphen, and reducing to one space for spaced hyphens
|
2016-10-19 01:24:54 -04:00 |
|