Al
|
09f808ca47
|
[geoplanet] only add short postal codes to GeoPlanet data set if they match the Google regexes
|
2016-12-13 17:03:26 -05:00 |
|
Al
|
34db27b80c
|
[openaddresses] Mendocino County, CA
|
2016-12-13 16:44:22 -05:00 |
|
Al
|
6b04711195
|
[neighborhoods] adjust cache size when building neighborhoods index
|
2016-12-13 16:11:42 -05:00 |
|
Al
|
40cd86c3be
|
[addresses] only add city relacement if a city is not found first
|
2016-12-13 16:10:52 -05:00 |
|
Al
|
7e65661884
|
[openaddresses] Pierce County, WA
|
2016-12-13 14:03:16 -05:00 |
|
Al
|
cd91068f0f
|
[neighborhoods] fix neighborhoods index checks to include the borough points while still not making letting something like Santa Monica pass as a neighborhoods when it's a proper city
|
2016-12-13 02:30:24 -05:00 |
|
Al
|
cb475d8245
|
[openaddresses] adding Sunshine Coast, BC and Sardegna, Italy
|
2016-12-12 17:42:47 -05:00 |
|
Al Barrentine
|
bcf6b3cc68
|
Merge pull request #137 from openvenues/fix_address_parser_train
Fix address_parser_train
|
2016-12-12 11:54:16 -05:00 |
|
Al
|
8f1e69960f
|
[fix] loading transliteration module in address_parser_test.c as well
|
2016-12-12 11:37:27 -05:00 |
|
Al
|
3939dd0ca6
|
[fix] cstring_array_split calls
|
2016-12-12 11:37:27 -05:00 |
|
Al
|
a42d0e917a
|
[fix] brace
|
2016-12-12 11:37:27 -05:00 |
|
Al
|
ced8f9ae27
|
[parser] Ignore multiple spaces in parser input post-normalization. If normalizing the string creates several distinct tokens (namely in Vulgar fractions e.g. ½ => 1/2), add all the sub-tokens with the same label as the parent
|
2016-12-12 11:37:27 -05:00 |
|
Al
|
b1816e9b70
|
[utils] Adding cstring_array_split_ignore_consecutive
|
2016-12-12 11:37:27 -05:00 |
|
Al
|
6baa7087fe
|
[fix] calls and NULL checks
|
2016-12-12 11:37:27 -05:00 |
|
Al
|
5e07f5e8c5
|
[fix] tokenized_string_t should copy its source string
|
2016-12-12 11:37:27 -05:00 |
|
Al
|
521a094a47
|
[fix] Need to load transliteration module for Latin-ASCII normalization
|
2016-12-12 11:37:27 -05:00 |
|
Al
|
d158751d92
|
[addresses] same rules for state_district apply to state, no alt_names etc. unless a city is present
|
2016-12-12 05:31:32 -05:00 |
|
Al
|
bf3e9749ca
|
[osm] during place formatting, add point-based cities for any places/polygons that are smaller than cities e.g. suburb or city_district, use admin_center as the point for reverse geocoding if available (instead of representative_point() which can be expensive or centroid which can be inaccurate)
|
2016-12-12 05:29:39 -05:00 |
|
Al
|
33dd9223dc
|
[places] allowing state_district to depend on state in the US
|
2016-12-11 17:04:24 -05:00 |
|
Al
|
5d98f3115c
|
[boundareis] adding two exceptions for admin_level=9 in US
|
2016-12-11 16:58:16 -05:00 |
|
Al
|
da4fe37fb4
|
[addresses] option to add city points, no random keys for state_district if city or replacement is not present
|
2016-12-11 16:24:16 -05:00 |
|
Al
|
dfc88a47b2
|
[fix] typo
|
2016-12-11 02:46:03 -05:00 |
|
Al
|
e8abf44c16
|
[neighborhoods] check if there's no defined place-type before classifying a polygon as city_district
|
2016-12-11 02:44:02 -05:00 |
|
Al
|
01d6bc27b6
|
[fix] "District of" is only a valid prefix in the non-US Anglophone world
|
2016-12-11 02:11:51 -05:00 |
|
Al
|
9b95601e42
|
[states] adding abbreviations with internal periods for multi-word US states
|
2016-12-11 01:17:27 -05:00 |
|
Al
|
fffc81a17a
|
[fix] default value
|
2016-12-10 18:14:25 -05:00 |
|
Al
|
371198da3c
|
[fix] typo
|
2016-12-10 18:14:11 -05:00 |
|
Al
|
91982528c6
|
[fix] normalize place names after adding admin boundaries as well
|
2016-12-10 18:07:41 -05:00 |
|
Al
|
34d3ae7e9e
|
[addresses] fixing normalized_place_name so it deals with things like Washington DC where Washington DC may actually be one of the OSM names
|
2016-12-10 17:52:38 -05:00 |
|
Al
|
80ee34cc3a
|
[text] adding normalization with whitespace
|
2016-12-10 17:50:53 -05:00 |
|
Al
|
4550f00f03
|
[fix] var name
|
2016-12-10 15:18:09 -05:00 |
|
Al
|
72771741c3
|
[fix] order
|
2016-12-10 15:16:35 -05:00 |
|
Al
|
8595d8da05
|
[addresses] don't add components to the trie that have the same normalized name as the given component
|
2016-12-10 15:12:40 -05:00 |
|
Al
|
bb12d0940e
|
[fix] options/docs in osm address training
|
2016-12-10 13:45:37 -05:00 |
|
Al
|
ffc584f679
|
[states] adding all forms of the state abbreviation to the trie when doing place name normalization to handle the D.C./DC case
|
2016-12-10 13:45:22 -05:00 |
|
Al
|
5098599ed6
|
[addresses] remove Quattroshapes/GeoNames cities as they may have problematic names, and in any case we have point-based cities from OSM now
|
2016-12-10 02:08:40 -05:00 |
|
Al
|
18c5fd0855
|
[fix] check for non-None city
|
2016-12-10 01:23:06 -05:00 |
|
Al
|
dc022f8652
|
[osm] adding normalized_place_name to Quattroshapes city
|
2016-12-10 01:20:40 -05:00 |
|
Al
|
7edb983566
|
[openaddresses] adding D.C. with periodds as the state for the DC data set
|
2016-12-09 19:58:57 -05:00 |
|
Al
|
c7b1818695
|
[fix] imports
|
2016-12-09 19:53:17 -05:00 |
|
Al
|
973466bb13
|
[states] adding multiple state abbreviations for states that can have periods in the naem like D.C., D.F. in Mexico and Brasil, etc.
|
2016-12-09 19:48:59 -05:00 |
|
Al
|
d575caba8a
|
[data] using UTC for libpostal data files on the Mac version of the download script as well
|
2016-12-09 19:43:05 -05:00 |
|
Al
|
c3f3896b48
|
[fix] update test for date function in data download script
|
2016-12-09 19:29:00 -05:00 |
|
Al
|
675552d254
|
[addresses] using normalized tokens when stripping off compound place names for things like D.C.
|
2016-12-09 17:52:57 -05:00 |
|
Al
|
c0a468d7e8
|
[normalization] adding a normalize_token function and some token options for deleting periods
|
2016-12-09 17:46:26 -05:00 |
|
Al
|
318773ffe7
|
[parser] header changes for the data set struct
|
2016-12-09 13:37:45 -05:00 |
|
Al
|
69ca4a85ce
|
[openaddresses] adding units to Olpympia training data
|
2016-12-09 03:45:15 -05:00 |
|
Al
|
8f30987bdf
|
[fix] checking if building is a rail station
|
2016-12-09 02:57:47 -05:00 |
|
Al
|
e92963de50
|
[openaddresses] adding new counties from OpenAddresses, strip commas option for thousands separators
|
2016-12-09 01:57:21 -05:00 |
|
Al
|
b60b7c9009
|
[geoplanet] adding an index of state_districts, states, etc. that contain a city with an identical name. Alias to the city if it's the only contained place, otherwise don't allow the admin name without the city.
|
2016-12-08 17:00:29 -05:00 |
|