Commit Graph

4420 Commits

Author SHA1 Message Date
Al
d208397ecb [addresses] checking if component is generated in combining fields 2016-12-26 16:58:10 -05:00
Al
654fc2c463 [fix] memory cleanup in address_parser_data_set, logging any bad input lines 2016-12-26 16:18:15 -05:00
Al
e6d7b09e08 [expansions] adding generated expansion data 2016-12-26 16:16:59 -05:00
Al
4cdd245dc2 [logging] log error in address_dictionary_get_expansions 2016-12-26 16:16:26 -05:00
Al Barrentine
46a1be3443 Merge pull request #144 from bradh/utcdateref
[fix] Use UTC date reference to avoid repeating S3 downloads.
2016-12-26 14:04:41 -05:00
Al
42cf686b8e [normalization] adding LATIN_ASCII_SIMPLE option to normalize_string_latin 2016-12-26 04:15:58 -05:00
Al
0284913aa7 [utils] ignore initial separators when splitting on delimiter 2016-12-26 04:14:20 -05:00
Al
69506d01e0 [fix] escape YAML keyword in Italian province mapping 2016-12-25 21:23:00 -05:00
Brad Hards
fb68e22bbf [fix] Use UTC date reference to avoid repeating S3 downloads.
Resolves https://github.com/openvenues/libpostal/issues/143
2016-12-26 12:04:02 +11:00
Al
afe29abf6c [fix] name 2016-12-25 11:38:18 -05:00
Al
e31ab33fe0 [fix] kwarg 2016-12-25 03:19:41 -05:00
Al
6a852f02bd [fix] var 2016-12-25 03:17:05 -05:00
Al
a185441ffa [osm] adding amenity=post_office to the generic place types (shouldn't be added as venue unless there's a known phrase in the name) 2016-12-25 03:14:07 -05:00
Al
4edaca7d37 [fix] var name 2016-12-25 02:50:29 -05:00
Al
4cf40f8deb [addresses] sort combined Japanese suburbs by admin level 2016-12-25 02:29:06 -05:00
Al
98e427f027 [boundaries] city, Ankara city_district, and suburbs in Turkey 2016-12-25 01:41:39 -05:00
Al
5b5a3fe235 [fix] adding Taiwan, Hong Kong, and Macao to the CJK patterns since language affects the order 2016-12-25 01:20:59 -05:00
Al
6da092e144 [fix] update language to English when using English names in CJK countries 2016-12-25 01:18:54 -05:00
Al
51802035de [fix] var name 2016-12-25 01:08:44 -05:00
Al
11dc8c9f24 [fix] non-dict keys in OSM boundary configs 2016-12-25 00:49:57 -05:00
Al
57f17a5d38 [addresses] remove generated components in combined house numbers if the other components were not numeric. Add house number phrase after the units, etc. are generated so it may be applied to a combined house number as well 2016-12-25 00:44:48 -05:00
Al
dad57dc57e [fix] moving CJK check into the if block so language gets changed more often even if the street sign-based language is unk 2016-12-24 21:20:38 -05:00
Al
07f299837c [boundaries] add override for Kantō region in Japan, make it country_region 2016-12-24 21:03:58 -05:00
Al
826cbc7f24 [addresses/JP] more checks for matching major/minor neighborhood polygons with nodes in Japan 2016-12-24 20:21:25 -05:00
Al
e4e86261d1 [addresses/JP] just remove addr:neighborhood, addr:quarter, etc. in Japan as they're not applied consistently outside of cities 2016-12-24 20:13:48 -05:00
Al
aea0b83619 [fix] brackets 2016-12-24 20:04:12 -05:00
Al
52a15a7c3c [addresses/JP] same deal for reverse geocoding in Japan (combine admin_level 9 and 10 into a single phrase, including points with certain conditions) 2016-12-24 20:01:11 -05:00
Al
9928d249a6 [addresses/JP] combining addr:quarter and addr:neighbourhood in Japan (based on info in https://wiki.openstreetmap.org/wiki/JA:%E4%BD%8F%E6%89%80) 2016-12-24 19:54:54 -05:00
Al
b0062a35b8 [fix] var name 2016-12-24 18:52:34 -05:00
Al
840d6c25d8 [addresses] move suffix checks to the end of expanded(), check language_suffix rather than language for Romaji, so language can remain Japanese. For CJK languages, change language before adding generated components 2016-12-24 18:27:53 -05:00
Al
77efcb3f89 [fix] only accept language suffixes that are valid scripts or transliterations of CJK languages. Set language to language suffix so Romaji forms get used, etc. 2016-12-24 17:17:09 -05:00
Al
67d7d94eea [numbers] adding function to format full-width numbers as ASCII 2016-12-24 16:07:31 -05:00
Al
54b0af7f68 [addresses] add chome form for Japanese neighborhoods 2016-12-24 16:06:29 -05:00
Al
85b402063b [fix] escape literal backslash in address dictionaries 2016-12-24 16:05:45 -05:00
Al
1774d7dfa9 [formatting] adding units/levels, etc. to East Asian system address formats 2016-12-24 15:14:37 -05:00
Al
494165d4cb [dictionaries] adding French abbreviations for batiment 2016-12-24 14:22:04 -05:00
Al
441ec00289 [openaddresses] using the new fuzzy equivalence comparison to check if suburb and city names are equal 2016-12-23 02:08:53 -05:00
Al
0814381d7f [fix] dehyphenate multiword names before city/suburb comparison 2016-12-23 01:53:09 -05:00
Al
151287856d [openaddresses] fixing regexes for house number validation 2016-12-23 01:18:46 -05:00
Al
e73eb337c9 [openaddresses] no units by default 2016-12-23 01:11:45 -05:00
Al
70b98c877d [fix] except when None 2016-12-22 23:25:20 -05:00
Al
00f3f3f94d [fix] now that neighborhood is classified at index construction time, no longer need to assume suburb for components that might otherwise be a city, etc. 2016-12-22 23:21:08 -05:00
Al
481bc248a1 [fix] make city more likely, eliminate admin components from the set if they don't have names 2016-12-22 22:56:52 -05:00
Al
5ed4f35eac [places] adding suburb/city_district as city replacements for Italy (sometimes tiny places are used instead of city) 2016-12-22 22:32:08 -05:00
Al
5df9dd9810 [fix] popping city name component 2016-12-22 21:40:09 -05:00
Al
46f421e455 [fix] names 2016-12-22 20:56:58 -05:00
Al
9db3c8ee4a [fix] name 2016-12-22 20:15:21 -05:00
Al
80c404899c [neighborhoods] adding some of the pure Quattroshapes neighborhoods (not matched to OSM) to be classified as city_district, starting with districts of Prague 2016-12-22 17:58:34 -05:00
Al
5461e195c0 [openaddresses] suburb overrides the city in Australia and Italy 2016-12-22 17:50:51 -05:00
Al
043dafc12a [openaddresses] add osm_neighborhood_overrides_city option for some countries that list what-we-otherwise-think-are-suburbs as the city 2016-12-22 17:50:21 -05:00