Commit Graph

131 Commits

Author SHA1 Message Date
Al
e92963de50 [openaddresses] adding new counties from OpenAddresses, strip commas option for thousands separators 2016-12-09 01:57:21 -05:00
Al
3ff472c8cf [openaddresses] fixing house numbers with multiple consecutive hyphens 2016-12-06 22:50:14 -05:00
Al
da36b71829 [addresses] adding new places index in OSM and OpenAddresses training data 2016-12-05 18:36:17 -05:00
Al
cdbc102821 [boundaries] in addition to population, check if a city has an unambiguous Wikipedia 2016-11-25 13:36:49 -08:00
Al
87634a36e1 [openaddresses] for cases where city populations are not known (i.e. not getting boundaries from OSM, most of the sources in OpenAddresses), place-only records should have at least two identifying components. Helps when city names, etc. are highly ambiguous and need to be qualified 2016-11-25 00:56:38 -08:00
Al
e07c74f077 [fix] config 2016-11-24 03:57:52 -05:00
Al
46b7043dc7 [fix] typo 2016-11-24 03:50:11 -05:00
Al
fcf4717335 [openaddresses] adding city_replacements handling to OA formatter 2016-11-23 20:16:48 -05:00
Al
5cabd9b4f7 [fix] country languages in OpenAddresses 2016-10-24 17:35:39 -04:00
Al
35d3d8cc73 [openaddresses] countries are known a priori, so if the boundaries don't quite line up with OSM, use the country from the path 2016-10-23 19:50:54 -04:00
Al
1658c425c5 [fix] clear country cache only at each new country, not each file 2016-10-23 00:57:52 -04:00
Al
7199ff17e0 [fix] truncate postcodes that are longer than specified length 2016-10-23 00:52:24 -04:00
Al
889e914dfc [openaddresses] clear all polygon caches 2016-10-23 00:11:54 -04:00
Al
63edd53fb3 [openaddresses] adding clear_cache method to clear the LRU cache for point-in-polygon indices and using it in OpenAddresses import since it heavily reuses polygons and only for the current file 2016-10-22 20:28:59 -04:00
Al
2a355b2cf8 [openaddresses] adding address only 10% of the time in OpenAddresses 2016-10-20 23:57:30 -04:00
Al
d965ea9371 [openaddresses] adding hyphenation/dehyphenation to the OpenAddresses formatter 2016-10-20 20:55:17 -04:00
Al
ecd71ee10d [fix] var name 2016-10-06 15:36:51 -04:00
Al
6b0186782d [openaddresses] doing country-specific cleanups in OpenAddresses 2016-10-05 17:07:29 -04:00
Al
432f9dd42e [fix] format of candidate_languages in the new OSM rtree 2016-10-05 03:12:07 -04:00
Al
faf418decb [languages] using country_and_languages method in OSM, neighborhoods and OpenAddresses 2016-10-05 02:49:55 -04:00
Al
ad6ddd1ede [fix] var names 2016-10-04 14:35:45 -04:00
Al
373708b595 [openaddresses] replace name affixes (remove things like "city of"), prune duplicate names, remove numeric boundary names, cleanup boundary names, and add house number + postcode phrases where appropriate 2016-09-22 00:57:11 -04:00
Al
d667039397 [openaddresses] for configs with add_osm_boundaries=true, skip adding boundary fields from the OA file altogether when they're specified 2016-09-16 01:55:36 -04:00
Al
95cf6ad0fa [fix] default again 2016-09-16 01:11:59 -04:00
Al
d5a5104de9 [fix] default 2016-09-16 01:10:19 -04:00
Al
32ad1d7bd0 [fix] var name 2016-09-16 01:07:10 -04:00
Al
b618d1eaf2 [fix] var name 2016-09-16 01:02:47 -04:00
Al
9b250a9393 [openaddresses] adding zero-padding option for postcodes and using in Puerto Rico 2016-09-15 11:22:55 -04:00
Al
e8408d39fd [fix] unzip_file checks status code 2016-09-12 16:42:02 -04:00
Al
551cce8cb1 [fix] making a separate gazetteer for toponym abbreviations 2016-09-10 01:08:58 -04:00
Al
bcde9e2fe7 [fix] toponym abbreviations after country name, may want to use it 2016-09-10 00:49:31 -04:00
Al
bbc5131cb6 [fix] toponym abbreviations 2016-09-10 00:48:31 -04:00
Al
19a044f7f3 [fix] imports 2016-09-10 00:09:11 -04:00
Al
ae02b0769d [openaddresses] abbreviating boundary components for OpenAddresses 2016-09-10 00:04:11 -04:00
Al
5d26ab41e7 [openaddresses] removing OpenAddresses hacks now that upstream changes are merged 2016-09-09 09:40:45 -04:00
Al
4c6bcda3b2 [fix] config 2016-09-08 15:21:19 -04:00
Al
d1e3c6a24a [openaddresses] adding Italy countrywide to a pre_release_downloads set so it can be used in libpostal without having been merged yet 2016-09-08 15:16:35 -04:00
Al
170e8d74d8 [fix] checking for components 2016-09-08 03:19:10 -04:00
Al
769a65b808 [openaddresses] adding place-only and place+postcode probability to OpenAddresses to capture more place names not in OSM as standalone queries 2016-09-08 03:17:21 -04:00
Al
317e4caca9 [fix] only percent quote the filename 2016-09-07 13:55:34 -04:00
Al
f061d4239b [fix] quote, not urlencode 2016-09-07 13:51:08 -04:00
Al
5c915855c9 [fix] urlencode 2016-09-07 13:48:39 -04:00
Al
62c8fa9048 [fix] encoding III 2016-09-07 13:39:38 -04:00
Al
6e5908385a [fix] encoding again 2016-09-07 13:34:04 -04:00
Al
3bc122f7a0 [fix] encoding 2016-09-07 13:28:18 -04:00
Al
7e7ee7462a [fix] dutch house number formatting, strip spaces 2016-09-02 14:47:52 -04:00
Al
95384e5a2c [openaddresses adding hack for Honolulu until join function can handle null in OpenAddresses 2016-09-02 14:29:40 -04:00
Al
4e9f88594b [fix] /safe_encode/safe_decode/ 2016-09-02 13:50:48 -04:00
Al
8fd69b5e4a [fix] args 2016-09-02 12:03:24 -04:00
Al
df8e781e02 [openaddresses] adding hack for Italy until machine's join function handles null fields 2016-09-02 12:01:04 -04:00