Commit Graph

1598 Commits

Author SHA1 Message Date
Al
7b75eb2243 [fix] script docs 2016-09-23 01:31:37 -04:00
Al
d66ea835b1 [fix] allowing latitude 90 for validation purposes (North Pole) 2016-09-23 01:28:13 -04:00
Al
996a38d017 [places] adding probability distributions on added place components so can have West Indies, W.I. etc. 2016-09-22 17:45:14 -04:00
Al
373708b595 [openaddresses] replace name affixes (remove things like "city of"), prune duplicate names, remove numeric boundary names, cleanup boundary names, and add house number + postcode phrases where appropriate 2016-09-22 00:57:11 -04:00
Al
ca5bcba85e [osm] set -e so script errors out if anything fails and add --quiet to wget for I/O redirection purposes 2016-09-22 00:52:00 -04:00
Al
764a74fae4 [osm] overwrite downloaded files 2016-09-19 03:21:37 -04:00
Al
c3afcdfce5 [osm] expanding criteria for the buildings data set (buildlings with addr:housenumber, addr:housename, addr:street, or addr:postcode are useful) 2016-09-19 03:15:07 -04:00
Al
d667039397 [openaddresses] for configs with add_osm_boundaries=true, skip adding boundary fields from the OA file altogether when they're specified 2016-09-16 01:55:36 -04:00
Al
95cf6ad0fa [fix] default again 2016-09-16 01:11:59 -04:00
Al
d5a5104de9 [fix] default 2016-09-16 01:10:19 -04:00
Al
32ad1d7bd0 [fix] var name 2016-09-16 01:07:10 -04:00
Al
b618d1eaf2 [fix] var name 2016-09-16 01:02:47 -04:00
Al
1cec0570d6 [formatting] only using alias country insertions if the given country has not defined its own (e.g. look at Puerto Rico first, then use the US if there's nothing defined) 2016-09-15 11:45:46 -04:00
Al
9b250a9393 [openaddresses] adding zero-padding option for postcodes and using in Puerto Rico 2016-09-15 11:22:55 -04:00
Al
260aedc9d3 [geoplanet] Script to download GeoPlanet directly and load into a SQLite db for extracting the postal codes (admin areas need cleanupi in a number of countries, that's next) 2016-09-14 14:29:14 -04:00
Al
b70c140a00 [fix] casing in state abbreviaitons dictionary 2016-09-13 21:56:27 -04:00
Al
14c20091f4 [fix] abbreviations in hyphenated phrases like Saint-Germaine. Hyphenation should use the phrase length not the token length 2016-09-12 22:20:25 -04:00
Al
62a5f6002d [geonames] adding query method to GeoNamesDB 2016-09-12 16:42:42 -04:00
Al
e8408d39fd [fix] unzip_file checks status code 2016-09-12 16:42:02 -04:00
Al
23c9fbe3fb [wof] don't need to crawl for admin data now that whosonfirst-data is separate from venues, etc. 2016-09-12 13:52:49 -04:00
Al
6fd43a1f20 [wof] script to download all the WoF postal code repos and their dependencies 2016-09-12 04:03:41 -04:00
Al
d41905aa18 [wof] geven-based hierarchy crawler for WoF 2016-09-12 04:03:02 -04:00
Al
45338b43b5 [wof] adding basic WhosOnFirst client for interaction with S3, downloading by ID, etc. 2016-09-12 03:56:27 -04:00
Al
55e9ab1978 [places] adding world_region tag and adding the phrase West Indies with small random probability for English-speaking Caribbean nations. Ref: #113 2016-09-11 21:54:56 -04:00
Al
551cce8cb1 [fix] making a separate gazetteer for toponym abbreviations 2016-09-10 01:08:58 -04:00
Al
bcde9e2fe7 [fix] toponym abbreviations after country name, may want to use it 2016-09-10 00:49:31 -04:00
Al
bbc5131cb6 [fix] toponym abbreviations 2016-09-10 00:48:31 -04:00
Al
19a044f7f3 [fix] imports 2016-09-10 00:09:11 -04:00
Al
ae02b0769d [openaddresses] abbreviating boundary components for OpenAddresses 2016-09-10 00:04:11 -04:00
Al
604e898d65 [fix] using toponym_gazetteer in OSM boundary abbreviations 2016-09-10 00:02:59 -04:00
Al
5d26ab41e7 [openaddresses] removing OpenAddresses hacks now that upstream changes are merged 2016-09-09 09:40:45 -04:00
Al
9cdcd7f21a [fix] indentation 2016-09-09 08:59:50 -04:00
Al
a14202fc7a [fix] default value 2016-09-09 01:46:03 -04:00
Al
85ad3bf0f4 [formatting] allowing a non-default option for components that can be inserted between road and house number 2016-09-09 01:38:39 -04:00
Al
4c6bcda3b2 [fix] config 2016-09-08 15:21:19 -04:00
Al
d1e3c6a24a [openaddresses] adding Italy countrywide to a pre_release_downloads set so it can be used in libpostal without having been merged yet 2016-09-08 15:16:35 -04:00
Al
0edbe5a593 [formatting] don't allow insertions between house number and road name 2016-09-08 15:00:36 -04:00
Al
170e8d74d8 [fix] checking for components 2016-09-08 03:19:10 -04:00
Al
769a65b808 [openaddresses] adding place-only and place+postcode probability to OpenAddresses to capture more place names not in OSM as standalone queries 2016-09-08 03:17:21 -04:00
Al
6543afaaa3 [utils] checking for 200 from download_file 2016-09-07 14:13:42 -04:00
Al
317e4caca9 [fix] only percent quote the filename 2016-09-07 13:55:34 -04:00
Al
f061d4239b [fix] quote, not urlencode 2016-09-07 13:51:08 -04:00
Al
5c915855c9 [fix] urlencode 2016-09-07 13:48:39 -04:00
Al
62c8fa9048 [fix] encoding III 2016-09-07 13:39:38 -04:00
Al
6e5908385a [fix] encoding again 2016-09-07 13:34:04 -04:00
Al
3bc122f7a0 [fix] encoding 2016-09-07 13:28:18 -04:00
Al
cb3fe5273a [components] using gnis:class=Populated Place to map to city for the US when admin_level is not specified and the place key is not specified/not mapped 2016-09-07 11:49:44 -04:00
Al
ec24c4b6ac [boundaries] removing the place=borough mapping because it's used on the east coast and PA to mean city 2016-09-07 11:24:37 -04:00
Al
7e7ee7462a [fix] dutch house number formatting, strip spaces 2016-09-02 14:47:52 -04:00
Al
95384e5a2c [openaddresses adding hack for Honolulu until join function can handle null in OpenAddresses 2016-09-02 14:29:40 -04:00