1620 Commits

Author SHA1 Message Date
Al
adeedef39e [fix] import 2016-10-01 01:19:36 -04:00
Al
5d7405b2fd [osm] country and postal code polygon readers 2016-10-01 01:11:35 -04:00
Al
c77e36deab [osm] Prevent user-defined lat/lon keys from overriding the lat/lon on the node 2016-10-01 00:38:13 -04:00
Al
64b419215d [geoplanet] adding place name fixes for ZA, MY, AT, CN, NZ, PH, and LT 2016-09-28 21:02:18 -04:00
Al
0b67c32857 [geoplanet] adding place name fixes for BG, ID, LU, CH, FI, CZ and HU 2016-09-25 18:43:42 -04:00
Al
cd9fe4eb7b [boundaries] Adding option to still check for global overrides but only if nothing else was found using admin_level, etc. Updating South Korea and adding this option to Luxembourg. 2016-09-24 15:36:03 -04:00
Al
5a50082055 [geoplanet] removing the Ankara local admins 2016-09-24 02:16:00 -04:00
Al
df0387ce91 [boundaries] removing Ankara metropolitan districts (they're in many other cities as well, would be better to change on the OSM level if needed) 2016-09-23 22:52:10 -04:00
Al
09ccec21f9 [geoplanet] fixing place classifications in GeoPlanet so they line up with OSM for places in GB, CA, PT, JP, US, DE, IN, FR, PL, SE, BR, RO, ES, TW, IT, NL and NO 2016-09-23 16:49:53 -04:00
Al
7b75eb2243 [fix] script docs 2016-09-23 01:31:37 -04:00
Al
d66ea835b1 [fix] allowing latitude 90 for validation purposes (North Pole) 2016-09-23 01:28:13 -04:00
Al
996a38d017 [places] adding probability distributions on added place components so can have West Indies, W.I. etc. 2016-09-22 17:45:14 -04:00
Al
373708b595 [openaddresses] replace name affixes (remove things like "city of"), prune duplicate names, remove numeric boundary names, cleanup boundary names, and add house number + postcode phrases where appropriate 2016-09-22 00:57:11 -04:00
Al
ca5bcba85e [osm] set -e so script errors out if anything fails and add --quiet to wget for I/O redirection purposes 2016-09-22 00:52:00 -04:00
Al
764a74fae4 [osm] overwrite downloaded files 2016-09-19 03:21:37 -04:00
Al
c3afcdfce5 [osm] expanding criteria for the buildings data set (buildings with addr:housenumber, addr:housename, addr:street, or addr:postcode are useful) 2016-09-19 03:15:07 -04:00
Al
d667039397 [openaddresses] for configs with add_osm_boundaries=true, skip adding boundary fields from the OA file altogether when they're specified 2016-09-16 01:55:36 -04:00
Al
95cf6ad0fa [fix] default again 2016-09-16 01:11:59 -04:00
Al
d5a5104de9 [fix] default 2016-09-16 01:10:19 -04:00
Al
32ad1d7bd0 [fix] var name 2016-09-16 01:07:10 -04:00
Al
b618d1eaf2 [fix] var name 2016-09-16 01:02:47 -04:00
Al
1cec0570d6 [formatting] only using alias country insertions if the given country has not defined its own (e.g. look at Puerto Rico first, then use the US if there's nothing defined) 2016-09-15 11:45:46 -04:00
Al
9b250a9393 [openaddresses] adding zero-padding option for postcodes and using in Puerto Rico 2016-09-15 11:22:55 -04:00
Al
260aedc9d3 [geoplanet] Script to download GeoPlanet directly and load into a SQLite db for extracting the postal codes (admin areas need cleanup in a number of countries, that's next) 2016-09-14 14:29:14 -04:00
Al
b70c140a00 [fix] casing in state abbreviations dictionary 2016-09-13 21:56:27 -04:00
Al
14c20091f4 [fix] abbreviations in hyphenated phrases like Saint-Germaine. Hyphenation should use the phrase length not the token length 2016-09-12 22:20:25 -04:00
Al
62a5f6002d [geonames] adding query method to GeoNamesDB 2016-09-12 16:42:42 -04:00
Al
e8408d39fd [fix] unzip_file checks status code 2016-09-12 16:42:02 -04:00
Al
23c9fbe3fb [wof] don't need to crawl for admin data now that whosonfirst-data is separate from venues, etc. 2016-09-12 13:52:49 -04:00
Al
2057536bd9 [requirements] adding boto3 and gevent to requirements.txt for geodata package 2016-09-12 04:09:15 -04:00
Al
6fd43a1f20 [wof] script to download all the WoF postal code repos and their dependencies 2016-09-12 04:03:41 -04:00
Al
d41905aa18 [wof] gevent-based hierarchy crawler for WoF 2016-09-12 04:03:02 -04:00
Al
45338b43b5 [wof] adding basic WhosOnFirst client for interaction with S3, downloading by ID, etc. 2016-09-12 03:56:27 -04:00
Al
55e9ab1978 [places] adding world_region tag and adding the phrase West Indies with small random probability for English-speaking Caribbean nations. Ref: #113 2016-09-11 21:54:56 -04:00
Al
551cce8cb1 [fix] making a separate gazetteer for toponym abbreviations 2016-09-10 01:08:58 -04:00
Al
bcde9e2fe7 [fix] toponym abbreviations after country name, may want to use it 2016-09-10 00:49:31 -04:00
Al
bbc5131cb6 [fix] toponym abbreviations 2016-09-10 00:48:31 -04:00
Al
19a044f7f3 [fix] imports 2016-09-10 00:09:11 -04:00
Al
ae02b0769d [openaddresses] abbreviating boundary components for OpenAddresses 2016-09-10 00:04:11 -04:00
Al
604e898d65 [fix] using toponym_gazetteer in OSM boundary abbreviations 2016-09-10 00:02:59 -04:00
Al
5d26ab41e7 [openaddresses] removing OpenAddresses hacks now that upstream changes are merged 2016-09-09 09:40:45 -04:00
Al
9cdcd7f21a [fix] indentation 2016-09-09 08:59:50 -04:00
Al
a14202fc7a [fix] default value 2016-09-09 01:46:03 -04:00
Al
85ad3bf0f4 [formatting] allowing a non-default option for components that can be inserted between road and house number 2016-09-09 01:38:39 -04:00
Al
4c6bcda3b2 [fix] config 2016-09-08 15:21:19 -04:00
Al
d1e3c6a24a [openaddresses] adding Italy countrywide to a pre_release_downloads set so it can be used in libpostal without having been merged yet 2016-09-08 15:16:35 -04:00
Al
0edbe5a593 [formatting] don't allow insertions between house number and road name 2016-09-08 15:00:36 -04:00
Al
170e8d74d8 [fix] checking for components 2016-09-08 03:19:10 -04:00
Al
769a65b808 [openaddresses] adding place-only and place+postcode probability to OpenAddresses to capture more place names not in OSM as standalone queries 2016-09-08 03:17:21 -04:00
Al
6543afaaa3 [utils] checking for 200 from download_file 2016-09-07 14:13:42 -04:00