Commit Graph

3846 Commits

Author SHA1 Message Date
Al
7b75eb2243 [fix] script docs 2016-09-23 01:31:37 -04:00
Al
d66ea835b1 [fix] allowing latitude 90 for validation purposes (North Pole) 2016-09-23 01:28:13 -04:00
Al
996a38d017 [places] adding probability distributions on added place components so can have West Indies, W.I. etc. 2016-09-22 17:45:14 -04:00
Al
373708b595 [openaddresses] replace name affixes (remove things like "city of"), prune duplicate names, remove numeric boundary names, cleanup boundary names, and add house number + postcode phrases where appropriate 2016-09-22 00:57:11 -04:00
Al
ca5bcba85e [osm] set -e so script errors out if anything fails and add --quiet to wget for I/O redirection purposes 2016-09-22 00:52:00 -04:00
Al
c8cabb3dff [openaddresses] adding City of Oakville, ON 2016-09-22 00:46:09 -04:00
Al
320760f511 [openaddresses] adding Galveston, TX 2016-09-20 02:52:58 -04:00
Al
fa12d36923 [openaddresses] adding Marion County, OR 2016-09-19 23:49:14 -04:00
Al
eb0fe4a7ac [openaddresses] adding Warren County, KY 2016-09-19 23:44:26 -04:00
Al
764a74fae4 [osm] overwrite downloaded files 2016-09-19 03:21:37 -04:00
Al
c3afcdfce5 [osm] expanding criteria for the buildings data set (buildlings with addr:housenumber, addr:housename, addr:street, or addr:postcode are useful) 2016-09-19 03:15:07 -04:00
Al
c294326679 [openaddresses] adding newly adjusted sources from PA/MN 2016-09-17 00:01:14 -04:00
Al
953daa6920 [boundaries] updates to Taiwan boundaries 2016-09-16 19:20:43 -04:00
Al
d667039397 [openaddresses] for configs with add_osm_boundaries=true, skip adding boundary fields from the OA file altogether when they're specified 2016-09-16 01:55:36 -04:00
Al
95cf6ad0fa [fix] default again 2016-09-16 01:11:59 -04:00
Al
d5a5104de9 [fix] default 2016-09-16 01:10:19 -04:00
Al
32ad1d7bd0 [fix] var name 2016-09-16 01:07:10 -04:00
Al
b618d1eaf2 [fix] var name 2016-09-16 01:02:47 -04:00
Al
d43d14a34d [parser] adding state_district as one of the possible contexts for venue name (name + county could be fine as an address in some places) 2016-09-16 01:01:00 -04:00
Al
263a173d5e [boundaries] adding exceptions for a few cities in England that have designation=non_metropolitan_district 2016-09-16 00:58:54 -04:00
Al
fed3ab0741 [openaddresses] typo in Honolulu 2016-09-16 00:58:12 -04:00
Al
fe12445f55 [openaddresses] adding Elko County, NV 2016-09-15 15:38:12 -04:00
Al
c5977c2ad4 [openaddresses] adding Westchester county (NY postcodes FTW\!) 2016-09-15 15:37:51 -04:00
Al
2e4033f527 [openaddresses] use ISO 3166 alpha 2 code for PR most of the time (again mimics US state) 2016-09-15 15:36:52 -04:00
Al
0e55ad7e3a [boundaries] Changing admin_level=6 in Spain to state_district 2016-09-15 13:48:40 -04:00
Al
e963d06370 [formatting] Puerto Rico template insertions, put country before postcode most of the time (often written like a US state) 2016-09-15 13:18:25 -04:00
Al
22d5f85cc7 [openaddresses] adding Ketchikan Borough, AK, USA 2016-09-15 13:16:56 -04:00
Al
1cec0570d6 [formatting] only using alias country insertions if the given country has not defined its own (e.g. look at Puerto Rico first, then use the US if there's nothing defined) 2016-09-15 11:45:46 -04:00
Al
9b250a9393 [openaddresses] adding zero-padding option for postcodes and using in Puerto Rico 2016-09-15 11:22:55 -04:00
Al
260aedc9d3 [geoplanet] Script to download GeoPlanet directly and load into a SQLite db for extracting the postal codes (admin areas need cleanupi in a number of countries, that's next) 2016-09-14 14:29:14 -04:00
Al
d9a81f767c [boundaries] mapping sub-regiao to state_district in Portugal 2016-09-14 13:36:52 -04:00
Al
b70c140a00 [fix] casing in state abbreviaitons dictionary 2016-09-13 21:56:27 -04:00
Al
c9c1d912b0 [dictionaries] removing spaces in the Japanese dictionaries 2016-09-12 22:21:27 -04:00
Al
14c20091f4 [fix] abbreviations in hyphenated phrases like Saint-Germaine. Hyphenation should use the phrase length not the token length 2016-09-12 22:20:25 -04:00
Al
0f8e7cd9dc [boundaries] adding exception for the direct-controlled municipalities in China (Shanghai, Beijing, Tianjin and Chongqing) to be treated as city instead of state. These will use the admin_centre properties 60% of the time, relation propreties 40% of the time 2016-09-12 18:46:08 -04:00
Al
5bd2fac514 [openaddresses] CITY => city_district in distrito-federal (Mexico City), add OSM boundaries to pick up Ciudad de México 2016-09-12 18:24:40 -04:00
Al
62a5f6002d [geonames] adding query method to GeoNamesDB 2016-09-12 16:42:42 -04:00
Al
e8408d39fd [fix] unzip_file checks status code 2016-09-12 16:42:02 -04:00
Al
23c9fbe3fb [wof] don't need to crawl for admin data now that whosonfirst-data is separate from venues, etc. 2016-09-12 13:52:49 -04:00
Al
2057536bd9 [requirements] adding boto3 and gevent to requirements.txt for geodata package 2016-09-12 04:09:15 -04:00
Al
6fd43a1f20 [wof] script to download all the WoF postal code repos and their dependencies 2016-09-12 04:03:41 -04:00
Al
d41905aa18 [wof] geven-based hierarchy crawler for WoF 2016-09-12 04:03:02 -04:00
Al
45338b43b5 [wof] adding basic WhosOnFirst client for interaction with S3, downloading by ID, etc. 2016-09-12 03:56:27 -04:00
Al
eaacf55f42 [boundaries] exceptions for city_district and barrios in Buenos Aires and Rosario in Argentina 2016-09-12 02:57:49 -04:00
Al
55e9ab1978 [places] adding world_region tag and adding the phrase West Indies with small random probability for English-speaking Caribbean nations. Ref: #113 2016-09-11 21:54:56 -04:00
Al
069e4c348c [openaddresses] adding a few US counties from the latest PR 2016-09-11 16:19:39 -04:00
Al
917493834d [dictionaries] removing R as an abbreviation for river, can re-add it in a street name only dictionary if needed, but don't want to abbreviate towns with River in the name simply to R 2016-09-10 16:19:31 -04:00
Al
551cce8cb1 [fix] making a separate gazetteer for toponym abbreviations 2016-09-10 01:08:58 -04:00
Al
bcde9e2fe7 [fix] toponym abbreviations after country name, may want to use it 2016-09-10 00:49:31 -04:00
Al
bbc5131cb6 [fix] toponym abbreviations 2016-09-10 00:48:31 -04:00