Commit Graph

3410 Commits

Author SHA1 Message Date
Al
d9b70d3404 [fix] mapping the nodes for NYC boroughs to city_district 2016-07-27 12:22:50 -04:00
Al
ad4da98bd7 [fix] lowercase language code 2016-07-27 11:51:17 -04:00
Al
3f4c18ddb6 [fix] None case for names 2016-07-27 01:16:05 -04:00
Al
4e14926169 [osm] choosing random name for semicolons and first name for commas in OSM name components 2016-07-27 01:06:14 -04:00
Al
862c1b677e [fix] minimum of 5 references for unknown populations 2016-07-27 00:31:31 -04:00
Al
985ea79e02 [fix] cap the number of population-based references 2016-07-26 22:38:41 -04:00
Al
9a95c4c82f [fix] typo 2016-07-26 21:04:10 -04:00
Al
51f9d06a85 [fix] for commas in OSM place names, pick the first 2016-07-26 21:00:28 -04:00
Al
da7a5e46c7 [osm] Zero fill number ranges like 01234-01240 2016-07-26 20:53:39 -04:00
Al
a89d7f71d7 [fix] if component name can't be mapped, return None 2016-07-26 20:34:31 -04:00
Al
274f31b37e [osm] map place=district to state_district 2016-07-26 20:30:47 -04:00
Al
53cbb52cb2 [languages] Adding Tibetan language to regional languages for the Tibet region 2016-07-26 19:07:37 -04:00
Al
614300d423 [fix] typo 2016-07-26 18:37:48 -04:00
Al
bdba0a4200 [osm] In the case of semicolon delimited names, choose one at random 2016-07-26 18:20:56 -04:00
Al
0c1b12b65c [fix] Use local language with script e.g. ja_rm in place training data 2016-07-26 18:00:38 -04:00
Al
72c3723b43 [osm] Validate postcode with a regex for the given country code before sending on to parser_osm_number_range (some postcodes can also look like ranges e.g. 83-101 so validate for the given country) 2016-07-26 17:45:23 -04:00
Al
1ef57ee7d2 [i18n/postcodes] Fetching postcode regexes from the data source used by Google's libaddressinput, caches requests for the length of the running program (e.g. generating parser data, so the regexes will get updated over time). 2016-07-26 17:42:50 -04:00
Al
50b5eb7ea4 [fix] make place_tags iterable in the null case 2016-07-26 03:16:26 -04:00
Al
5f0a3bce9c [fix] None tuple length if no matches can be found 2016-07-26 02:58:21 -04:00
Al
5448d9bff2 [fix] using UNKNOWN_LANGUAGE instead of None so it can be treated as a string downstream 2016-07-26 02:55:04 -04:00
Al
8b24072566 [fix] reference before assignment 2016-07-26 02:52:58 -04:00
Al
6c3128edee [fix] adding country_region to places config 2016-07-26 02:51:05 -04:00
Al
890f691d7d [fix] import 2016-07-26 02:47:03 -04:00
Al
eff884986e [osm] Place component dropout in place training data 2016-07-26 02:43:05 -04:00
Al
5a9e5ef8dd [fix] iteration 2016-07-26 02:33:31 -04:00
Al
7b25d1edfb [fix] config updates for contained_by overrides in OSM admin components 2016-07-25 17:10:15 -04:00
Al
eae7a6a78c [osm/boundaries] extend admin overrides in the UK to Greater London which includes London and the City of London 2016-07-25 16:56:39 -04:00
Al
38e67f5013 [boundaries] More fun with mapping UK admin boundaries. Non-metroplitan counties and non-metropolitan districts map to state_district. admin_level=6 maps to state district except for London where it's the city minus City of London. admin_level=8 (e.g. Manchester) maps to city except in London where it maps to city_district. admin_level=10 is suburb unless designation=civil_parish, in which case it's treated as a city boundary (individual towns/villages may be city or suburb depending on their place tag). Just complicated enough to be valid UK law :-). 2016-07-25 16:02:00 -04:00
Al
6a8209dc98 [places] Adding country_region to places config, increasing importance of county in England outside of London, increasing importance of city globally 2016-07-25 15:09:37 -04:00
Al
4b67cf79f4 [boundaries/osm] Mapping regions of England to state 2016-07-25 15:02:22 -04:00
Al
4e58a7c12e [test] Adding test for intersection phrases and fixing a test failure for the Czech config 2016-07-25 03:19:52 -04:00
Al
ffece04855 [osm] Place training data from OSM script 2016-07-25 02:45:16 -04:00
Al
4d94495d45 [osm] place training data comes from both admin nodes and the polygons in the OSM index (using representative_point) 2016-07-25 02:39:53 -04:00
Al
024d47a8a5 [osm] Adding admin_center handling to OSM address components 2016-07-25 02:14:51 -04:00
Al
1058b17a61 [osm] Moving admin_center overrides to OSM parser config 2016-07-25 02:02:48 -04:00
Al
c9aa0bc913 [boundaries/osm] Use name:en most of the time for New Zealand and occasionally name 2016-07-25 01:53:43 -04:00
Al
776145cf8e [osm] Adding new option to control whether we drop non-city OSM boundary names that have the same name as the enclosed city 2016-07-25 01:24:13 -04:00
Al
1ccea09a92 [osm] Don't call components.normalize_place_names in OSM address formatting, only add place components population / 10000 + 1 times for the name tag itself, not loc_name, int_name, etc. 2016-07-25 01:16:27 -04:00
Al
3957aea430 [fix] add postal_code alias 2016-07-25 00:48:55 -04:00
Al
ee795211bc [polygons] Include designation in OSM admin properties (for UK) 2016-07-25 00:27:27 -04:00
Al
f0dea9cba1 [fix] No random_key for non-local languages 2016-07-25 00:16:22 -04:00
Al
b31d71bbc1 [fix] parens 2016-07-25 00:14:36 -04:00
Al
e5b84205bc [osm] Use int_name tag and add English boundary names even if only a raw name is available for the original place node 2016-07-25 00:13:21 -04:00
Al
b50cb0cdf9 [osm] add random variations of the containing components' names in building place training data. For places with small or unknown populations, use the default names of the containing components 2016-07-25 00:04:44 -04:00
Al
dbc5957fa6 [fix] reverting, random state abbreviations should be fine 2016-07-24 23:47:30 -04:00
Al
cf84b5727e [osm] always_use_full_names=True for encompassing boundaries on place queries 2016-07-24 23:21:14 -04:00
Al
0fa372f2c0 [fix] tags.get as nodes may not have type/id 2016-07-24 23:04:09 -04:00
Al
273f5ecf58 [fix] language defaults 2016-07-24 23:02:39 -04:00
Al
43e6f2433a [fix] use ISO3166-1:alpha2 2016-07-24 23:00:59 -04:00
Al
53906c4833 [fix] parens 2016-07-24 22:57:58 -04:00