Commit Graph

108 Commits

Author SHA1 Message Date
Al
afbb79b81d [osm/parser] Making a much lower probability of generating sub-building components for named venues (usually on the ground floor, etc.) 2016-07-31 20:40:44 -04:00
Al
0f3c4276b4 [fix] args 2016-07-31 19:53:39 -04:00
Al
3871869d4b [osm] Check that OSM venue names contain at least one word-like token 2016-07-31 19:50:45 -04:00
Al
0bdcae252f [fix] building tag updates 2016-07-31 18:43:55 -04:00
Al
f8e9d39e12 [places] Implementing population-based place components in both place and address component expansion 2016-07-30 19:15:03 -04:00
Al
5bfc29d3f6 [osm/places] Using num_references / 2 for non-default languages and min_references / 2 for alternate name tags 2016-07-30 12:46:54 -04:00
Al
9dc52ea3c4 [osm] Add more English + non-local language names for places in OSM 2016-07-29 10:31:26 -04:00
Al
ed0b867c13 [osm] For formatting places from the polygon index, use centroid if representative_point fails 2016-07-29 07:13:41 -04:00
Al
f38bb151e2 [fix] var name 2016-07-28 23:53:55 -04:00
Al
854e6d901f [osm] Add CLDR country before dropout 2016-07-28 14:41:14 -04:00
Al
bebb33fe64 [osm] Include CLDR country even if the place didn't match simplified OSM polygons 2016-07-28 14:11:31 -04:00
Al
ea1226082e [fix] wrong instance 2016-07-28 02:56:17 -04:00
Al
fc118acd90 [fix] language None for ambiguous case 2016-07-28 02:48:45 -04:00
Al
db51cc91c2 [fix] property 2016-07-28 02:41:26 -04:00
Al
543048bc26 [osm] use CLDR country names with random probability 2016-07-28 02:37:12 -04:00
Al
d276611b9c [fix] poly.context 2016-07-28 01:46:12 -04:00
Al
4cc49b7ca4 [fix] typo 2016-07-27 12:48:35 -04:00
Al
9e61b9409f [osm] For componens at or below the city level that are the admin_center of their smallest containing boundary with the same name, use the boundary's component name instead of the point's 2016-07-27 12:46:43 -04:00
Al
ad4da98bd7 [fix] lowercase language code 2016-07-27 11:51:17 -04:00
Al
862c1b677e [fix] minimum of 5 references for unknown populations 2016-07-27 00:31:31 -04:00
Al
985ea79e02 [fix] cap the number of population-based references 2016-07-26 22:38:41 -04:00
Al
9a95c4c82f [fix] typo 2016-07-26 21:04:10 -04:00
Al
51f9d06a85 [fix] for commas in OSM place names, pick the first 2016-07-26 21:00:28 -04:00
Al
a89d7f71d7 [fix] if component name can't be mapped, return None 2016-07-26 20:34:31 -04:00
Al
614300d423 [fix] typo 2016-07-26 18:37:48 -04:00
Al
bdba0a4200 [osm] In the case of semicolon delimited names, choose one at random 2016-07-26 18:20:56 -04:00
Al
0c1b12b65c [fix] Use local language with script e.g. ja_rm in place training data 2016-07-26 18:00:38 -04:00
Al
72c3723b43 [osm] Validate postcode with a regex for the given country code before sending on to parser_osm_number_range (some postcodes can also look like ranges e.g. 83-101 so validate for the given country) 2016-07-26 17:45:23 -04:00
Al
50b5eb7ea4 [fix] make place_tags iterable in the null case 2016-07-26 03:16:26 -04:00
Al
5f0a3bce9c [fix] None tuple length if no matches can be found 2016-07-26 02:58:21 -04:00
Al
5448d9bff2 [fix] using UNKNOWN_LANGUAGE instead of None so it can be treated as a string downstream 2016-07-26 02:55:04 -04:00
Al
8b24072566 [fix] reference before assignment 2016-07-26 02:52:58 -04:00
Al
890f691d7d [fix] import 2016-07-26 02:47:03 -04:00
Al
eff884986e [osm] Place component dropout in place training data 2016-07-26 02:43:05 -04:00
Al
5a9e5ef8dd [fix] iteration 2016-07-26 02:33:31 -04:00
Al
4d94495d45 [osm] place training data comes from both admin nodes and the polygons in the OSM index (using representative_point) 2016-07-25 02:39:53 -04:00
Al
776145cf8e [osm] Adding new option to control whether we drop non-city OSM boundary names that have the same name as the enclosed city 2016-07-25 01:24:13 -04:00
Al
1ccea09a92 [osm] Don't call components.normalize_place_names in OSM address formatting, only add place components population / 10000 + 1 times for the name tag itself, not loc_name, int_name, etc. 2016-07-25 01:16:27 -04:00
Al
3957aea430 [fix] add postal_code alias 2016-07-25 00:48:55 -04:00
Al
f0dea9cba1 [fix] No random_key for non-local languages 2016-07-25 00:16:22 -04:00
Al
b31d71bbc1 [fix] parens 2016-07-25 00:14:36 -04:00
Al
e5b84205bc [osm] Use int_name tag and add English boundary names even if only a raw name is available for the original place node 2016-07-25 00:13:21 -04:00
Al
b50cb0cdf9 [osm] add random variations of the containing components' names in building place training data. For places with small or unknown populations, use the default names of the containing components 2016-07-25 00:04:44 -04:00
Al
dbc5957fa6 [fix] reverting, random state abbreviations should be fine 2016-07-24 23:47:30 -04:00
Al
cf84b5727e [osm] always_use_full_names=True for encompassing boundaries on place queries 2016-07-24 23:21:14 -04:00
Al
0fa372f2c0 [fix] tags.get as nodes may not have type/id 2016-07-24 23:04:09 -04:00
Al
273f5ecf58 [fix] language defaults 2016-07-24 23:02:39 -04:00
Al
43e6f2433a [fix] use ISO3166-1:alpha2 2016-07-24 23:00:59 -04:00
Al
53906c4833 [fix] parens 2016-07-24 22:57:58 -04:00
Al
38b76701d8 [osm] Falling back on OSM country/languages if the point doesn't match the Quattroshapes geometry 2016-07-24 22:56:53 -04:00