Commit Graph

618 Commits

Author SHA1 Message Date
Al
374c46ada5 [fix] metro station properties 2016-08-06 19:56:13 -04:00
Al
0edfbe0d61 [osm] Adding metro stations index to training data options 2016-08-06 19:52:21 -04:00
Al
195278cfea [osm] Reverse geocoding to metro station only for addresses in Japan 2016-08-06 19:50:18 -04:00
Al
964728a02d [fix] block phrases for Japanese and namespaced language handling in case Romaji is chosen before normalization 2016-08-06 14:50:39 -04:00
Al
445e8082c8 [addresses] Adding per-country overrides for address component dependencies 2016-08-06 02:36:47 -04:00
Al
813f29f299 [osm] Removing the call to normalize_place_names in place data formatting as we should be able to trust the places more than the addresses 2016-08-02 16:29:34 -04:00
Al
c40ad99ec7 [osm] removing postcode phrase from place training data and adding CLDR countries only after all the other normalizations 2016-08-02 14:52:12 -04:00
Al
5117fb21d3 [fix] access 2016-08-02 03:20:42 -04:00
Al
bd780d3424 [fix] typo 2016-08-02 03:19:22 -04:00
Al
c74d883344 [fix] unindent 2016-08-02 03:17:42 -04:00
Al
f29d043544 [places] Using all of the ideas that apply to places from address formatting for the places-only data set 2016-08-02 03:16:08 -04:00
Al
a1f0c1a3c9 [fix] import 2016-08-02 01:50:17 -04:00
Al
4c8b662648 [fix] block numbers 2016-08-01 14:36:28 -04:00
Al
1fb8185b75 [osm/boundaries] Allowing OSM entities to map to NULL 2016-08-01 00:52:58 -04:00
Al
2faffc81e7 [fix] import 2016-08-01 00:06:47 -04:00
Al
afbb79b81d [osm/parser] Making a much lower probability of generating sub-building components for named venues (usually on the ground floor, etc.) 2016-07-31 20:40:44 -04:00
Al
0f3c4276b4 [fix] args 2016-07-31 19:53:39 -04:00
Al
3871869d4b [osm] Check that OSM venue names contain at least one word-like token 2016-07-31 19:50:45 -04:00
Al
0bdcae252f [fix] building tag updates 2016-07-31 18:43:55 -04:00
Al
f8e9d39e12 [places] Implementing population-based place components in both place and address component expansion 2016-07-30 19:15:03 -04:00
Al
5bfc29d3f6 [osm/places] Using num_references / 2 for non-default languages and min_references / 2 for alternate name tags 2016-07-30 12:46:54 -04:00
Al
9dc52ea3c4 [osm] Add more English + non-local language names for places in OSM 2016-07-29 10:31:26 -04:00
Al
ed0b867c13 [osm] For formatting places from the polygon index, use centroid if representative_point fails 2016-07-29 07:13:41 -04:00
Al
f38bb151e2 [fix] var name 2016-07-28 23:53:55 -04:00
Al
854e6d901f [osm] Add CLDR country before dropout 2016-07-28 14:41:14 -04:00
Al
bebb33fe64 [osm] Include CLDR country even if the place didn't match simplified OSM polygons 2016-07-28 14:11:31 -04:00
Al
ea1226082e [fix] wrong instance 2016-07-28 02:56:17 -04:00
Al
fc118acd90 [fix] language None for ambiguous case 2016-07-28 02:48:45 -04:00
Al
db51cc91c2 [fix] property 2016-07-28 02:41:26 -04:00
Al
543048bc26 [osm] use CLDR country names with random probability 2016-07-28 02:37:12 -04:00
Al
d276611b9c [fix] poly.context 2016-07-28 01:46:12 -04:00
Al
4cc49b7ca4 [fix] typo 2016-07-27 12:48:35 -04:00
Al
9e61b9409f [osm] For components at or below the city level that are the admin_center of their smallest containing boundary with the same name, use the boundary's component name instead of the point's 2016-07-27 12:46:43 -04:00
Al
ad4da98bd7 [fix] lowercase language code 2016-07-27 11:51:17 -04:00
Al
862c1b677e [fix] minimum of 5 references for unknown populations 2016-07-27 00:31:31 -04:00
Al
985ea79e02 [fix] cap the number of population-based references 2016-07-26 22:38:41 -04:00
Al
9a95c4c82f [fix] typo 2016-07-26 21:04:10 -04:00
Al
51f9d06a85 [fix] for commas in OSM place names, pick the first 2016-07-26 21:00:28 -04:00
Al
da7a5e46c7 [osm] Zero fill number ranges like 01234-01240 2016-07-26 20:53:39 -04:00
Al
a89d7f71d7 [fix] if component name can't be mapped, return None 2016-07-26 20:34:31 -04:00
Al
274f31b37e [osm] map place=district to state_district 2016-07-26 20:30:47 -04:00
Al
614300d423 [fix] typo 2016-07-26 18:37:48 -04:00
Al
bdba0a4200 [osm] In the case of semicolon delimited names, choose one at random 2016-07-26 18:20:56 -04:00
Al
0c1b12b65c [fix] Use local language with script e.g. ja_rm in place training data 2016-07-26 18:00:38 -04:00
Al
72c3723b43 [osm] Validate postcode with a regex for the given country code before sending on to parser_osm_number_range (some postcodes can also look like ranges e.g. 83-101 so validate for the given country) 2016-07-26 17:45:23 -04:00
Al
50b5eb7ea4 [fix] make place_tags iterable in the null case 2016-07-26 03:16:26 -04:00
Al
5f0a3bce9c [fix] None tuple length if no matches can be found 2016-07-26 02:58:21 -04:00
Al
5448d9bff2 [fix] using UNKNOWN_LANGUAGE instead of None so it can be treated as a string downstream 2016-07-26 02:55:04 -04:00
Al
8b24072566 [fix] reference before assignment 2016-07-26 02:52:58 -04:00
Al
890f691d7d [fix] import 2016-07-26 02:47:03 -04:00