Commit Graph

485 Commits

Author SHA1 Message Date
Al
b41ba7374b [intersections] intersections training data, using a Cartesian product of all names in the same language, including something like tiger:name_base 2016-08-18 01:19:14 -04:00
Al
701bcb1d79 [intersections] Using name cleanup on intersections, including tiger:name_base which sometimes has semicolon delimiters as well 2016-08-17 18:47:07 -04:00
Al
145af9331e [osm] build OSM training data for intersections using the JSON output from intersections.py rather than having to compute each time 2016-08-17 18:11:55 -04:00
Al
a3ae1eb330 [intersections] Adding a read classmethod to intersections to read the intermediate JSON file 2016-08-17 15:29:59 -04:00
Al
96c753e8c6 [fix] adding logging on new intersections script 2016-08-16 23:55:22 -04:00
Al
5b172ad2d7 [intersections] Caching intersection creation in an intermediate script to save time diagnosing issues downstream 2016-08-16 23:52:58 -04:00
Al
7ff0cb2704 [fix] name and a few things for intersections data 2016-08-15 21:26:54 -04:00
Al
7ab6af4335 [fix] bounds 2016-08-15 12:01:22 -04:00
Al
060d3a1f86 [fix] var name 2016-08-15 11:18:00 -04:00
Al
29fc198aba [osm] giving parse_osm_number_range a parameter for max range and setting it to 1000 for postal codes e.g. for major cities that may have several hundred postal codes 2016-08-15 10:34:24 -04:00
Al
637baad629 [osm] Adding at least min_references entries for every selected postcode 2016-08-15 10:30:28 -04:00
Al
aa6b9cd858 [fix] var name for place tags coming from the admin rtree 2016-08-15 10:25:19 -04:00
Al
bc8acb196c [osm] Pulling valid postal codes out into a method 2016-08-13 01:49:26 -04:00
Al
b993e9a163 [fix] add Japanese-language variant if metro station is added 2016-08-06 21:17:14 -04:00
Al
39bd562d04 [addresses] only set language if we needed it for Japanese house_numbers 2016-08-06 21:06:01 -04:00
Al
e68fee7c68 [fix] null check 2016-08-06 20:39:28 -04:00
Al
e35649f09d [fix] import 2016-08-06 20:01:38 -04:00
Al
374c46ada5 [fix] metro station properties 2016-08-06 19:56:13 -04:00
Al
0edfbe0d61 [osm] Adding metro stations index to training data options 2016-08-06 19:52:21 -04:00
Al
195278cfea [osm] Reverse geocoding to metro station only for addresses in Japan 2016-08-06 19:50:18 -04:00
Al
964728a02d [fix] block phrases for Japanese and namespaced language handling in case Romaji is chosen before normalization 2016-08-06 14:50:39 -04:00
Al
445e8082c8 [addresses] Adding per-country overrides for address component dependencies 2016-08-06 02:36:47 -04:00
Al
813f29f299 [osm] Removing the call to normalize_place_names in place data formatting as we should be able to trust the places more than the addresses 2016-08-02 16:29:34 -04:00
Al
c40ad99ec7 [osm] removing postcode phrase from place training data and adding CLDR countries only after all the other normalizations 2016-08-02 14:52:12 -04:00
Al
5117fb21d3 [fix] access 2016-08-02 03:20:42 -04:00
Al
bd780d3424 [fix] typo 2016-08-02 03:19:22 -04:00
Al
c74d883344 [fix] unindent 2016-08-02 03:17:42 -04:00
Al
f29d043544 [places] Using all of the ideas that apply to places from address formatting for the places-only data set 2016-08-02 03:16:08 -04:00
Al
a1f0c1a3c9 [fix] import 2016-08-02 01:50:17 -04:00
Al
4c8b662648 [fix] block numbers 2016-08-01 14:36:28 -04:00
Al
1fb8185b75 [osm/boundaries] Allowing OSM entities to map to NULL 2016-08-01 00:52:58 -04:00
Al
2faffc81e7 [fix] import 2016-08-01 00:06:47 -04:00
Al
afbb79b81d [osm/parser] Making a much lower probability of generating sub-building components for named venues (usually on the ground floor, etc.) 2016-07-31 20:40:44 -04:00
Al
0f3c4276b4 [fix] args 2016-07-31 19:53:39 -04:00
Al
3871869d4b [osm] Check that OSM venue names contain at least one word-like token 2016-07-31 19:50:45 -04:00
Al
0bdcae252f [fix] building tag updates 2016-07-31 18:43:55 -04:00
Al
f8e9d39e12 [places] Implementing population-based place components in both place and address component expansion 2016-07-30 19:15:03 -04:00
Al
5bfc29d3f6 [osm/places] Using num_references / 2 for non-default languages and min_references / 2 for alternate name tags 2016-07-30 12:46:54 -04:00
Al
9dc52ea3c4 [osm] Add more English + non-local language names for places in OSM 2016-07-29 10:31:26 -04:00
Al
ed0b867c13 [osm] For formatting places from the polygon index, use centroid if representative_point fails 2016-07-29 07:13:41 -04:00
Al
f38bb151e2 [fix] var name 2016-07-28 23:53:55 -04:00
Al
854e6d901f [osm] Add CLDR country before dropout 2016-07-28 14:41:14 -04:00
Al
bebb33fe64 [osm] Include CLDR country even if the place didn't match simplified OSM polygons 2016-07-28 14:11:31 -04:00
Al
ea1226082e [fix] wrong instance 2016-07-28 02:56:17 -04:00
Al
fc118acd90 [fix] language None for ambiguous case 2016-07-28 02:48:45 -04:00
Al
db51cc91c2 [fix] property 2016-07-28 02:41:26 -04:00
Al
543048bc26 [osm] use CLDR country names with random probability 2016-07-28 02:37:12 -04:00
Al
d276611b9c [fix] poly.context 2016-07-28 01:46:12 -04:00
Al
4cc49b7ca4 [fix] typo 2016-07-27 12:48:35 -04:00
Al
9e61b9409f [osm] For components at or below the city level that are the admin_center of their smallest containing boundary with the same name, use the boundary's component name instead of the point's 2016-07-27 12:46:43 -04:00