Commit Graph

3542 Commits

Author SHA1 Message Date
Al
a3ae1eb330 [intersections] Adding a read classmethod to intersections to read the intermediate JSON file 2016-08-17 15:29:59 -04:00
Al
96c753e8c6 [fix] adding logging on new intersections script 2016-08-16 23:55:22 -04:00
Al
5b172ad2d7 [intersections] Caching intersection creation in an intermediate script to save time diagnosing issues downstream 2016-08-16 23:52:58 -04:00
Al
330edc2c93 [utils] cstring_array_get_phrase requires a char_array to be passed in so it doesn't have to do any memory allocation 2016-08-16 13:11:45 -04:00
Al
92e66fd60c [utils] string_next_hyphen_index 2016-08-16 12:49:52 -04:00
Al
7ff0cb2704 [fix] name and a few things for intersections data 2016-08-15 21:26:54 -04:00
Al
7ab6af4335 [fix] bounds 2016-08-15 12:01:22 -04:00
Al
060d3a1f86 [fix] var name 2016-08-15 11:18:00 -04:00
Al
29fc198aba [osm] giving parse_osm_number_range a parameter for max range and setting it to 1000 for postal codes e.g. for major cities that may have several hundred postal codes 2016-08-15 10:34:24 -04:00
Al
637baad629 [osm] Adding at least min_references entries for every selected postcode 2016-08-15 10:30:28 -04:00
Al
aa6b9cd858 [fix] var name for place tags coming from the admin rtree 2016-08-15 10:25:19 -04:00
Al
5cff7b85bd [geonames] Adding basic GeoNames admin mappings for all countries we have postal codes lists for so some form of training data can be created for postcodes not listed in OSM 2016-08-15 01:09:17 -04:00
Al
7f4e636fc5 [fix] accidentally had Vietnam country code switched with Virgin Islands 2016-08-14 18:43:24 -04:00
Al
8a5da5f860 [boundaries/osm] Reverting admin_level=10 back to city_district for India so it'll match the current training data, can revisit later 2016-08-13 22:51:42 -04:00
Al
bc8acb196c [osm] Pulling valid postal codes out into a method 2016-08-13 01:49:26 -04:00
Al
55895369b8 [boundaries] Using state again for UK countries (England, Scotland, Wales, Northern Ireland). country_region was created mostly for non-administrative regions of a country (usually admin_level=3 in OSM). The UK is a bit more complicated in that there are multiple non-sovereign countries, but it's probably not worth creating a different tag and different set of parameters just to have a distinct name for 1st level admin in the UK 2016-08-11 23:47:31 -04:00
Al
d51a6693ac [fix] reverting commit that was lumped in with geonames script 2016-08-11 21:49:29 -04:00
Al
74d042e3c7 [boundaries] For India, making admin_level 10 map to suburb rather than city_district 2016-08-11 21:47:10 -04:00
Al
29081a0699 [fix] adding English template insertions for the UK regardless of language 2016-08-11 21:32:54 -04:00
Al
22123b80ba [fix] refactoring geonames script a bit 2016-08-11 21:31:39 -04:00
Al
48755ec218 [boundaries] Adding regex replacements for boundary names such as Lyon 2e Arrondissement where putting Lyon is the OSM convention but we might sometimes want just 2e Arrondissement to appear in the training data next to Lyon 2016-08-11 13:09:24 -04:00
Al
10a41309b8 [addresses] Increasing Romaji probability to 0.4 2016-08-06 21:27:32 -04:00
Al
b993e9a163 [fix] add Japanese-language variant if metro station is added 2016-08-06 21:17:14 -04:00
Al
39bd562d04 [addresses] only set language if we needed it for Japanese house_numbers 2016-08-06 21:06:01 -04:00
Al
cdd5a96346 [addresses] metro station can also be used for plain venues without a house number so we get more in the training set 2016-08-06 20:52:29 -04:00
Al
5ec752e887 [fix] order of ops 2016-08-06 20:43:13 -04:00
Al
e68fee7c68 [fix] null check 2016-08-06 20:39:28 -04:00
Al
3e34012e69 [fix] if the language is given already, use it as a suffix rather than choosing at random 2016-08-06 20:36:56 -04:00
Al
606c464db6 [fix] house number phrases 2016-08-06 20:11:32 -04:00
Al
e35649f09d [fix] import 2016-08-06 20:01:38 -04:00
Al
0e7cb2b06c [fix] var name II 2016-08-06 20:00:35 -04:00
Al
8d88820d30 [fix] var name 2016-08-06 19:59:53 -04:00
Al
374c46ada5 [fix] metro station properties 2016-08-06 19:56:13 -04:00
Al
0edfbe0d61 [osm] Adding metro stations index to training data options 2016-08-06 19:52:21 -04:00
Al
195278cfea [osm] Reverse geocoding to metro station only for addresess in Japan 2016-08-06 19:50:18 -04:00
Al
6ef54bcc6f [addresses] Adding metro stations to AddressComponents expansion 2016-08-06 19:36:57 -04:00
Al
da2985a4ae [places] Metro station dropout probabilities 2016-08-06 19:34:56 -04:00
Al
6ce882cb55 [addresses] Metro station component dependencies (road or house_number) 2016-08-06 19:34:39 -04:00
Al
668aa20996 [addresses] Metro station phrases for Japanese Romaji 2016-08-06 19:34:07 -04:00
Al
9cbbca5e47 [addresses] Metro station phrase for Japanese 2016-08-06 19:33:42 -04:00
Al
d59ab82701 [metro stations] Adding metro station phrase generator 2016-08-06 19:33:21 -04:00
Al
1e27ad1124 [metro stations] Adding metro station component to address formatter 2016-08-06 19:13:20 -04:00
Al
5cff119d25 [fix] command line arg 2016-08-06 18:36:27 -04:00
Al
406666362c [fix] command-line index creation 2016-08-06 18:36:01 -04:00
Al
7ddd553129 [fix] metro stations reverse geocoder 2016-08-06 18:30:54 -04:00
Al
5e44f6954b [metro stations] Adding metro stations reverse geocoder 2016-08-06 18:24:25 -04:00
Al
954bb08a8d [points] Fixes to point index 2016-08-06 18:23:30 -04:00
Al
964728a02d [fix] block phrases for Japanese and namespaced language handling in case Romaji is chosen before normalization 2016-08-06 14:50:39 -04:00
Al
684550ea7d [fix] only add house_number phrase to numeric inputs 2016-08-06 14:49:28 -04:00
Al
8b5d44e173 [fix] Japanese house numbers aren't without dependencies, just have different ones (road or suburb or city_district) 2016-08-06 03:38:44 -04:00