Al
|
a89d7f71d7
|
[fix] if component name can't be mapped, return None
|
2016-07-26 20:34:31 -04:00 |
|
Al
|
274f31b37e
|
[osm] map place=district to state_district
|
2016-07-26 20:30:47 -04:00 |
|
Al
|
614300d423
|
[fix] typo
|
2016-07-26 18:37:48 -04:00 |
|
Al
|
bdba0a4200
|
[osm] In the case of semicolon delimited names, choose one at random
|
2016-07-26 18:20:56 -04:00 |
|
Al
|
0c1b12b65c
|
[fix] Use local language with script e.g. ja_rm in place training data
|
2016-07-26 18:00:38 -04:00 |
|
Al
|
72c3723b43
|
[osm] Validate postcode with a regex for the given country code before sending it on to parse_osm_number_range (some postcodes can also look like ranges, e.g. 83-101, so validate for the given country)
|
2016-07-26 17:45:23 -04:00 |
|
Al
|
1ef57ee7d2
|
[i18n/postcodes] Fetch postcode regexes from the data source used by Google's libaddressinput. Requests are cached for the lifetime of the running program (e.g. while generating parser data), so the regexes will get updated over time.
|
2016-07-26 17:42:50 -04:00 |
|
Al
|
50b5eb7ea4
|
[fix] make place_tags iterable in the null case
|
2016-07-26 03:16:26 -04:00 |
|
Al
|
5f0a3bce9c
|
[fix] None tuple length if no matches can be found
|
2016-07-26 02:58:21 -04:00 |
|
Al
|
5448d9bff2
|
[fix] using UNKNOWN_LANGUAGE instead of None so it can be treated as a string downstream
|
2016-07-26 02:55:04 -04:00 |
|
Al
|
8b24072566
|
[fix] reference before assignment
|
2016-07-26 02:52:58 -04:00 |
|
Al
|
6c3128edee
|
[fix] adding country_region to places config
|
2016-07-26 02:51:05 -04:00 |
|
Al
|
890f691d7d
|
[fix] import
|
2016-07-26 02:47:03 -04:00 |
|
Al
|
eff884986e
|
[osm] Place component dropout in place training data
|
2016-07-26 02:43:05 -04:00 |
|
Al
|
5a9e5ef8dd
|
[fix] iteration
|
2016-07-26 02:33:31 -04:00 |
|
Al
|
7b25d1edfb
|
[fix] config updates for contained_by overrides in OSM admin components
|
2016-07-25 17:10:15 -04:00 |
|
Al
|
4e58a7c12e
|
[test] Adding test for intersection phrases and fixing a test failure for the Czech config
|
2016-07-25 03:19:52 -04:00 |
|
Al
|
ffece04855
|
[osm] Place training data from OSM script
|
2016-07-25 02:45:16 -04:00 |
|
Al
|
4d94495d45
|
[osm] place training data comes from both admin nodes and the polygons in the OSM index (using representative_point)
|
2016-07-25 02:39:53 -04:00 |
|
Al
|
024d47a8a5
|
[osm] Adding admin_center handling to OSM address components
|
2016-07-25 02:14:51 -04:00 |
|
Al
|
776145cf8e
|
[osm] Adding new option to control whether we drop non-city OSM boundary names that have the same name as the enclosed city
|
2016-07-25 01:24:13 -04:00 |
|
Al
|
1ccea09a92
|
[osm] Don't call components.normalize_place_names in OSM address formatting. Only add place components population / 10000 + 1 times for the name tag itself, not for loc_name, int_name, etc.
|
2016-07-25 01:16:27 -04:00 |
|
Al
|
3957aea430
|
[fix] add postal_code alias
|
2016-07-25 00:48:55 -04:00 |
|
Al
|
ee795211bc
|
[polygons] Include designation in OSM admin properties (for UK)
|
2016-07-25 00:27:27 -04:00 |
|
Al
|
f0dea9cba1
|
[fix] No random_key for non-local languages
|
2016-07-25 00:16:22 -04:00 |
|
Al
|
b31d71bbc1
|
[fix] parens
|
2016-07-25 00:14:36 -04:00 |
|
Al
|
e5b84205bc
|
[osm] Use int_name tag and add English boundary names even if only a raw name is available for the original place node
|
2016-07-25 00:13:21 -04:00 |
|
Al
|
b50cb0cdf9
|
[osm] add random variations of the containing components' names in building place training data. For places with small or unknown populations, use the default names of the containing components
|
2016-07-25 00:04:44 -04:00 |
|
Al
|
dbc5957fa6
|
[fix] reverting, random state abbreviations should be fine
|
2016-07-24 23:47:30 -04:00 |
|
Al
|
cf84b5727e
|
[osm] always_use_full_names=True for encompassing boundaries on place queries
|
2016-07-24 23:21:14 -04:00 |
|
Al
|
0fa372f2c0
|
[fix] tags.get as nodes may not have type/id
|
2016-07-24 23:04:09 -04:00 |
|
Al
|
273f5ecf58
|
[fix] language defaults
|
2016-07-24 23:02:39 -04:00 |
|
Al
|
43e6f2433a
|
[fix] use ISO3166-1:alpha2
|
2016-07-24 23:00:59 -04:00 |
|
Al
|
53906c4833
|
[fix] parens
|
2016-07-24 22:57:58 -04:00 |
|
Al
|
38b76701d8
|
[osm] Falling back on OSM country/languages if the point doesn't match the Quattroshapes geometry
|
2016-07-24 22:56:53 -04:00 |
|
Al
|
4b26962793
|
[osm] Don't return language from node_place_tags as the list of tags contains the various languages already
|
2016-07-24 22:17:42 -04:00 |
|
Al
|
87a47a825e
|
[fix] var reference before assignment
|
2016-07-24 22:00:07 -04:00 |
|
Al
|
696448981c
|
[fix] var name
|
2016-07-24 21:58:56 -04:00 |
|
Al
|
bfb89adaab
|
[osm] use containing ids in component mapping
|
2016-07-24 21:57:04 -04:00 |
|
Al
|
2a9185874a
|
[fix] component index
|
2016-07-24 21:55:02 -04:00 |
|
Al
|
1158076154
|
[fix] default language suffix is ''
|
2016-07-24 21:34:59 -04:00 |
|
Al
|
60d4fd3102
|
[fix] another import
|
2016-07-24 21:31:52 -04:00 |
|
Al
|
648c016b05
|
[fix] import and return values
|
2016-07-24 21:30:53 -04:00 |
|
Al
|
09b77b52a6
|
[osm] Adding place training set. Every place, even nodes, in OSM will get population / 10000 + 1 simple place queries like city + state included in the training set, even if there are no OSM addresses for that city. Where postcodes are available, they'll also be added to the training examples
|
2016-07-24 20:09:56 -04:00 |
|
Al
|
39c193d52d
|
[osm] Fixing parse_osm_number_range. Only treat it as a range if the number on the right is greater, make letter range parsing optional
|
2016-07-24 19:49:20 -04:00 |
|
Al
|
4151ce7919
|
[osm] Adding rail stations to venues data set if they have a street address and a Wikipedia
|
2016-07-24 14:13:38 -04:00 |
|
Al
|
75d9c31395
|
[text] Adding NORMALIZE_STRING_COMPOSE constant in pynormalize.c
|
2016-07-24 03:37:43 -04:00 |
|
Al
|
7b3f4e9175
|
[text] Adding utils.py for is_numeric/is_numeric_strict
|
2016-07-24 03:37:11 -04:00 |
|
Al
|
a620cae6e0
|
[fix] var
|
2016-07-23 15:45:07 -04:00 |
|
Al
|
487d589531
|
[fix] remove var
|
2016-07-23 15:17:47 -04:00 |
|
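
Two of the commits above (72c3723b43 and 39c193d52d) describe an ordering rule: check whether a value is a valid postcode for the country before trying to read it as a house-number range, and only treat "a-b" as a range when the right-hand number is greater. The following is a minimal sketch of that idea, not the repository's code. The function names, the two hand-written postcode patterns, and the choice to expand a range into a list of numbers are all assumptions made for illustration; the commits say the real patterns are fetched from the data source used by libaddressinput.

```python
import re

# Hypothetical, hand-written patterns for illustration only. The commits
# above fetch the real regexes from libaddressinput's data source.
POSTCODE_PATTERNS = {
    "pl": re.compile(r"\d{2}-\d{3}"),        # Polish format, e.g. 83-101
    "us": re.compile(r"\d{5}(?:-\d{4})?"),   # US ZIP or ZIP+4
}

RANGE_PATTERN = re.compile(r"(\d+)\s*-\s*(\d+)")


def is_valid_postcode(country, value):
    """Return True if value fully matches the country's postcode pattern."""
    pattern = POSTCODE_PATTERNS.get(country)
    return bool(pattern and pattern.fullmatch(value.strip()))


def expand_number_range(value):
    """Expand "a-b" into its numbers only if b > a (assumed behavior).

    Anything else is returned unchanged as a single-item list.
    """
    match = RANGE_PATTERN.fullmatch(value.strip())
    if not match:
        return [value]
    start, end = int(match.group(1)), int(match.group(2))
    if end <= start:
        # Right-hand number is not greater, so this is not a range.
        return [value]
    return [str(n) for n in range(start, end + 1)]


def parse_postcode_or_range(country, value):
    """Keep valid postcodes intact; otherwise try range expansion."""
    if is_valid_postcode(country, value):
        return [value]
    return expand_number_range(value)
```

With these assumed patterns, `parse_postcode_or_range("pl", "83-101")` keeps the value as one postcode, while the same string under `"us"` fails postcode validation and falls through to range expansion.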