| Author | Commit | Message | Date |
|---|---|---|---|
| Al | db51cc91c2 | [fix] property | 2016-07-28 02:41:26 -04:00 |
| Al | 543048bc26 | [osm] use CLDR country names with random probability | 2016-07-28 02:37:12 -04:00 |
| Al | d276611b9c | [fix] poly.context | 2016-07-28 01:46:12 -04:00 |
| Al | 4cc49b7ca4 | [fix] typo | 2016-07-27 12:48:35 -04:00 |
| Al | 9e61b9409f | [osm] For components at or below the city level that are the admin_center of their smallest containing boundary with the same name, use the boundary's component name instead of the point's | 2016-07-27 12:46:43 -04:00 |
| Al | ad4da98bd7 | [fix] lowercase language code | 2016-07-27 11:51:17 -04:00 |
| Al | 862c1b677e | [fix] minimum of 5 references for unknown populations | 2016-07-27 00:31:31 -04:00 |
| Al | 985ea79e02 | [fix] cap the number of population-based references | 2016-07-26 22:38:41 -04:00 |
| Al | 9a95c4c82f | [fix] typo | 2016-07-26 21:04:10 -04:00 |
| Al | 51f9d06a85 | [fix] for commas in OSM place names, pick the first | 2016-07-26 21:00:28 -04:00 |
| Al | da7a5e46c7 | [osm] Zero fill number ranges like 01234-01240 | 2016-07-26 20:53:39 -04:00 |
| Al | a89d7f71d7 | [fix] if component name can't be mapped, return None | 2016-07-26 20:34:31 -04:00 |
| Al | 274f31b37e | [osm] map place=district to state_district | 2016-07-26 20:30:47 -04:00 |
| Al | 614300d423 | [fix] typo | 2016-07-26 18:37:48 -04:00 |
| Al | bdba0a4200 | [osm] In the case of semicolon delimited names, choose one at random | 2016-07-26 18:20:56 -04:00 |
| Al | 0c1b12b65c | [fix] Use local language with script e.g. ja_rm in place training data | 2016-07-26 18:00:38 -04:00 |
| Al | 72c3723b43 | [osm] Validate postcode with a regex for the given country code before sending on to parse_osm_number_range (some postcodes can also look like ranges e.g. 83-101 so validate for the given country) | 2016-07-26 17:45:23 -04:00 |
| Al | 50b5eb7ea4 | [fix] make place_tags iterable in the null case | 2016-07-26 03:16:26 -04:00 |
| Al | 5f0a3bce9c | [fix] None tuple length if no matches can be found | 2016-07-26 02:58:21 -04:00 |
| Al | 5448d9bff2 | [fix] using UNKNOWN_LANGUAGE instead of None so it can be treated as a string downstream | 2016-07-26 02:55:04 -04:00 |
| Al | 8b24072566 | [fix] reference before assignment | 2016-07-26 02:52:58 -04:00 |
| Al | 890f691d7d | [fix] import | 2016-07-26 02:47:03 -04:00 |
| Al | eff884986e | [osm] Place component dropout in place training data | 2016-07-26 02:43:05 -04:00 |
| Al | 5a9e5ef8dd | [fix] iteration | 2016-07-26 02:33:31 -04:00 |
| Al | 7b25d1edfb | [fix] config updates for contained_by overrides in OSM admin components | 2016-07-25 17:10:15 -04:00 |
| Al | ffece04855 | [osm] Place training data from OSM script | 2016-07-25 02:45:16 -04:00 |
| Al | 4d94495d45 | [osm] place training data comes from both admin nodes and the polygons in the OSM index (using representative_point) | 2016-07-25 02:39:53 -04:00 |
| Al | 776145cf8e | [osm] Adding new option to control whether we drop non-city OSM boundary names that have the same name as the enclosed city | 2016-07-25 01:24:13 -04:00 |
| Al | 1ccea09a92 | [osm] Don't call components.normalize_place_names in OSM address formatting, only add place components population / 10000 + 1 times for the name tag itself, not loc_name, int_name, etc. | 2016-07-25 01:16:27 -04:00 |
| Al | 3957aea430 | [fix] add postal_code alias | 2016-07-25 00:48:55 -04:00 |
| Al | f0dea9cba1 | [fix] No random_key for non-local languages | 2016-07-25 00:16:22 -04:00 |
| Al | b31d71bbc1 | [fix] parens | 2016-07-25 00:14:36 -04:00 |
| Al | e5b84205bc | [osm] Use int_name tag and add English boundary names even if only a raw name is available for the original place node | 2016-07-25 00:13:21 -04:00 |
| Al | b50cb0cdf9 | [osm] add random variations of the containing components' names in building place training data. For places with small or unknown populations, use the default names of the containing components | 2016-07-25 00:04:44 -04:00 |
| Al | dbc5957fa6 | [fix] reverting, random state abbreviations should be fine | 2016-07-24 23:47:30 -04:00 |
| Al | cf84b5727e | [osm] always_use_full_names=True for encompassing boundaries on place queries | 2016-07-24 23:21:14 -04:00 |
| Al | 0fa372f2c0 | [fix] tags.get as nodes may not have type/id | 2016-07-24 23:04:09 -04:00 |
| Al | 273f5ecf58 | [fix] language defaults | 2016-07-24 23:02:39 -04:00 |
| Al | 43e6f2433a | [fix] use ISO3166-1:alpha2 | 2016-07-24 23:00:59 -04:00 |
| Al | 53906c4833 | [fix] parens | 2016-07-24 22:57:58 -04:00 |
| Al | 38b76701d8 | [osm] Falling back on OSM country/languages if the point doesn't match the Quattroshapes geometry | 2016-07-24 22:56:53 -04:00 |
| Al | 4b26962793 | [osm] Don't return language from node_place_tags as the list of tags contains the various languages already | 2016-07-24 22:17:42 -04:00 |
| Al | 87a47a825e | [fix] var reference before assignment | 2016-07-24 22:00:07 -04:00 |
| Al | 2a9185874a | [fix] component index | 2016-07-24 21:55:02 -04:00 |
| Al | 1158076154 | [fix] default language suffix is '' | 2016-07-24 21:34:59 -04:00 |
| Al | 60d4fd3102 | [fix] another import | 2016-07-24 21:31:52 -04:00 |
| Al | 648c016b05 | [fix] import and return values | 2016-07-24 21:30:53 -04:00 |
| Al | 09b77b52a6 | [osm] Adding place training set. Every place, even nodes, in OSM will get population / 10000 + 1 simple place queries like city + state included in the training set, even if there are no OSM addresses for that city. Where postcodes are available, they'll also be added to the training examples | 2016-07-24 20:09:56 -04:00 |
| Al | 39c193d52d | [osm] Fixing parse_osm_number_range. Only treat it as a range if the number on the right is greater, make letter range parsing optional | 2016-07-24 19:49:20 -04:00 |
| Al | 4151ce7919 | [osm] Adding rail stations to venues data set if they have a street address and a Wikipedia | 2016-07-24 14:13:38 -04:00 |
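
Commits 39c193d52d and da7a5e46c7 describe two rules for numeric ranges: a value such as 01234-01240 is treated as a range only if the right-hand number is greater, and the expanded numbers are zero filled. The following is a minimal sketch of those two rules only, not the repository's parse_osm_number_range. The function name `expand_number_range` and its return shape are hypothetical, letter ranges are left out, and the per-country postcode validation mentioned in 72c3723b43 is assumed to happen before this is called.

```python
import re

# Hypothetical helper for illustration; not the project's actual function.
RANGE_RE = re.compile(r'\s*([0-9]+)\s*-\s*([0-9]+)\s*')


def expand_number_range(value):
    """Expand 'A-B' into zero-filled number strings, else return [value]."""
    match = RANGE_RE.fullmatch(value)
    if not match:
        # Not of the form digits-digits: leave the value untouched.
        return [value]

    left, right = match.groups()
    start, end = int(left), int(right)
    if end <= start:
        # Only a range if the number on the right is greater.
        return [value]

    # Pad to the width of the left-hand number so leading zeros survive.
    width = len(left)
    return [str(n).zfill(width) for n in range(start, end + 1)]


print(expand_number_range('01234-01236'))  # ['01234', '01235', '01236']
print(expand_number_range('10-2'))         # ['10-2']
```

Padding to the width of the left-hand number keeps values without leading zeros unchanged, since `zfill` never truncates.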