Commit Graph

222 Commits

Author SHA1 Message Date
Al
776145cf8e [osm] Adding new option to control whether we drop non-city OSM boundary names that have the same name as the enclosed city 2016-07-25 01:24:13 -04:00
Al
1ccea09a92 [osm] Don't call components.normalize_place_names in OSM address formatting, only add place components population / 10000 + 1 times for the name tag itself, not loc_name, int_name, etc. 2016-07-25 01:16:27 -04:00
Al
3957aea430 [fix] add postal_code alias 2016-07-25 00:48:55 -04:00
Al
f0dea9cba1 [fix] No random_key for non-local languages 2016-07-25 00:16:22 -04:00
Al
b31d71bbc1 [fix] parens 2016-07-25 00:14:36 -04:00
Al
e5b84205bc [osm] Use int_name tag and add English boundary names even if only a raw name is available for the original place node 2016-07-25 00:13:21 -04:00
Al
b50cb0cdf9 [osm] add random variations of the containing components' names in building place training data. For places with small or unknown populations, use the default names of the containing components 2016-07-25 00:04:44 -04:00
Al
dbc5957fa6 [fix] reverting, random state abbreviations should be fine 2016-07-24 23:47:30 -04:00
Al
cf84b5727e [osm] always_use_full_names=True for encompassing boundaries on place queries 2016-07-24 23:21:14 -04:00
Al
0fa372f2c0 [fix] tags.get as nodes may not have type/id 2016-07-24 23:04:09 -04:00
Al
273f5ecf58 [fix] language defaults 2016-07-24 23:02:39 -04:00
Al
43e6f2433a [fix] use ISO3166-1:alpha2 2016-07-24 23:00:59 -04:00
Al
53906c4833 [fix] parens 2016-07-24 22:57:58 -04:00
Al
38b76701d8 [osm] Falling back on OSM country/languages if the point doesn't match the Quattroshapes geometry 2016-07-24 22:56:53 -04:00
Al
4b26962793 [osm] Don't return language from node_place_tags as the list of tags contains the various languages already 2016-07-24 22:17:42 -04:00
Al
87a47a825e [fix] var reference before assignment 2016-07-24 22:00:07 -04:00
Al
2a9185874a [fix] component index 2016-07-24 21:55:02 -04:00
Al
1158076154 [fix] default language suffix is '' 2016-07-24 21:34:59 -04:00
Al
60d4fd3102 [fix] another import 2016-07-24 21:31:52 -04:00
Al
648c016b05 [fix] import and return values 2016-07-24 21:30:53 -04:00
Al
09b77b52a6 [osm] Adding place training set. Every place, even nodes, in OSM will get population / 10000 + 1 simple place queries like city + state included in the training set, even if there are no OSM addresses for that city. Where postcodes are available, they'll also be added to the training examples 2016-07-24 20:09:56 -04:00
Al
9681d4dc8e [merge] 2016-07-22 18:55:55 -04:00
Al
c8e426a94d [osm] If sub-building tags are specified in OSM tags (e.g. addr:floor), only include them if the values are numeric 2016-07-22 18:47:31 -04:00
Al
9bece91bd5 [osm] When choosing a namespaced language, alias all namespaced tags, not just the addr:* tags 2016-07-22 14:56:07 -04:00
Al
9a6279d73b [fix] normalize building component tags, not regular tags 2016-07-22 14:54:18 -04:00
Al
b1b797171c [osm] Combining addr:block_number and addr:housenumber in Japan (randomly adds phrases for the 番号/bango system) 2016-07-22 14:52:16 -04:00
Al
8383d5bb12 [numex] Adding numeric expression spellout in the Python geodata module for generating training data 2016-07-21 17:04:57 -04:00
Al
72e647902d [fix] name 2016-07-21 17:04:57 -04:00
Al
9059c2af60 [addresses] Don't generate sub-building components at all if there's no house number 2016-07-21 17:04:57 -04:00
Al
420ceb6c38 [intersections] Only requiring a tag to share at least two ways 2016-07-21 17:04:57 -04:00
Al
202dc0c58a [fix] name 2016-07-21 17:04:57 -04:00
Al
89f6793243 [fix] args 2016-07-21 17:04:57 -04:00
Al
f7680e9b65 [fix] name 2016-07-21 17:04:57 -04:00
Al
baf8fbb381 [fix] import 2016-07-21 17:04:57 -04:00
Al
b4a70a9a56 [fix] import 2016-07-21 17:04:57 -04:00
Al
8aada7086f [intersections] intersections training data 2016-07-21 17:04:57 -04:00
Al
11d1acc3bc [parser] Sample chain store alternate names from the cross-language dictionary 2016-07-21 17:04:57 -04:00
Al
69e1c846ba [parser] Fixing config keys so OSM streets/venues get abbreviated. Selecting namespaced address fields in cases like Brussels or Hong Kong where everything is bilingual. Adding the ability to pass a known language into address component expansion 2016-07-21 17:04:57 -04:00
Al
a1421d4a68 [fix] floors 2016-07-21 17:04:57 -04:00
Al
2e4ba6e6cc [subdivisions/buildings] Adding subdivisions and buildings rtree to training data for getting building height, zone 2016-07-21 17:04:57 -04:00
Al
97d2bfb508 [osm] venue names 2016-07-21 17:04:57 -04:00
Al
a9959651f8 [fix] parent streets 2016-07-21 17:04:57 -04:00
Al
e4d84fac7e [parser/osm] Adding address sans name for venues probabilistically 2016-07-21 17:04:57 -04:00
Al
3d47d7d13c [fix] place components 2016-07-21 17:04:57 -04:00
Al
2db4105c71 [fix] import 2016-07-21 17:04:57 -04:00
Al
f4c7032842 [fix] field name 2016-07-21 17:04:57 -04:00
Al
57347899af [fix] filenames 2016-07-21 17:04:57 -04:00
Al
ed4db5150f [fix] coding=utf8 2016-07-21 17:04:57 -04:00
Al
840a83976b [parser/osm] OSM address formatter using the new component expansion 2016-07-21 17:04:57 -04:00
Al
e2a9a57269 [numex] Adding numeric expression spellout in the Python geodata module for generating training data 2016-06-24 16:10:36 -04:00