Commit Graph

3390 Commits

Author SHA1 Message Date
Al
8b24072566 [fix] reference before assignment 2016-07-26 02:52:58 -04:00
Al
6c3128edee [fix] adding country_region to places config 2016-07-26 02:51:05 -04:00
Al
890f691d7d [fix] import 2016-07-26 02:47:03 -04:00
Al
eff884986e [osm] Place component dropout in place training data 2016-07-26 02:43:05 -04:00
Al
5a9e5ef8dd [fix] iteration 2016-07-26 02:33:31 -04:00
Al
7b25d1edfb [fix] config updates for contained_by overrides in OSM admin components 2016-07-25 17:10:15 -04:00
Al
eae7a6a78c [osm/boundaries] extend admin overrides in the UK to Greater London which includes London and the City of London 2016-07-25 16:56:39 -04:00
Al
38e67f5013 [boundaries] More fun with mapping UK admin boundaries. Non-metroplitan counties and non-metropolitan districts map to state_district. admin_level=6 maps to state district except for London where it's the city minus City of London. admin_level=8 (e.g. Manchester) maps to city except in London where it maps to city_district. admin_level=10 is suburb unless designation=civil_parish, in which case it's treated as a city boundary (individual towns/villages may be city or suburb depending on their place tag). Just complicated enough to be valid UK law :-). 2016-07-25 16:02:00 -04:00
Al
6a8209dc98 [places] Adding country_region to places config, increasing importance of county in England outside of London, increasing importance of city globally 2016-07-25 15:09:37 -04:00
Al
4b67cf79f4 [boundaries/osm] Mapping regions of England to state 2016-07-25 15:02:22 -04:00
Al
4e58a7c12e [test] Adding test for intersection phrases and fixing a test failure for the Czech config 2016-07-25 03:19:52 -04:00
Al
ffece04855 [osm] Place training data from OSM script 2016-07-25 02:45:16 -04:00
Al
4d94495d45 [osm] place training data comes from both admin nodes and the polygons in the OSM index (using representative_point) 2016-07-25 02:39:53 -04:00
Al
024d47a8a5 [osm] Adding admin_center handling to OSM address components 2016-07-25 02:14:51 -04:00
Al
1058b17a61 [osm] Moving admin_center overrides to OSM parser config 2016-07-25 02:02:48 -04:00
Al
c9aa0bc913 [boundaries/osm] Use name:en most of the time for New Zealand and occasionally name 2016-07-25 01:53:43 -04:00
Al
776145cf8e [osm] Adding new option to control whether we drop non-city OSM boundary names that have the same name as the enclosed city 2016-07-25 01:24:13 -04:00
Al
1ccea09a92 [osm] Don't call components.normalize_place_names in OSM address formatting, only add place components population / 10000 + 1 times for the name tag itself, not loc_name, int_name, etc. 2016-07-25 01:16:27 -04:00
Al
3957aea430 [fix] add postal_code alias 2016-07-25 00:48:55 -04:00
Al
ee795211bc [polygons] Include designation in OSM admin properties (for UK) 2016-07-25 00:27:27 -04:00
Al
f0dea9cba1 [fix] No random_key for non-local languages 2016-07-25 00:16:22 -04:00
Al
b31d71bbc1 [fix] parens 2016-07-25 00:14:36 -04:00
Al
e5b84205bc [osm] Use int_name tag and add English boundary names even if only a raw name is available for the original place node 2016-07-25 00:13:21 -04:00
Al
b50cb0cdf9 [osm] add random variations of the containing components' names in building place training data. For places with small or unknown populations, use the default names of the containing components 2016-07-25 00:04:44 -04:00
Al
dbc5957fa6 [fix] reverting, random state abbreviations should be fine 2016-07-24 23:47:30 -04:00
Al
cf84b5727e [osm] always_use_full_names=True for encompassing boundaries on place queries 2016-07-24 23:21:14 -04:00
Al
0fa372f2c0 [fix] tags.get as nodes may not have type/id 2016-07-24 23:04:09 -04:00
Al
273f5ecf58 [fix] language defaults 2016-07-24 23:02:39 -04:00
Al
43e6f2433a [fix] use ISO3166-1:alpha2 2016-07-24 23:00:59 -04:00
Al
53906c4833 [fix] parens 2016-07-24 22:57:58 -04:00
Al
38b76701d8 [osm] Falling back on OSM country/languages if the point doesn't match the Quattroshapes geometry 2016-07-24 22:56:53 -04:00
Al
4b26962793 [osm] Don't return language from node_place_tags as the list of tags contains the various languages already 2016-07-24 22:17:42 -04:00
Al
87a47a825e [fix] var reference before assignment 2016-07-24 22:00:07 -04:00
Al
696448981c [fix] var name 2016-07-24 21:58:56 -04:00
Al
bfb89adaab [osm] use containing ids in component mapping 2016-07-24 21:57:04 -04:00
Al
2a9185874a [fix] component index 2016-07-24 21:55:02 -04:00
Al
1158076154 [fix] default language suffix is '' 2016-07-24 21:34:59 -04:00
Al
60d4fd3102 [fix] another import 2016-07-24 21:31:52 -04:00
Al
648c016b05 [fix] import and return values 2016-07-24 21:30:53 -04:00
Al
09b77b52a6 [osm] Adding place training set. Every place, even nodes, in OSM will get population / 10000 + 1 simple place queries like city + state included in the training set, even if there are no OSM addresses for that city. Where postcodes are available, they'll also be added to the training examples 2016-07-24 20:09:56 -04:00
Al
39c193d52d [osm] Fixing parse_osm_number_range. Only treat it as a range if the number on the right is greater, make letter range parsing optional 2016-07-24 19:49:20 -04:00
Al
4151ce7919 [osm] Adding rail stations to venues data set if they have a street address and a Wikipedia 2016-07-24 14:13:38 -04:00
Al
75d9c31395 [text] Adding NORMALIZE_STRING_COMPOSE constant in pynormalize.c 2016-07-24 03:37:43 -04:00
Al
7b3f4e9175 [text] Adding utils.py for is_numeric/is_numeric_strict 2016-07-24 03:37:11 -04:00
Al
a620cae6e0 [fix] var 2016-07-23 15:45:07 -04:00
Al
487d589531 [fix] remove var 2016-07-23 15:17:47 -04:00
Al
bfc75912bc [fix] Only skip Quattroshapes matching if place=neighborhood 2016-07-23 15:15:23 -04:00
Al
26225ee8bb [osm] Removing rail stations from venues and making them a separate data set for reverse geocoding, fixing building!=yes query, should not include records with no building tag at all 2016-07-23 03:57:05 -04:00
Al
83f39a3dc5 [fix] removing print 2016-07-23 03:26:02 -04:00
Al
2a634797ec [fix] make sure values are hashable in mapping OSM components 2016-07-23 03:04:23 -04:00