Commit Graph

1912 Commits

Author SHA1 Message Date
Al
334f22a41c [formatting] New formatter config including random component component order changes and default/per-country admin component ordering 2016-07-21 17:04:57 -04:00
Al
8b9e351961 [names] Name affixes respect hyphens and lack of whitespace (for ideographic languages) 2016-07-21 17:04:57 -04:00
Al
64ad15fa36 [boundaries] Adding boundary prefixes/suffixes for German, French, Japanese 2016-07-21 17:04:57 -04:00
Al
d739ca8268 [boundaries] Adding prefix/suffix replacements for English and Spanish 2016-07-21 17:04:57 -04:00
Al
fd95963c3a [boundaries] Using key name rather than boundary name in global boundary config 2016-07-21 17:04:57 -04:00
Al
72fad2f5e3 [fix] six.iteritems 2016-07-21 17:04:57 -04:00
Al
6f2c81c767 [addresses] Removing subdivisions/buildings rtree, moving probabilities to config 2016-07-21 17:04:57 -04:00
Al
35deb15a84 [fix] default option for Aliases.get 2016-07-21 17:04:57 -04:00
Al
d519054eb4 [fix] try/except for GeoJSON in neighborhoods index, some of the neighborhoods repo polygons were using 3 coordinates 2016-07-21 17:04:57 -04:00
Al
b8513c0dc6 [osm] Adding osm_type_and_id function for handling all-to-nodes output from osmfilter. Using in neighborhoods as well as admin rtree. 2016-07-21 17:04:57 -04:00
Al
f468ab84d2 [parser] Removing island exceptions from parser default config 2016-07-21 17:04:57 -04:00
Al
18072094f5 [boundaries] Macao boundary type mapping 2016-07-21 17:04:57 -04:00
Al
6491a5c3c4 [countries] Wrapping CLDR country names/alpha3 codes 2016-07-21 17:04:57 -04:00
Al
c33f404e1a [addresses] Using the new parser config for address component expansions 2016-07-21 17:04:57 -04:00
Al
62b35b318f [parser] Parser default config 2016-07-21 17:04:57 -04:00
Al
1cf1527acd [boundaries] removing omissions from boundary names config 2016-07-21 17:04:57 -04:00
Al
d232d7afcd [boundaries] Adding component-specific admin name probabilities to config (e.g. choose the ISO alpha-2 code 20% of the time, etc.) 2016-07-21 17:04:57 -04:00
Al
1bf32c7320 [boundaries] Config for boundary name changes (Kings County is a state_district but Brooklyn should not be used for that context) and omissions (usually we add islands as address components, but not e.g. Manhattan Island) 2016-07-21 17:04:57 -04:00
Al
ad81095879 [osm] moving osm_address_components to its own module 2016-07-21 17:04:57 -04:00
Al
4271f7f84b [units] Refactoring unit generator to use base alphanumeric generator 2016-07-21 17:04:57 -04:00
Al
c18f53027f [floors] Refactoring floor/level generator to use base alphanumeric generator 2016-07-21 17:04:57 -04:00
Al
d38918ca7f [directions] wrapper for adding cardinal/relative directions probabilistically 2016-07-21 17:04:57 -04:00
Al
ed12d7bbe4 [names] component expansion uses the new configurable affix replacements 2016-07-21 17:04:57 -04:00
Al
6f8e94b851 [config] Adding default/alternative probability distribution to config.utils 2016-07-21 17:04:57 -04:00
Al
88b25a2d22 [names] Adding name affix normalizations to a YAML config 2016-07-21 17:04:57 -04:00
Al
7185bfa4f1 [dictionaries] More Spanish abbreviations 2016-07-21 17:04:57 -04:00
Al
f96bcd1727 [numbering] choose_alphanumeric_type for base NumericPhrase 2016-07-21 17:04:57 -04:00
Al
6cec1d99f7 [fix] import 2016-07-21 17:04:57 -04:00
Al
3263535cf1 [fix] name 2016-07-21 17:04:57 -04:00
Al
3bbe616b5e [fix] uppercased state abbreviations 2016-07-21 17:04:57 -04:00
Al
2f784ddec6 [fix] state abbreviation instead of full name 2016-07-21 17:04:57 -04:00
Al
8c82125788 [fix] defaults 2016-07-21 17:04:57 -04:00
Al
62c0801683 [fix] state name 2016-07-21 17:04:57 -04:00
Al
670da6c4bc [fix] method name 2016-07-21 17:04:57 -04:00
Al
477c96ea00 [fix] var name 2016-07-21 17:04:57 -04:00
Al
00be2e4d0e [fix] all_languages var 2016-07-21 17:04:57 -04:00
Al
1e2894a665 [fix] normalize place names before adding OSM components, modify components in place, delete keys and use the boundary components if the component is ambiguous 2016-07-21 17:04:57 -04:00
Al
cec0d6f6df [fix] tuple 2016-07-21 17:04:57 -04:00
Al
e198bbf23e [fix] whitespace 2016-07-21 17:04:57 -04:00
Al
b190c88cc1 [fix] state 2016-07-21 17:04:57 -04:00
Al
3c6691d295 [fix] kwargs 2016-07-21 17:04:57 -04:00
Al
4909fa7ee1 [fix] deriving whitespace and state in normalized_place_name, adding all candidate languages to arguments 2016-07-21 17:04:57 -04:00
Al
330394ff51 [fix] raw OSM reverse geocoded components vs. versus normalized version 2016-07-21 17:04:57 -04:00
Al
f7697cf20d [fix] import 2016-07-21 17:04:57 -04:00
Al
e5fdd915d0 [fix] check the first phrase for components and bail if it matches something other than the specified tag 2016-07-21 17:04:57 -04:00
Al
8370a41ec0 [fix] import 2016-07-21 17:04:57 -04:00
Al
651bc32650 [addresses] more thoroughly solving the addr:city='Harlem' issue 2016-07-21 17:04:57 -04:00
Al
5a31b60cbe [addresses] Adding normalized_place_name, a method for separating compound fields like addr:city='New York NY' into simply 'New York', solving the compound phrase problem. Also solves the mislabeled place name problem, causing the system to ignore the user tag and fall back on reverse geocoded components in cases e.g. where addr:city='Harlem', which is a known neighborhood but not a city when reverse geocoded. A few other refactors for expanded address components 2016-07-21 17:04:57 -04:00
Al
52246e0cd0 [formatting] Defining some of the new tag names in AddressFormatter as well as insert_component which reparses the address formatter template and inserts a given components, removing it from an existing block if necessary 2016-07-21 17:04:57 -04:00
Al
b22fb669b9 [aliases] Adding get method for aliases 2016-07-21 17:04:57 -04:00