2053 Commits

Author SHA1 Message Date
Al
69e1c846ba [parser] Fixing config keys so OSM streets/venues get abbreviated. Selecting namespaced address fields in cases like Brussels or Hong Kong where everything is bilingual. Adding the ability to pass a known language into address component expansion 2016-07-21 17:04:57 -04:00
Al
e5e0cf3b92 [fix] loading transliteration module in address_parser_test.c as well 2016-07-21 17:04:57 -04:00
Al
8e338c5ffb [fix] ON needs to be quoted in YAML, uppercase Yukon abbreviation 2016-07-21 17:04:57 -04:00
Al
b8d43dc601 [fix] cstring_array_split calls 2016-07-21 17:04:57 -04:00
Al
b19cd3f60a [fix] brace 2016-07-21 17:04:57 -04:00
Al
994b2f18e4 [parser] Ignore multiple spaces in parser input post-normalization. If normalizing the string creates several distinct tokens (namely in Vulgar fractions e.g. ½ => 1/2), add all the sub-tokens with the same label as the parent 2016-07-21 17:04:57 -04:00
Al
b664ab1cea [utils] Adding cstring_array_split_ignore_consecutive 2016-07-21 17:04:57 -04:00
Al
8e90ee45d2 [fix] calls and NULL checks 2016-07-21 17:04:57 -04:00
Al
e3cffaf0d1 [fix] tokenized_string_t should copy its source string 2016-07-21 17:04:57 -04:00
Al
16501aba17 [fix] Need to load transliteration module for Latin-ASCII normalization 2016-07-21 17:04:57 -04:00
Al
b326e209fb [places] Adding Town of to English prefixes 2016-07-21 17:04:57 -04:00
Al
366c4995af [parser] lower full-name probability for states 2016-07-21 17:04:57 -04:00
Al
d88be7ef5d [fix] use simple language code if language_script cannot be found 2016-07-21 17:04:57 -04:00
Al
90467e9098 [fix] global formatter config 2016-07-21 17:04:57 -04:00
Al
16a91528d6 [fix] config key name 2016-07-21 17:04:57 -04:00
Al
d3b936067e [fix] neighborhood reverse geocoder using the new OSM definitions module which keeps track of whatever the data fetching script defines as being a valid {neighborhood, admin boundary, etc.} 2016-07-21 17:04:57 -04:00
Al
b294b891dd [boundaries] lines sharing a point are added to the polygon head-to-tail, reversing the node order as needed, which produces accurate OSM polygons for reverse geocoding lookups 2016-07-21 17:04:57 -04:00
Al
75aa713792 [fix] moving language code replacements out of address components 2016-07-21 17:04:57 -04:00
Al
6cb834b3a3 [boundaries] admin_level=8 is city_district in Japan 2016-07-21 17:04:57 -04:00
Al
308080f6ee [formatting] Moving language country overrides to formatter config so actual language is retained 2016-07-21 17:04:57 -04:00
Al
e59e3a173c [fix] place=municipality 2016-07-21 17:04:57 -04:00
Al
3c16973cac [fix] OSM neighborhood ids 2016-07-21 17:04:57 -04:00
Al
d86443a697 [fix] Adding basic Han numeral replacement to neighborhood deduping 2016-07-21 17:04:57 -04:00
Al
046f445a56 [fix] component bitsets 2016-07-21 17:04:57 -04:00
Al
0dbfd79b72 [fix] language format changes only apply to local languages 2016-07-21 17:04:57 -04:00
Al
12f86875e2 [formatting] Increase probability of postcode before city 2016-07-21 17:04:57 -04:00
Al
890268aa87 [languages] Use English formats for Romanized CJK 2016-07-21 17:04:57 -04:00
Al
ad4b197ead [fix] floor samples 2016-07-21 17:04:57 -04:00
Al
e53e61358d [fix] Don't remove chome from Japanese, as the neighborhoods are usually just plain numbers 2016-07-21 17:04:57 -04:00
Al
110be7a245 [fix] args 2016-07-21 17:04:57 -04:00
Al
9772e85c87 [fix] US/Canada probabilities for industrial/commercial 2016-07-21 17:04:57 -04:00
Al
d4e913c55f [boundaries] Adding CP and civil parish to English place suffixes 2016-07-21 17:04:57 -04:00
Al
a5331f7107 [osm] Venue name depends on one of {house_number, road, suburb, city_district, city, postcode} 2016-07-21 17:04:57 -04:00
Al
2d1e7ca990 [fix] Spanish office probabilities 2016-07-21 17:04:57 -04:00
Al
a1421d4a68 [fix] floors 2016-07-21 17:04:57 -04:00
Al
5ea570835e [fix] args again 2016-07-21 17:04:57 -04:00
Al
7c41d84d8f [fix] args 2016-07-21 17:04:57 -04:00
Al
2e4ba6e6cc [subdivisions/buildings] Adding subdivisions and buildings rtree to training data for getting building height, zone 2016-07-21 17:04:57 -04:00
Al
52aa95c213 [subdivisions] Adding zone types 2016-07-21 17:04:57 -04:00
Al
91db1ec371 [fix] removing unnecessary vars 2016-07-21 17:04:57 -04:00
Al
694020ddf3 [fix] all_names returns a list not a set 2016-07-21 17:04:57 -04:00
Al
97d2bfb508 [osm] venue names 2016-07-21 17:04:57 -04:00
Al
66e35d517d [fix] language disambiguation 2016-07-21 17:04:57 -04:00
Al
2a4f8c5634 [fix] set 2016-07-21 17:04:57 -04:00
Al
4c71cab6a0 [languages] Adding script-only disambiguation 2016-07-21 17:04:57 -04:00
Al
a9959651f8 [fix] parent streets 2016-07-21 17:04:57 -04:00
Al
e4d84fac7e [parser/osm] Adding address sans name for venues probabilistically 2016-07-21 17:04:57 -04:00
Al
3d47d7d13c [fix] place components 2016-07-21 17:04:57 -04:00
Al
e5a0dc5a4f [fix] dependencies 2016-07-21 17:04:57 -04:00
Al
35b4f598f8 [fix] drop invalid components 2016-07-21 17:04:57 -04:00