2062 Commits

Author SHA1 Message Date
Al
dccbdc4ccc [dictionaries] Refactoring existing unit_types/level_types dictionaries to use the new more granular dictionary structure 2016-05-27 11:27:34 -04:00
Al
572759885f [parser] Sample chain store alternate names from the cross-language dictionary 2016-05-26 12:09:10 -04:00
Al
5daa64faef [parser] Fixing config keys so OSM streets/venues get abbreviated. Selecting namespaced address fields in cases like Brussels or Hong Kong where everything is bilingual. Adding the ability to pass a known language into address component expansion 2016-05-26 12:05:46 -04:00
Al
206a471732 [fix] loading transliteration module in address_parser_test.c as well 2016-05-25 19:54:01 -04:00
Al
34f5d833a2 [fix] ON needs to be quoted in YAML, uppercase Yukon abbreviation 2016-05-25 19:12:15 -04:00
Al
f59150b047 [fix] cstring_array_split calls 2016-05-25 17:58:30 -04:00
Al
5065917f41 [fix] brace 2016-05-25 17:52:00 -04:00
Al
679d3efcdc [parser] Ignore multiple spaces in parser input post-normalization. If normalizing the string creates several distinct tokens (namely in vulgar fractions, e.g. ½ => 1/2), add all the sub-tokens with the same label as the parent 2016-05-25 17:50:29 -04:00
Al
370744ccfd [utils] Adding cstring_array_split_ignore_consecutive 2016-05-25 17:07:20 -04:00
Al
5c7d24c71b [fix] calls and NULL checks 2016-05-25 15:50:53 -04:00
Al
349df20720 [fix] tokenized_string_t should copy its source string 2016-05-25 15:48:03 -04:00
Al
00784a897d [fix] Need to load transliteration module for Latin-ASCII normalization 2016-05-25 15:25:34 -04:00
Al
bf50d27b0e [places] Adding "Town of" to English prefixes 2016-05-25 11:23:31 -04:00
Al
5a88294dbc [parser] lower full-name probability for states 2016-05-25 00:47:36 -04:00
Al
5377a831ab [fix] use simple language code if language_script cannot be found 2016-05-24 19:49:08 -04:00
Al
a4064ecd02 [fix] global formatter config 2016-05-24 19:44:40 -04:00
Al
3661a1e5eb [fix] config key name 2016-05-24 19:39:12 -04:00
Al
26bbd2916b [fix] neighborhood reverse geocoder using the new OSM definitions module which keeps track of whatever the data fetching script defines as being a valid {neighborhood, admin boundary, etc.} 2016-05-24 19:27:38 -04:00
Al
1a66fc3396 [boundaries] lines sharing a point are added to the polygon head-to-tail, reversing the node order as needed; this produces accurate OSM polygons for reverse geocoding lookups 2016-05-24 19:24:41 -04:00
Al
206cd56cd2 [fix] moving language code replacements out of address components 2016-05-24 16:55:46 -04:00
Al
c4aebeebc3 [boundaries] admin_level=8 is city_district in Japan 2016-05-24 16:53:42 -04:00
Al
bdb6bb03e3 [formatting] Moving language country overrides to formatter config so actual language is retained 2016-05-24 16:52:08 -04:00
Al
97582e9c64 [fix] place=municipality 2016-05-24 15:35:33 -04:00
Al
6af06d904a [fix] OSM neighborhood ids 2016-05-24 15:13:07 -04:00
Al
c4eab01176 [fix] Adding basic Han numeral replacement to neighborhood deduping 2016-05-24 14:55:54 -04:00
Al
a5a24fb3b9 [fix] component bitsets 2016-05-24 13:07:32 -04:00
Al
cf2bbcb4e0 [fix] language format changes only apply to local languages 2016-05-24 12:59:32 -04:00
Al
bb2da53311 [formatting] Increase probability of postcode before city 2016-05-24 12:21:04 -04:00
Al
aedb249ad7 [languages] Use English formats for Romanized CJK 2016-05-24 12:14:06 -04:00
Al
7186cf13de [fix] floor samples 2016-05-24 11:16:57 -04:00
Al
eb83ae91cb [fix] Don't remove chome from Japanese, as the neighborhoods are usually just plain numbers 2016-05-23 18:17:29 -04:00
Al
028b7a460e [fix] args 2016-05-23 17:42:34 -04:00
Al
48a41eaceb [fix] US/Canada probabilities for industrial/commercial 2016-05-23 16:22:27 -04:00
Al
f2f98043ab [boundaries] Adding CP and civil parish to English place suffixes 2016-05-23 15:48:13 -04:00
Al
32e017a3ab [osm] Venue name depends on one of {house_number, road, suburb, city_district, city, postcode} 2016-05-23 15:46:59 -04:00
Al
5f78d4f3a0 [fix] Spanish office probabilities 2016-05-23 15:35:55 -04:00
Al
698804b230 [fix] floors 2016-05-23 15:18:10 -04:00
Al
b8e43fa7f8 [fix] args again 2016-05-23 15:01:58 -04:00
Al
d6c11dde0f [fix] args 2016-05-23 14:59:22 -04:00
Al
1e2ffd9847 [subdivisions/buildings] Adding subdivisions and buildings rtree to training data for getting building height, zone 2016-05-23 14:51:44 -04:00
Al
dbc41a931b [subdivisions] Adding zone types 2016-05-23 14:45:55 -04:00
Al
edff5b9730 [fix] removing unnecessary vars 2016-05-23 13:04:25 -04:00
Al
b0f49db9be [fix] all_names returns a list not a set 2016-05-23 13:04:00 -04:00
Al
f20cff3b2a [osm] venue names 2016-05-23 12:51:28 -04:00
Al
85b3532333 [fix] language disambiguation 2016-05-23 11:54:36 -04:00
Al
9f95bdd4d0 [fix] set 2016-05-23 11:44:49 -04:00
Al
bd341417a3 [languages] Adding script-only disambiguation 2016-05-23 11:42:42 -04:00
Al
e6157915af [fix] parent streets 2016-05-23 10:22:25 -04:00
Al
8b87d224c9 [parser/osm] Adding address sans name for venues probabilistically 2016-05-23 05:28:37 -04:00
Al
5d590acbe0 [fix] place components 2016-05-23 05:21:00 -04:00