Al
|
1d1ada1bc1
|
[normalize] Adding NORMALIZE_STRING_COMPOSE for NFC unicode normalization
|
2016-05-28 19:25:12 -04:00 |
|
Al
|
1fd57fdda3
|
[tokenization] Adding ability to tokenize 's Gravenhage
|
2016-05-28 19:24:19 -04:00 |
|
Al
|
514aaf7377
|
[fix] warnings/size_t in libpostal.c
|
2016-05-28 19:19:31 -04:00 |
|
Al
|
c0e8578b9c
|
[gazetteers] Adding new gazetteer types/address components
|
2016-05-28 19:19:18 -04:00 |
|
Al
|
acd97a0081
|
[dictionaries] Adding letra to Spanish numbered unit dictionaries
|
2016-05-28 19:15:02 -04:00 |
|
Al
|
bac86be6a3
|
[dictionaries] Adding new dictionary types to generator script
|
2016-05-28 17:16:43 -04:00 |
|
Al
|
cff23c77ab
|
[boundaries] Adding Bucharest sectors as city_district
|
2016-05-27 20:22:56 -04:00 |
|
Al
|
5e0e22a666
|
[dictionaries] More dictionary refactoring
|
2016-05-27 19:40:20 -04:00 |
|
Al
|
5590c89a5e
|
[addresses] Allowing null_phrase_probability for alpha, and alpha+digits instead of just for ordinals (mostly for Spain)
|
2016-05-27 13:40:38 -04:00 |
|
Al
|
bdd6d99f56
|
[addresses] Adding increasing null_phrase_probability for plain numerics in Spain so things like 2o B make it into the training data
|
2016-05-27 13:37:48 -04:00 |
|
Al
|
cc453cfbbd
|
[places] setting probability of including island to 0.5 for Hawaii, 0.8 seems too high given all the Honolulu, HI addresses (not often seen as Honolulu, Oahu, HI)
|
2016-05-27 11:32:52 -04:00 |
|
Al
|
f69d9e2e1c
|
[dictionaries] Italian CAP abbreviations
|
2016-05-27 11:31:16 -04:00 |
|
Al
|
fc96cf145f
|
[dictionaries] Russian place names
|
2016-05-27 11:28:50 -04:00 |
|
Al
|
ec0df1410b
|
[dictionaries] Adding more fleshed out Greek dictionaries from a recent Nominatim NameFinder wiki update
|
2016-05-27 11:28:23 -04:00 |
|
Al
|
dccbdc4ccc
|
[dictionaries] Refactoring existing unit_types/level_types dictionaries to use the new more granular dictionary structure
|
2016-05-27 11:27:34 -04:00 |
|
Al
|
572759885f
|
[parser] Sample chain store alternate names from the cross-language dictionary
|
2016-05-26 12:09:10 -04:00 |
|
Al
|
5daa64faef
|
[parser] Fixing config keys so OSM streets/venues get abbreviated. Selecting namespaced address fields in cases like Brussels or Hong Kong where everything is bilingual. Adding the ability to pass a known language into address component expansion
|
2016-05-26 12:05:46 -04:00 |
|
Al
|
206a471732
|
[fix] loading transliteration module in address_parser_test.c as well
|
2016-05-25 19:54:01 -04:00 |
|
Al
|
34f5d833a2
|
[fix] ON needs to be quotes in YAML, uppercase Yukon abbreviation
|
2016-05-25 19:12:15 -04:00 |
|
Al
|
f59150b047
|
[fix] cstring_array_split calls
|
2016-05-25 17:58:30 -04:00 |
|
Al
|
5065917f41
|
[fix] brace
|
2016-05-25 17:52:00 -04:00 |
|
Al
|
679d3efcdc
|
[parser] Ignore multiple spaces in parser input post-normalization. If normalizing the string creates several distinct tokens (namely in Vulgar fractions e.g. ½ => 1/2), add all the sub-tokens with the same label as the parent
|
2016-05-25 17:50:29 -04:00 |
|
Al
|
370744ccfd
|
[utils] Adding cstring_array_split_ignore_consecutive
|
2016-05-25 17:07:20 -04:00 |
|
Al
|
5c7d24c71b
|
[fix] calls and NULL checks
|
2016-05-25 15:50:53 -04:00 |
|
Al
|
349df20720
|
[fix] tokenized_string_t should copy its source string
|
2016-05-25 15:48:03 -04:00 |
|
Al
|
00784a897d
|
[fix] Need to load transliteration module for Latin-ASCII normalization
|
2016-05-25 15:25:34 -04:00 |
|
Al
|
bf50d27b0e
|
[places] Adding Town of to English prefixes
|
2016-05-25 11:23:31 -04:00 |
|
Al
|
5a88294dbc
|
[parser] lower full-name probability for states
|
2016-05-25 00:47:36 -04:00 |
|
Al
|
5377a831ab
|
[fix] use simple language code if language_script cannot be found
|
2016-05-24 19:49:08 -04:00 |
|
Al
|
a4064ecd02
|
[fix] global formatter config
|
2016-05-24 19:44:40 -04:00 |
|
Al
|
3661a1e5eb
|
[fix] config key name
|
2016-05-24 19:39:12 -04:00 |
|
Al
|
26bbd2916b
|
[fix] neighborhood reverse geocoder using the new OSM definitions module which keeps track of whatever the data fetching script defines as being a valid {neighborhood, admin boundary, etc.}
|
2016-05-24 19:27:38 -04:00 |
|
Al
|
1a66fc3396
|
[boundaries] lines sharing a point are added to the polygon head-to-tail, reversing the node order as needed, produces accurate OSM polygons for reverse geocoding lookups
|
2016-05-24 19:24:41 -04:00 |
|
Al
|
206cd56cd2
|
[fix] moving language code replacements out of address components
|
2016-05-24 16:55:46 -04:00 |
|
Al
|
c4aebeebc3
|
[boundaries] admin_level=8 is city_district in Japan
|
2016-05-24 16:53:42 -04:00 |
|
Al
|
bdb6bb03e3
|
[formatting] Moving language country overrides to formatter config so actual language is retained
|
2016-05-24 16:52:08 -04:00 |
|
Al
|
97582e9c64
|
[fix] place=municipality
|
2016-05-24 15:35:33 -04:00 |
|
Al
|
6af06d904a
|
[fix] OSM neighborhood ids
|
2016-05-24 15:13:07 -04:00 |
|
Al
|
c4eab01176
|
[fix] Adding basic Han numeral replacement to neighborhood deduping
|
2016-05-24 14:55:54 -04:00 |
|
Al
|
a5a24fb3b9
|
[fix] component bitsets
|
2016-05-24 13:07:32 -04:00 |
|
Al
|
cf2bbcb4e0
|
[fix] language format changes only apply to local languages
|
2016-05-24 12:59:32 -04:00 |
|
Al
|
bb2da53311
|
[formatting] Increase probability of postcode before city
|
2016-05-24 12:21:04 -04:00 |
|
Al
|
aedb249ad7
|
[languages] Use English formats for Romanized CJK
|
2016-05-24 12:14:06 -04:00 |
|
Al
|
7186cf13de
|
[fix] floor samples
|
2016-05-24 11:16:57 -04:00 |
|
Al
|
eb83ae91cb
|
[fix] Don't remove chome from Japanese, as the neighborhoods are usually just plain numbers
|
2016-05-23 18:17:29 -04:00 |
|
Al
|
028b7a460e
|
[fix] args
|
2016-05-23 17:42:34 -04:00 |
|
Al
|
48a41eaceb
|
[fix] US/Canada probabilities for industrial/commercial
|
2016-05-23 16:22:27 -04:00 |
|
Al
|
f2f98043ab
|
[boundaries] Adding CP and civil parish to English place suffixes
|
2016-05-23 15:48:13 -04:00 |
|
Al
|
32e017a3ab
|
[osm] Venue name depends on one of {house_number, road, suburb, city_district, city, postcode}
|
2016-05-23 15:46:59 -04:00 |
|
Al
|
5f78d4f3a0
|
[fix] Spanish office probabilities
|
2016-05-23 15:35:55 -04:00 |
|