Commit Graph

2116 Commits

Author SHA1 Message Date
Al 77a4476b8e [openaddresses] CLDR country names for OpenAddresses training set 2016-07-21 17:04:57 -04:00
Al 7d62a3a762 [fix] gauche 2016-07-21 17:04:57 -04:00
Al afa58e6edb [openaddresses] Removing New Zealand city as the field is not specific enough and may conflict with OSM names, needs to be reverse geocoded. Adding cldr country probabilities so we can add localized names/codes given the country 2016-07-21 17:04:57 -04:00
Al e91b318121 [addresses] French address levels alphanumeric 2016-07-21 17:04:57 -04:00
Al 9059c2af60 [addresses] Don't generate sub-building components at all if there's no house number 2016-07-21 17:04:57 -04:00
Al 9c090302f7 [addresses] Topological sort of address component dependencies so they get checked/removed in order 2016-07-21 17:04:57 -04:00
Al cd7cd292b7 [states] State abbreviations for Brazil and Mexico 2016-07-21 17:04:57 -04:00
Al 90a2f2b2e0 [parser] road has no dependencies 2016-07-21 17:04:57 -04:00
Al 29d16c9c80 [openaddresses] Country code for Belgium, removing Flanders as it has encoding issues, removing region from New Zealand formats as it appears to be conflated with districts 2016-07-21 17:04:57 -04:00
Al 419f5961a5 [fix] unused var 2016-07-21 17:04:57 -04:00
Al 7612e93fdf [addresses] French address config 2016-07-21 17:04:57 -04:00
Al 4b28791bb1 [addresses] Spanish PO box probabilities 2016-07-21 17:04:57 -04:00
Al a57ace0be0 [openaddresses] OpenAddresses training script 2016-07-21 17:04:57 -04:00
Al 64824b90a9 [openaddresses] Only adding units for Australia, as they're known to contain both designator and number. US units seem to often have simple numbers/letters for the unit field 2016-07-21 17:04:57 -04:00
Al 584a4e0ee8 [openaddresses] Added components via OA config 2016-07-21 17:04:57 -04:00
Al 55d66af422 [openaddresses] Adding abbreviated unit 2016-07-21 17:04:57 -04:00
Al 2120adefff [openaddresses] Adding unit by default (only for files that have been vetted) 2016-07-21 17:04:57 -04:00
Al d910c6ca94 [fix] OpenAddresses formatting 2016-07-21 17:04:57 -04:00
Al 802a5ee534 [fix] condition 2016-07-21 17:04:57 -04:00
Al e6a1d11324 [fix] validators 2016-07-21 17:04:57 -04:00
Al caa155c9c4 [fix] method name 2016-07-21 17:04:57 -04:00
Al 4d0caec3d3 [fix] return value 2016-07-21 17:04:57 -04:00
Al 0e09e1222f [fix] import again 2016-07-21 17:04:57 -04:00
Al e5267996ea [fix] import 2016-07-21 17:04:57 -04:00
Al 10662e79d5 [fix] directory structure 2016-07-21 17:04:57 -04:00
Al 0c9f1aa30d [fix] import 2016-07-21 17:04:57 -04:00
Al 1d80d8b6b8 [openaddresses] OpenAddresses address formatter, using the config 2016-07-21 17:04:57 -04:00
Al cc4b7109ab [openaddresses] OpenAddresses config specifying a few files 2016-07-21 17:04:57 -04:00
Al 91b06439e2 [openaddresses] Fetch script for OpenAddresses 2016-07-21 17:04:57 -04:00
Al a32f6b5017 [addresses] Making address_language a classmethod 2016-07-21 17:04:57 -04:00
Al 420ceb6c38 [intersections] Only requiring a tag to share at least two ways 2016-07-21 17:04:57 -04:00
Al cc7727b13e [intersections] Adding intersections to config 2016-07-21 17:04:57 -04:00
Al 202dc0c58a [fix] name 2016-07-21 17:04:57 -04:00
Al 73b2aec25e [fix] input file 2016-07-21 17:04:57 -04:00
Al 89f6793243 [fix] args 2016-07-21 17:04:57 -04:00
Al 51831e2111 [fix] add ways db dir 2016-07-21 17:04:57 -04:00
Al f7680e9b65 [fix] name 2016-07-21 17:04:57 -04:00
Al 0a912766e4 [fix] logging for intersections data 2016-07-21 17:04:57 -04:00
Al baf8fbb381 [fix] import 2016-07-21 17:04:57 -04:00
Al b4a70a9a56 [fix] import 2016-07-21 17:04:57 -04:00
Al 8aada7086f [intersections] intersections training data 2016-07-21 17:04:57 -04:00
Al 5075128ada [intersections] Adding places to intersection template, intersection phrase generator 2016-07-21 17:04:57 -04:00
Al 701e67614a [fix] import 2016-07-21 17:04:57 -04:00
Al 2454b98c6d [tokenization] Reverting commit for tokenizing initial/final apostrophes as part of words as it may be more effective to handle during post-processing 2016-07-21 17:04:57 -04:00
Al 0a8f46bdc3 [parser] Using new geonames designations in parser features 2016-07-21 17:04:57 -04:00
Al c383f8af88 [parser] Using NFC normalization for parser as well, @ sign not defined as separator since it may also be used in intersections 2016-07-21 17:04:57 -04:00
Al c2ee5a45b3 [geodb] Adding separate bitset for geonames place types and using NFC normalization instead of NFD (requires retraining) 2016-07-21 17:04:57 -04:00
Al 6c39c663ff [normalize] Adding NORMALIZE_STRING_COMPOSE for NFC unicode normalization 2016-07-21 17:04:57 -04:00
Al 757c6147cb [tokenization] Adding ability to tokenize 's Gravenhage 2016-07-21 17:04:57 -04:00
Al 2e8888e331 [fix] warnings/size_t in libpostal.c 2016-07-21 17:04:57 -04:00