[expand] Improve root expansions: use possible phrase roots even when they are abbreviated (e.g. "E Ctr St"), and add a special valid-components check for root expansions beyond what's stored in the build address dictionaries. Also remove spaces before checking for unique strings, split numeric from alpha only for non-ordinals, and use cstring_array internally with char ** in the public API.

Al
2017-12-25 01:37:29 -05:00
parent b4ce042f80
commit 152761fcbc
3 changed files with 121 additions and 57 deletions
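
The "split numeric from alpha only for non-ordinals" change in the commit message can be illustrated with a minimal sketch. This is not the code from this commit: the helper names (`leading_digits`, `is_english_ordinal`, `numeric_alpha_split`) are hypothetical, and the ordinal check assumes plain ASCII English suffixes only (st, nd, rd, th) without validating that the suffix matches the number.

```c
#include <ctype.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: length of the leading run of ASCII digits. */
static size_t leading_digits(const char *s) {
    size_t n = 0;
    while (isdigit((unsigned char)s[n])) n++;
    return n;
}

/* Hypothetical helper: nonzero when s is digits followed by exactly
 * "st", "nd", "rd" or "th" (case-insensitive, English only). */
static int is_english_ordinal(const char *s) {
    size_t n = leading_digits(s);
    if (n == 0 || strlen(s + n) != 2) return 0;
    char a = (char)tolower((unsigned char)s[n]);
    char b = (char)tolower((unsigned char)s[n + 1]);
    return (a == 's' && b == 't') || (a == 'n' && b == 'd') ||
           (a == 'r' && b == 'd') || (a == 't' && b == 'h');
}

/* Returns the byte offset at which to split the numeric prefix from the
 * alphabetic remainder, or 0 when the token should be left whole. */
static size_t numeric_alpha_split(const char *s) {
    size_t n = leading_digits(s);
    if (n == 0 || s[n] == '\0') return 0;        /* no digits, or digits only */
    if (!isalpha((unsigned char)s[n])) return 0; /* digits not followed by a letter */
    if (is_english_ordinal(s)) return 0;         /* keep "1st", "42nd" intact */
    return n;
}
```

Under these assumptions a token such as "123a" would be split after the digits, while "1st" or "42nd" would be kept as a single token.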


@@ -14,6 +14,8 @@
typedef libpostal_token_t token_t;
#define NULL_TOKEN (token_t){0, 0, END}
VECTOR_INIT(token_array, token_t)
typedef struct tokenized_string {