Iestyn Pryce
6aa3cb61fd
Fix log_* formats which expect long long int but receive int64_t.
2017-05-21 10:29:34 +01:00
Iestyn Pryce
ecd07b18c1
Fix log_* formats which expect size_t but receive uint32_t.
2017-05-19 22:31:56 +01:00
Iestyn Pryce
87a76bf967
Fix log_{debug,info} formats which expect size_t but receive int.
2017-05-17 22:40:53 +01:00
Al Barrentine
2a0fb69ae5
Merge pull request #201 from iestynpryce/master
Fix log_debug formats which expect unsigned int but receive size_t
2017-05-14 20:53:15 -04:00
Iestyn Pryce
f34fc56fec
Fix log_debug formats which expect unsigned int but receive size_t
2017-05-14 17:48:26 +01:00
Al
a7e67c4967
[fix] adding maximum number of permutations for libpostal_expand_address to consider (n=100 for both the inner and outer loop, so max strings=10000), fixes #200
2017-05-13 14:11:08 -04:00
Al
5780a08b48
[fix] check that possible ordinal suffix also has non-zero digit length before normalizing
2017-05-12 15:48:20 -04:00
Al
cea3ced533
[fix] open files in binary format for #69
2017-05-03 17:34:38 -04:00
Al
6ea2273263
[fix] terminate the char_array if input token is zero-length in add_normalized_token
2017-04-28 11:25:07 -04:00
Al Barrentine
04eb2d4539
Merge pull request #189 from openvenues/fix_trie_search
Reset to root node in trie search on partial failed matches before rolling back pointer
2017-04-21 14:39:03 -04:00
Al
278679b7fb
[fix] in tokenized trie_search, in the case of a partial failed match, reset to the root node before rolling the pointer back to phrase start + 1
2017-04-21 13:51:07 -04:00
Travis
074b6ff802
[auto][ci skip] Adding data files from Travis build #231
2017-04-20 02:39:39 +00:00
Al Barrentine
004d3d98c9
Merge pull request #187 from openvenues/degree_symbol_ordinal_suffix
Ordinal suffix tests
2017-04-19 22:29:10 -04:00
Al
7bce358ca6
[fix] whitespace in numex config to trigger build
2017-04-19 21:14:54 -04:00
Al
676fb9bcbc
[fix] no parens in travis config grep for numex change detection
2017-04-19 21:14:19 -04:00
Al
86956db055
[fix] adding numex change to trigger build
2017-04-19 21:00:59 -04:00
Al
e81580287d
[test] adding tests for ordinal suffix normalization
2017-04-19 20:59:36 -04:00
Al
85297f3333
[fix] numex change detection in Travis build
2017-04-19 20:58:08 -04:00
Travis
4762ff2638
[auto][ci skip] Adding data files from Travis build #228
2017-04-20 00:51:42 +00:00
Al Barrentine
e92c3c2867
Merge pull request #186 from openvenues/degree_symbol_ordinal_suffix
Degree symbol ordinal suffix
2017-04-19 20:39:22 -04:00
Al
f3adde746e
[numex] adding ability to handle the degree symbol in numex parsing since it's technically a separate token
2017-04-19 20:18:21 -04:00
Al
19899b2f7d
[dictionaries] adding degree symbol "°" variant for any surface forms that have "º"
2017-04-19 19:25:25 -04:00
Al
c968dd4ecc
[numex] adding "°" as additional ordinal suffix for Spanish, Italian, and Portuguese
2017-04-19 19:22:28 -04:00
Al Barrentine
254f3622ea
Merge pull request #185 from Ironholds/master
Remove unused variable
2017-04-19 09:08:59 -04:00
Oliver Keyes
18a5d06427
Merge pull request #1 from Ironholds/Ironholds-patch-1
Remove unused variable
2017-04-18 21:53:24 -07:00
Oliver Keyes
35821f975e
Remove unused variable
What it says on the tin!
2017-04-18 21:25:00 -07:00
Al Barrentine
e0c82b5edb
Merge pull request #184 from openvenues/remove_ordinal_suffix
Remove ordinal suffixes in libpostal_expand_address
2017-04-18 22:33:00 -04:00
Al
9cd3ec37f9
[build] rebuild numex table in Travis if either the configs change or numex_table_builder.c changes
2017-04-18 21:42:09 -04:00
Al
f3cf119e58
[build] Makefile changes to support moving numeric expression parsing to normalize.c
2017-04-18 21:41:24 -04:00
Al
cddc368533
[numex] adding one form of normalization which strips ordinal suffixes so {96th, Ninety-sixth} => 96. This is an additional form of normalization, so there's still one form where the suffixes are kept. One case that's still not handled is something like "IXe Arrondissement"
2017-04-18 21:39:54 -04:00
Al
92051863ba
[numex] adding ordinal suffixes themselves to the numex trie so they can be removed from strings
2017-04-18 17:20:02 -04:00
Al Barrentine
63ac3cf921
Merge pull request #183 from openvenues/cdn
Hosting model files and training data on CloudFront CDN
2017-04-17 14:39:35 -04:00
Al
d2732922c2
[data] deployed model files and training data to CloudFront for easier downloading around the world and in places like China where the Great Firewall may prevent large downloads from abroad. TTL is set to 0, so CloudFront still caches the files themselves but checks with the origin using If-Modified-Since headers, allowing the files to be updated dynamically
2017-04-17 14:11:44 -04:00
Al Barrentine
5699ef3da0
Merge pull request #181 from eefi/bug/various/initializer
[fix] don't use unnamed fields in initializers
2017-04-13 16:22:33 -04:00
Al
36dc41af8c
Merge branch 'master' of https://github.com/openvenues/libpostal
2017-04-13 16:02:06 -04:00
Al
413c584f08
[fix] need to set prev_state to the NULL state in numex parsing after a non-space/non-hyphen is encountered and the previous match, if any, is added to the result array
2017-04-13 16:01:46 -04:00
Austin Chu
f9b57dbd42
[fix] don't use unnamed fields in initializers
GCC did not support assigning to unnamed fields from designated
initializers until 4.6 [1]. Unfortunately, CentOS 6 ships with GCC 4.4,
so avoiding this C99 feature is necessary to fix building in CentOS 6
environments.
[1] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=10676
2017-04-13 14:44:20 -04:00
Al Barrentine
7bef84676e
Merge pull request #180 from eefi/bug/tagger/include-guard
[fix] add #include guard to tagger.h
2017-04-13 13:58:13 -04:00
Austin Chu
a966712e18
[fix] add #include guard to tagger.h
2017-04-13 13:02:03 -04:00
Al Barrentine
32c8662f8d
Merge pull request #177 from eefi/bug/matrix/clbas
[fix] typo in compiler warning when no CBLAS found
2017-04-12 20:58:00 -04:00
Austin Chu
19a04511ba
[fix] typo in compiler warning when no CBLAS found
2017-04-12 20:40:08 -04:00
Al
b464eb6c07
[numex] fix numex parsing when the spelled-out number is followed by a comma or other punctuation
2017-04-11 16:28:33 -04:00
Al
fc91471434
[osm/boundaries] check polygons with an ISO3166-2 as well in the country polygon index in case the country polygon is funky
2017-04-09 02:15:46 -04:00
Al
4ecd6c23c6
[formatting] removing the ability to insert city between house number and road in France from discussion in #27
2017-04-08 15:42:59 -04:00
Al
7f7aada32a
[build] add another housekeeping file in the datadir for data_version. Blow away the existing files if that file either doesn't exist or doesn't contain a matching version string to help with upgrades
2017-04-07 17:40:27 -04:00
Al
4f9b0ef495
[docs][ci skip] adding note about using libpostal on mobile
2017-04-07 00:55:39 -04:00
Al
6984427eb9
[docs][ci skip] add link to the 1.0 blog post
2017-04-06 13:19:45 -04:00
Al
5605ba3185
[docs] adding note about the newly-trained language classifier trained with FTRL-Proximal (now 1/10th the size), which keeps its high accuracy while maintaining a sparse solution. This commit will trigger a build with the freshly uploaded model.
2017-04-06 11:55:11 -04:00
Al
5a96be5d5c
[fix][ci skip] S3 upload paths in data upload/download script
2017-04-06 00:37:12 -04:00
Travis
d8409f1f38
[auto][ci skip] Adding data files from Travis build #210
2017-04-06 04:06:16 +00:00