Commit Graph

57 Commits

Author SHA1 Message Date
Al
2290b0991e [fix] transliteration case where a context no match comes at the end of the string 2018-02-06 15:08:11 -05:00
Al
86fdaf7188 [fix] changing root expansion test for "E Broadway", which now returns "broadway" instead of "east", which is a better result anyway 2018-02-05 19:36:01 -05:00
Al
dc8bffd5a0 [fix] expansion test valgrind errors for #308 2018-01-21 01:20:40 -05:00
Al
0cb488ecea [test] different test case for expansion, male names are overemphasized as it is 2018-01-02 19:01:58 -05:00
Gregory Oschwald
2f6749fe03 Fix segfault in expand_alternative_phrase_option: string_tree_get_alternative can return NULL 2018-01-02 13:28:51 -08:00
Gregory Oschwald
95ea873498 Fix memory leaks in test_trie: The primary motivation is to make the test suite run clean under Valgrind so that we don't need to wade through unimportant failures. 2018-01-02 10:38:12 -08:00
Al
d33b6693b9 [test] adding Suite/Ste tests for root expansion bugfix 2018-01-02 03:50:23 -05:00
Al
2afcd74779 [test] adding E Ctr St tests 2017-12-25 01:38:50 -05:00
Al
ff3c7ab3b6 [test] PO box expansion tests 2017-12-17 22:12:37 -05:00
Al
27f4eb2721 [test] unit expansion tests 2017-12-17 22:01:30 -05:00
Al
f7326e52f6 [test] level expansion tests 2017-12-17 22:00:03 -05:00
Al
1d22da603f [test] house number expansion tests 2017-12-17 20:17:28 -05:00
Al
bfdb6b8f87 [test] adding header to fix warning 2017-12-17 20:17:01 -05:00
Al
26a6d9684d [test] adding tests for root-only expansions. Mostly English tests for the moment to deal with the various edge cases, but is also important for Spanish where "Calle" is so common that it's often omitted, same with French and "rue", etc. 2017-12-17 20:00:48 -05:00
Al
ec4d683d1b Merge branch 'master' into lieu_api 2017-11-29 15:49:52 -05:00
Al
1a64ad682b [merge] merging in the Ohio expansion numex changes from master 2017-11-29 11:51:43 -05:00
Al Barrentine
18eb5ef9ee Merge pull request #272 from AeroXuk/master: Windows support via AppVeyor 2017-11-28 21:35:46 -05:00
AeroXuk
26ac9ab5c2 Removing EXPORT statements from all source files and most header files, leaving only the exports for the main API in libpostal.h. Modified Makefiles so that all the test apps build without having extra functions exported from libpostal. 2017-11-25 04:35:28 +00:00
Al
ebe7fc9be9 [test] missing paren in Columbus, OH test. Adding test for "oh" as part of a number in Nineteen oh one W El Segundo Blvd 2017-11-24 16:11:07 -05:00
Al
d7f22544b4 [test] adding an expansion test for the Columbus, OH case 2017-11-24 15:44:37 -05:00
Al
2f2d3da722 [test] test for utf8_equal_ignore_separators 2017-10-14 01:42:08 -04:00
Al
e81580287d [test] adding tests for ordinal suffix normalization 2017-04-19 20:59:36 -04:00
Al
8ec6e546f5 [test] adding more tests from the demo 2017-04-04 20:52:28 -04:00
Al
6d4c7984df [api] doing this now since we're bumping a major version. Using a libpostal prefix for all public header functions and definitions 2017-03-31 03:35:51 -04:00
Al
f7b695c642 [build] add /usr/local/include as default include path for test Makefile as well 2017-03-30 15:57:17 -04:00
Al
65fadbeea3 [fix] add CBLAS_LIBS in the test Makefile 2017-03-29 21:38:54 -04:00
Al
9636ef6393 [fix] typo 2017-03-29 18:55:03 -04:00
Al
0bd1bdb6f2 [test] adding Brazil and Romania parses for demo 2017-03-29 13:03:05 -04:00
Al
03ceb18a41 [test] adding US tests for parser demo queries 2017-03-28 15:04:00 -04:00
Al
61d008f349 [test] making some of the test cases simpler/easier so they don't fail. In general this should just be for examples that are/are going to be in the docs. Improving overall aggregate statistics like held-out accuracy over time is preferable to worrying about one individual test failure. 2017-03-26 20:27:32 -04:00
Al
3aaa628b25 [test] add LaSalle, Montréal tests 2017-03-21 14:24:13 -04:00
Al
1f1dbe25e1 [test] adding a number of user-contributed test cases from Moz in #21. Almost all are working under the CRF parser trained on 10% of the data. There are a few problematic ones in the UK still that have been omitted here. We currently don't correctly format the training data for the locality + postal town pattern, which are both considered "city" by libpostal and thus one will usually get lumped in with the road or something like that. There may also be some utility in modelling comma usage (training data has commas, but they're ignored by the parser both at train and run time - might be useful to train on them but drop out randomly so the parser doesn't become too dependent on having them) 2017-03-21 03:08:09 -04:00
Al
b8a12e0517 [test] adding parser test cases in 22 countries. These may change, and I'm generally against putting every obscure test case in the world in here. It's better to measure accuracy in aggregate statistics instead of individual test cases (i.e. if a particular change to the parser improves overall performance but fails one test case, should we accept the improvement?) The thought here is: these represent parses that are used in documentation/examples, as well as most of those that have been brought up in GitHub issues from the initial release, and we want these specific tests to work from build to build. If a model fails one of these test cases, it shouldn't get pushed to our users. 2017-03-20 00:58:52 -04:00
Al
8e3c9d0269 [test] adding test of new latin-ascii-simple transliterator which only handles things like HTML entities 2017-03-17 18:27:18 -04:00
Al
be07bfe35d [test] adding printfs on expansion test failure so it's more clear what's going on 2017-03-17 17:46:22 -04:00
Al
f4a9e9d673 [fix] don't compare a double to 0 2017-03-15 14:59:33 -04:00
Al
7afba832e5 [test] adding the new tests to the Makefile 2017-03-11 14:34:27 -05:00
Al
dd0bead63a [test/utils] also a good thing to sanity check (in C especially): string handling code 2017-03-10 01:15:23 -05:00
Al
adab8ab51a [test/crf] test for crf_context, adapted from crf1dc_debug_context in CRFsuite. Always a good idea to sanity check numerical code 2017-03-10 01:13:40 -05:00
Al
3e8ab0ba3f [test] New expansion test with all options set to false 2016-07-15 13:26:14 -04:00
Al
879da6214a [test] Adding test for ambiguous expansion 2016-04-29 13:27:41 -04:00
Al
378459fde8 [fix] vignt => vingt in French numex 2016-03-29 02:18:37 -04:00
Al
37cfe8ab3b [test] Adding automated parser tests to the C library 2016-02-17 17:19:10 -05:00
Al
cf2a79bef1 [api] Default options accessible through getters, not static structs 2016-02-15 17:34:00 -05:00
Al
166119f5c3 [test] Adding numex tests for the concatenated case 2016-02-10 09:26:12 -05:00
Al
ffde22395c [numex] Adding test cases where stopword should not get swallowed into the numeric expression 2016-02-10 08:30:53 -05:00
Al
87ef471f8b [test] Adding test cases for tokenized tail search 2016-02-08 17:56:28 -05:00
Al
4ed0474f7e [fix] test for normalized output on an expansion with an accented canonical 2016-02-08 12:32:14 -05:00
Al
9ac0379a65 [phrases] Case where trie search finds a match, makes progress beyond the next token but has to fall back. Adding trie search test case 2016-02-08 01:07:56 -05:00
Al
9b3296914a [build] Defining LIBPOSTAL_DATA_DIR at compile time, not configure 2016-01-30 02:18:12 -05:00