This website requires JavaScript.
Explore
Help
Sign In
tommy
/
libpostal
Watch
1
Star
0
Fork
0
You've already forked libpostal
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
1f1dbe25e1621aff3283cf3a13370af4a47d5cd4
libpostal
/
test
History
Al
1f1dbe25e1
[test] adding a number of user-contributed test cases from Moz in
#21
. Almost all are working under the CRF parser trained on 10% of the data. There are a few problematic ones in the UK still that have been omitted here. We currently don't correctly format the training data for locailty + postal town pattern, which are both considered "city" by libpostal and thus one will usually get lumped in with the road or something like that. There may also be some utility in modelling comma usage (training data has commas, but they're ignored by the parser both at train and run time - might be useful to train on them but drop out randomly so the parser doesn't become too dependent on having them)
2017-03-21 03:08:09 -04:00
..
greatest.h
[tests] Using greatest (
https://github.com/silentbicycle/greatest
) for automated testing
2016-01-28 16:31:32 -05:00
Makefile.am
[test] adding the new tests to the Makefile
2017-03-11 14:34:27 -05:00
test_crf_context.c
[fix] don't compare a double to 0
2017-03-15 14:59:33 -04:00
test_expand.c
[test] adding printfs on expansion test failure so it's more clear what's going on
2017-03-17 17:46:22 -04:00
test_numex.c
[fix] vignt => vingt in French numex
2016-03-29 02:18:37 -04:00
test_parser.c
[test] adding a number of user-contributed test cases from Moz in
#21
. Almost all are working under the CRF parser trained on 10% of the data. There are a few problematic ones in the UK still that have been omitted here. We currently don't correctly format the training data for locailty + postal town pattern, which are both considered "city" by libpostal and thus one will usually get lumped in with the road or something like that. There may also be some utility in modelling comma usage (training data has commas, but they're ignored by the parser both at train and run time - might be useful to train on them but drop out randomly so the parser doesn't become too dependent on having them)
2017-03-21 03:08:09 -04:00
test_string_utils.c
[test/utils] also a good thing to sanity check (in C especially): string handling code
2017-03-10 01:15:23 -05:00
test_transliterate.c
[test] adding test of new latin-ascii-simple transliterator which only handles things like HTML entities
2017-03-17 18:27:18 -04:00
test_trie.c
[phrases] Case where trie search finds a match, makes progress beyond the next token but has to fall back. Adding trie search test case
2016-02-08 01:07:56 -05:00
test.c
[test/utils] also a good thing to sanity check (in C especially): string handling code
2017-03-10 01:15:23 -05:00