Al
|
6d4c7984df
|
[api] doing this now since we're bumping a major version. Using a libpostal prefix for all public header functions and definitions
|
2017-03-31 03:35:51 -04:00 |
|
Al
|
8deb1716cb
|
[parser] adding polymorphic (as much as C does polymorphism) model type for the parser to allow it to handle either the greedy averaged perceptron or a CRF. During training, saving, and loading, we use a different filename for a parser trained with a CRF, which is still backward-compatible with models previously trained in parser-data. Making necessary modifications to address_parser.c, address_parser_train.c, and address_parser_test.c. Also adding an option in address_parser_test to print individual errors in addition to the confusion matrix.
|
2017-03-10 19:28:21 -05:00 |
|
Al
|
242b1364ae
|
[parser] using new API in address_parser_test
|
2017-03-07 16:24:34 -05:00 |
|
Al
|
8ea5405c20
|
[parser] using separate arrays for features requiring tag history and making the tagger responsible for those features so the feature function does not require passing in prev and prev2 explicitly (i.e. don't need to run the feature function multiple times if using global best-sequence prediction)
|
2017-02-19 14:21:58 -08:00 |
|
Al
|
2f4bcaeec2
|
[parser] address_parser_test memory cleanup, add print-errors option to print individual parser errors on held-out data
|
2017-02-12 16:05:11 -05:00 |
|
Al
|
e62101b8bf
|
[parser] remove geodb from address_parser_test, sort confusion matrix
|
2016-12-29 02:14:40 -05:00 |
|
Al
|
e5e0cf3b92
|
[fix] loading transliteration module in address_parser_test.c as well
|
2016-07-21 17:04:57 -04:00 |
|
Al
|
6ef7c90278
|
[fix] using string_equals, handles NULLs
|
2016-01-05 14:08:10 -05:00 |
|
Al
|
24208c209f
|
[parsing] Adding a training-data-derived index of complete phrases from suburb up to country. Only adding bias and word features for non-phrases, using UNKNOWN_WORD and UNKNOWN_NUMERIC for infrequent tokens (not meeting minimum vocab count threshold).
|
2015-12-05 14:34:19 -05:00 |
|
Al
|
89677d94a3
|
[parsing] Initial commit of the address parser, training/testing, feature function, I/O
|
2015-11-30 14:48:13 -05:00 |
|