Commit Graph

793 Commits

Author SHA1 Message Date
Al
17cfdb0625 [fix] adding char_array_append_* methods to header 2015-09-18 13:19:42 -04:00
Al
f2f7db92ff [fix] phrases 2015-09-18 13:19:18 -04:00
Al
b74e92adad [fix] include 2015-09-18 13:18:49 -04:00
Al
2a869894d9 [fix] geodb 2015-09-18 13:18:26 -04:00
Al
9e9131bda0 [parser] Averaged perceptron tagger 2015-09-17 05:51:24 -04:00
Al
8a86f7ec64 [parser] Adding context struct to feature function 2015-09-17 05:48:00 -04:00
Al
87ed7d9a0f [geodb] Adding trie search methods for finding geodb phrases 2015-09-16 22:11:10 -04:00
Al
e62c75b9c6 [phrases] Adding _with_phrases versions of address dictionary methods for pre-allocated phrases 2015-09-16 21:24:28 -04:00
Al
23103a21d4 [phrases] Adding with_phrases versions of trie search methods for pre-allocated phrases 2015-09-16 21:23:34 -04:00
Al
d5ec005787 [transliteration] Similar init method for transliteration 2015-09-16 21:14:02 -04:00
Al
b11362ab98 [numex] using module init method for building, otherwise passing NULL path uses the default path 2015-09-16 21:13:05 -04:00
Al
3cba2e8df3 [api] Using default setup methods for submodules in libpostal setup 2015-09-15 14:01:33 -04:00
Al
e122824448 [expansion] Adding the ability to search address dictionary phrases with a NULL language, will return phrases in any language 2015-09-15 14:00:26 -04:00
Al
c47ff1b113 [utils] Adding source string to tokenized_string struct 2015-09-15 13:21:51 -04:00
Al
b2f690b6f6 [api] Error logging if modules can't be found 2015-09-15 13:21:15 -04:00
Al
9de3029dd3 [parser] Averaged perceptron training does full examples (greedily). During training, features are a hashtable, sorted and converted to a trie during finalize 2015-09-14 17:38:45 -04:00
Al
a5b5f80b04 [fix] new_copy 2015-09-14 16:50:23 -04:00
Al
3ea6358f77 [fix] vector zeros allocation 2015-09-14 16:50:08 -04:00
Al
c21f61b9b4 [parser] Default address parser path 2015-09-11 15:05:38 -07:00
Al
32c180528f [tokens] Adding a copy_tokens option for tokenized_string 2015-09-11 15:04:29 -07:00
Al
9ce658b7a3 [collections] Adding string_array for an array of char pointers 2015-09-10 16:34:16 -07:00
Al
35b9122a1a [utils] inlining a few functions 2015-09-10 16:33:54 -07:00
Al
35f1c02caf [polygons] Reducing simplify tolerance for language polys now that regional languages are handled separately 2015-09-10 12:44:13 -07:00
Al
440a8158b6 [polygons] Adding in country languages for regional polygons without a default language 2015-09-10 12:34:26 -07:00
Al
22c16b43cf [languages] Italian is also the regional default in Valle D'Aosta and Trentino-Alto Adige 2015-09-10 11:09:13 -07:00
Al
fca7f21b1d [polygons] Making simplify_tolerance and preserve_topology for polygon simplification configurable per class 2015-09-10 11:06:18 -07:00
Al
6a5b01b51b [parser] Averaged perceptron training 2015-09-10 10:26:24 -07:00
Al
0ddf50cb5f [utils] add to feature array with printf syntax 2015-09-10 10:24:51 -07:00
Al
b3f89a207a [utils] Version of string_split for single character delimiters which modifies the input string directly rather than creating (essentially) a copy 2015-09-09 18:07:31 -07:00
Al
c1da2fa94b [dictionaries] Adding 'Rang' to French dictionaries 2015-09-09 17:21:26 -07:00
Al
b85fe50fad [osm] Training data for toponyms only cares about valid languages for name field 2015-09-08 16:38:05 -07:00
Al
607a607b71 [doc] documentation fix for averaged perceptron 2015-09-08 16:37:23 -07:00
Al
c80d8b8067 [parsing] Averaged perceptron model data structure for storing the finalized, averaged, sparse weights 2015-09-08 12:42:54 -07:00
Al
8d642b45b9 [fix] trie was returning early on add_at_index and not incrementing the num_keys 2015-09-08 11:41:46 -07:00
Al
e566063343 [osm] Doing an all-to-nodes conversion and an additional filter on the borders data set 2015-09-08 09:18:08 -07:00
Al
ae7e30634b [features] Adding counter/bag-of-words representation of features 2015-09-08 00:17:26 -07:00
Al
49d389b9d8 [refactor] changing names in int-valued hash tables 2015-09-08 00:15:14 -07:00
Al
2fffd76af8 [fix] typo 2015-09-07 23:58:34 -07:00
Al
aa454c4430 [fix] removing char_array_copy from header 2015-09-07 23:58:05 -07:00
Al
3fd6552b44 [fix] void not void * in vector *_copy 2015-09-07 23:57:44 -07:00
Al
cddffdb65f [math] Adding column and row sums to sparse matrices 2015-09-07 00:34:00 -07:00
Al
8525529968 [osm] Not requiring qualified name tags to process OSM toponyms 2015-09-06 21:03:01 -07:00
Al
9d2ca08fc2 [utils] Adding _copy and _new_copy methods to vectors (the former copies data to a pre-allocated vector, the latter allocates a new vector) 2015-09-06 21:01:26 -07:00
Al
49fe504201 [math] Matrix get value at row, column index 2015-09-06 12:37:10 -07:00
Al
ec3ab7234a [utils] Adding index to cstring_array_foreach, similar to Python's enumerate 2015-09-04 19:34:06 -04:00
Al
df20e2cbc0 [osm] Including toponyms in the training data for countries where the unqualified place names can be assumed to be examples of a given language 2015-09-04 14:13:33 -04:00
Al
17fcfa8b59 [fix] adding house to ignore keys rather than aliasing it 2015-09-04 12:40:08 -04:00
Al
d64a27bc57 [osm] Converting relations to nodes in borders training data 2015-09-04 12:32:25 -04:00
Al
168b7f59da [fix] default indices in strip_component 2015-09-04 12:29:47 -04:00
Al
64db63e3eb [osm] Removing house tag 2015-09-04 12:23:47 -04:00