Commit Graph

20 Commits

Author SHA1 Message Date
Al
7bd1336b3b [fix] Freeing languages in Python 2015-12-31 01:46:04 -05:00
Al
faf8b00596 [python] libpostal includes 2015-12-15 02:56:02 -05:00
Al
7cf48acd20 [fix] standard headers in new extensions 2015-12-15 01:18:33 -05:00
Al
c40ab06dd6 [python] Forgot expand.py 2015-12-15 00:56:34 -05:00
Al
842ef4526b [python] Adding address parser Python API 2015-12-15 00:55:41 -05:00
Al
7af0e2d967 [python] Adding Python bindings to the expand API 2015-12-14 18:18:16 -05:00
Al
0f52f97621 [fix] Python 3 version of tokenize/normalize 2015-12-14 18:14:57 -05:00
Al
3401045b4f [fix] changing labels in Python normalize, adding a NULL check 2015-12-14 14:59:57 -05:00
Al
cbeb08f1d1 [python/normalize] importing options from the C module 2015-10-30 12:34:07 -04:00
Al
e7f783477f [python/normalize] Adding remove parentheses options in Python normalize (would require compiling with the scanner to do it from C, but could switch) 2015-10-30 01:27:16 -04:00
Al
cee9da05d6 [fix] using tokenize_raw API 2015-10-28 21:37:44 -04:00
Al
9a92a1154d [python] Making normalized_tokens return token classes as well, mimicking the tokenize API 2015-10-27 17:07:50 -04:00
Al
9f6e1387a0 [fix] Error condition in Python tokenize 2015-10-27 13:33:28 -04:00
Al
40918812e2 [normalize] Adding hyphen elimination as a string option (changes tokenization) 2015-10-27 13:32:47 -04:00
Al
f6b6a17335 [python/normalization] Adding Python bindings to the normalize module for use in OSM polygon matching 2015-10-26 18:07:53 -04:00
Al
8a188903b3 [python] Using tuples in pytokenize instead of list, pre-allocating 2015-10-26 18:04:13 -04:00
Al
4f784060a3 [python] Adding word_token_types 2015-10-25 18:33:09 -04:00
Al
236737eab3 [tokenization/osm] Using utf8 encoded version of string for tokens in python tokenizer 2015-09-21 17:27:43 -04:00
Al
5b2fd0be50 [fix] pytokenize compilation on Ubuntu/gcc 2015-09-21 03:24:14 -04:00
Al
5485ea2197 [python] Adding initial pypostal bindings for tokenize so we can remove address_normalizer dependency. Not tested on Python 3. 2015-09-20 14:59:39 -04:00