Commit Graph

1233 Commits

Author SHA1 Message Date
Al
57040b8733 [docs] README fixes 2015-12-21 17:45:55 -05:00
Al
ceda863e9f [fix] Encode strings as JSON in address parser cli 2015-12-21 17:45:09 -05:00
Al
e55ff54be1 [fix] Adding Korean-Latin-BGN to excluded transliterators 2015-12-21 16:24:50 -05:00
Al
c7fb7f685d [transliteration] Fixing group replacement in transliteration in the case of multiple groups, not adding to phrase length when checking context 2015-12-21 16:06:04 -05:00
Al
682c316775 [transliteration] Removing Korean-Latin-BGN, not a great transliterator and AFAICT, ICU doesn't use it either 2015-12-21 12:45:45 -05:00
Al
ab124465e6 [fix] regenerating transliteration data 2015-12-20 15:41:42 -05:00
Al
ccf509edb1 [fix] update to control characters for generating the transliteration rules 2015-12-20 15:40:38 -05:00
Al
5439f4679f [fix] Special tokens like emails/urls/phone numbers bypass normalization 2015-12-20 03:07:36 -05:00
Al
cf2a0efa11 [fix] Prefixes and suffixes that are the same length as the original token should be handled as regular expansions 2015-12-19 17:29:26 -05:00
Al
aaecd7961a [fix] Options out of order 2015-12-19 15:05:50 -05:00
Al
48cb2b5c7b [api] Node was complaining about non-trivial designated initializers (probably the bit fields), so converting to old-school initializer 2015-12-19 02:34:31 -05:00
Al
97906c86a8 [fix] Strip punctuation in final output in cases where there are no expansions 2015-12-19 02:10:41 -05:00
Al
4497c4501e [fix] do not add a token if prefix/suffix expansions are inseparable and canonical 2015-12-19 01:36:02 -05:00
Al
f8da44e8b0 [fix] Making a copy even on pure Latin-script transliteration since string_trim modifies in-place, occasionally causes issues 2015-12-19 01:31:56 -05:00
Al
39e83961ef [fix] Bug in suffix expansion affecting inseparable suffixes like burg as well as ordinal suffixes like first=>1st 2015-12-19 01:30:08 -05:00
Al
b2a944830a [transliteration] Making sure the Python script to generate transliteration data works on the new CLDR format 2015-12-19 00:34:30 -05:00
Al
b4a8a69226 [expansion] Fixing extra space on prefix/suffix expansions 2015-12-18 20:28:59 -05:00
Al
df47dad817 [fix] Partial matches, ultimate misses in concatenated suffixes 2015-12-18 17:37:06 -05:00
Al
66073c17d5 [fix] Handling case of concatenated suffixes like straße when they stand alone 2015-12-18 17:17:35 -05:00
Al
b71755bf7f [fix] Moving Python bindings up-front in the README 2015-12-17 14:28:36 -05:00
Al
31ed88bf6a [api] Adding a --json option to expand cli 2015-12-17 13:46:55 -05:00
Al
41ea105bb4 [api] Simple JSON encoding for strings, UTF-8 rather than Unicode 2015-12-17 12:25:05 -05:00
Al
af78614f62 [fix] Print usage info on -h/--help to libpostal cli 2015-12-16 22:21:13 -05:00
Al
f4ee9c2645 [fix] task list 2015-12-16 20:38:29 -05:00
Al
54cc1b8b2d [fix] Python syntax highlighting for README instructions 2015-12-16 02:25:56 -05:00
Al Barrentine
f3b4a4e894 Merge pull request #11 from nvkelso/master
andthus > and thus in Transliteration section
2015-12-16 02:22:55 -05:00
Al
59cc6d3417 [docs] README updates, better explanations of normalization and parsing 2015-12-16 02:19:10 -05:00
Nathaniel V. KELSO
11a9c47cea Merge pull request #1 from nvkelso/nvkelso/readme-translit-typo
andthus > and thus in Transliteration section
2015-12-15 22:45:35 -08:00
Nathaniel V. KELSO
7ff7027cdb andthus > and thus in Transliteration section 2015-12-15 22:45:07 -08:00
Al
3e44910664 [fix] Note about ldconfig 2015-12-16 00:48:22 -05:00
Al
ef941a6634 [fix] README parses 2015-12-15 16:18:22 -05:00
Al
c787821e96 [fix] README 2015-12-15 16:16:16 -05:00
Al
6cccc3ee46 [fix] README addition 2015-12-15 16:07:21 -05:00
Al
d1833a8f8f [docs] Updating README with parsing info/examples 2015-12-15 16:00:58 -05:00
Al
83ba053373 [build] Removing setup.py fanciness. Install the C library first, then run setup.py or pip install 2015-12-15 14:31:58 -05:00
Al
e0c0ed2d04 [numex] Return true if numex table already loaded 2015-12-15 14:28:40 -05:00
Al
7e04017851 [fix] default for libdir 2015-12-15 12:21:49 -05:00
Al
40641209ee [build] Build shared lib in site-packages 2015-12-15 12:19:40 -05:00
Al
04430f1a8e [fix] var 2015-12-15 10:51:56 -05:00
Al
d8f731b672 [build] setup.py include/library dirs 2015-12-15 10:50:57 -05:00
Al
faf8b00596 [python] libpostal includes 2015-12-15 02:56:02 -05:00
Al
d2426d3777 [build] build_ext 2015-12-15 02:31:48 -05:00
Al
cb648b63da [build] Adding include and library dirs based on autoconf prefix 2015-12-15 02:21:15 -05:00
Al
7cf48acd20 [fix] standard headers in new extensions 2015-12-15 01:18:33 -05:00
Al
bec43750d5 [build] bumping Python version 2015-12-15 00:58:11 -05:00
Al
33fdb912b6 [build] setup.py changes for parser extension 2015-12-15 00:56:53 -05:00
Al
c40ab06dd6 [python] Forgot expand.py 2015-12-15 00:56:34 -05:00
Al
842ef4526b [python] Adding address parser Python API 2015-12-15 00:55:41 -05:00
Al
b9bf5c629e [fix] Moving address_parser_response_destroy into libpostal so caller can free 2015-12-15 00:52:24 -05:00
Al
ab3ba249d7 [python/build] Modified install command for setup.py allowing --datadir and --prefix to be passed in. If there's a virtualenv active and nothing else is specified, install libpostal and its data files there by default 2015-12-14 18:21:21 -05:00