Al
|
c7fb7f685d
|
[transliteration] Fixing group replacement in transliteration in the case of multiple groups, not adding to phrase length when checking context
|
2015-12-21 16:06:04 -05:00 |
|
Al
|
682c316775
|
[transliteration] Removing Korean-Latin-BGN, not a great transliterator and AFAICT, ICU doesn't use it either
|
2015-12-21 12:45:45 -05:00 |
|
Al
|
ab124465e6
|
[fix] regenerating transliteration data
|
2015-12-20 15:41:42 -05:00 |
|
Al
|
ccf509edb1
|
[fix] update to control characters for generating the transliteration rules
|
2015-12-20 15:40:38 -05:00 |
|
Al
|
5439f4679f
|
[fix] Special tokens like emails/urls/phone numbers bypass normalization
|
2015-12-20 03:07:36 -05:00 |
|
Al
|
cf2a0efa11
|
[fix] Prefixes and suffixes that are the same length as the original token should be handled as regular expansions
|
2015-12-19 17:29:26 -05:00 |
|
Al
|
aaecd7961a
|
[fix] Options out of order
|
2015-12-19 15:05:50 -05:00 |
|
Al
|
48cb2b5c7b
|
[api] Node was complaining about non-trivial designated initializers (probably the bit fields), so converting to old-school initializer
|
2015-12-19 02:34:31 -05:00 |
|
Al
|
97906c86a8
|
[fix] Strip punctuation in final output in cases where there are no expansions
|
2015-12-19 02:10:41 -05:00 |
|
Al
|
4497c4501e
|
[fix] do not add a token if prefix/suffix expansions are inseparable and canonical
|
2015-12-19 01:36:02 -05:00 |
|
Al
|
f8da44e8b0
|
[fix] Making a copy even on pure Latin-script transliteration since string_trim modifies in-place, occasionally causes issues
|
2015-12-19 01:31:56 -05:00 |
|
Al
|
39e83961ef
|
[fix] Bug in suffix expansion affecting inseparable suffixes like burg as well as ordinal suffixes like first=>1st
|
2015-12-19 01:30:08 -05:00 |
|
Al
|
b2a944830a
|
[transliteration] Making sure the Python script to generate transliteration data works on the new CLDR format
|
2015-12-19 00:34:30 -05:00 |
|
Al
|
b4a8a69226
|
[expansion] Fixing extra space on prefix/suffix expansions
|
2015-12-18 20:28:59 -05:00 |
|
Al
|
df47dad817
|
[fix] Partial matches, ultimate misses in concatenated suffixes
|
2015-12-18 17:37:06 -05:00 |
|
Al
|
66073c17d5
|
[fix] Handling case of concatenated suffixes like straße when they stand alone
|
2015-12-18 17:17:35 -05:00 |
|
Al
|
b71755bf7f
|
[fix] Moving Python bindings up-front in the README
|
2015-12-17 14:28:36 -05:00 |
|
Al
|
31ed88bf6a
|
[api] Adding a --json option to expand cli
|
2015-12-17 13:46:55 -05:00 |
|
Al
|
41ea105bb4
|
[api] Simple JSON encoding for strings, UTF-8 rather than Unicode
|
2015-12-17 12:25:05 -05:00 |
|
Al
|
af78614f62
|
[fix] Print usage info on -h/--help to libpostal cli
|
2015-12-16 22:21:13 -05:00 |
|
Al
|
f4ee9c2645
|
[fix] task list
|
2015-12-16 20:38:29 -05:00 |
|
Al
|
54cc1b8b2d
|
[fix] Python syntax highlighting for README instructions
|
2015-12-16 02:25:56 -05:00 |
|
Al Barrentine
|
f3b4a4e894
|
Merge pull request #11 from nvkelso/master
andthus > and thus in Transliteration section
|
2015-12-16 02:22:55 -05:00 |
|
Al
|
59cc6d3417
|
[docs] README updates, better explanations of normalization and parsing
|
2015-12-16 02:19:10 -05:00 |
|
Nathaniel V. KELSO
|
11a9c47cea
|
Merge pull request #1 from nvkelso/nvkelso/readme-translit-typo
andthus > and thus in Transliteration section
|
2015-12-15 22:45:35 -08:00 |
|
Nathaniel V. KELSO
|
7ff7027cdb
|
andthus > and thus in Transliteration section
|
2015-12-15 22:45:07 -08:00 |
|
Al
|
3e44910664
|
[fix] Note about ldconfig
|
2015-12-16 00:48:22 -05:00 |
|
Al
|
ef941a6634
|
[fix] README parses
|
2015-12-15 16:18:22 -05:00 |
|
Al
|
c787821e96
|
[fix] README
|
2015-12-15 16:16:16 -05:00 |
|
Al
|
6cccc3ee46
|
[fix] README addition
|
2015-12-15 16:07:21 -05:00 |
|
Al
|
d1833a8f8f
|
[docs] Updating README with parsing info/examples
|
2015-12-15 16:00:58 -05:00 |
|
Al
|
83ba053373
|
[build] Removing setup.py fanciness. Install the C library first, then run setup.py or pip install
|
2015-12-15 14:31:58 -05:00 |
|
Al
|
e0c0ed2d04
|
[numex] Return true if numex table already loaded
|
2015-12-15 14:28:40 -05:00 |
|
Al
|
7e04017851
|
[fix] default for libdir
|
2015-12-15 12:21:49 -05:00 |
|
Al
|
40641209ee
|
[build] Build shared lib in site-packages
|
2015-12-15 12:19:40 -05:00 |
|
Al
|
04430f1a8e
|
[fix] var
|
2015-12-15 10:51:56 -05:00 |
|
Al
|
d8f731b672
|
[build] setup.py include/library dirs
|
2015-12-15 10:50:57 -05:00 |
|
Al
|
faf8b00596
|
[python] libpostal includes
|
2015-12-15 02:56:02 -05:00 |
|
Al
|
d2426d3777
|
[build] build_ext
|
2015-12-15 02:31:48 -05:00 |
|
Al
|
cb648b63da
|
[build] Adding include and library dirs based on autoconf prefix
|
2015-12-15 02:21:15 -05:00 |
|
Al
|
7cf48acd20
|
[fix] standard headers in new extensions
|
2015-12-15 01:18:33 -05:00 |
|
Al
|
bec43750d5
|
[build] bumping Python version
|
2015-12-15 00:58:11 -05:00 |
|
Al
|
33fdb912b6
|
[build] setup.py changes for parser extension
|
2015-12-15 00:56:53 -05:00 |
|
Al
|
c40ab06dd6
|
[python] Forgot expand.py
|
2015-12-15 00:56:34 -05:00 |
|
Al
|
842ef4526b
|
[python] Adding address parser Python API
|
2015-12-15 00:55:41 -05:00 |
|
Al
|
b9bf5c629e
|
[fix] Moving address_parser_response_destroy into libpostal so caller can free
|
2015-12-15 00:52:24 -05:00 |
|
Al
|
ab3ba249d7
|
[python/build] Modified install command for setup.py allowing --datadir and --prefix to be passed in. If there's a virtualenv active and nothing else is specified, install libpostal and its data files there by default
|
2015-12-14 18:21:21 -05:00 |
|
Al
|
7af0e2d967
|
[python] Adding Python bindings to the expand API
|
2015-12-14 18:18:16 -05:00 |
|
Al
|
b59c830ba6
|
[fix] warning about size_t
|
2015-12-14 18:17:09 -05:00 |
|
Al
|
406f9c533d
|
[api] Separating parser setup/teardown into two separate methods
|
2015-12-14 18:15:57 -05:00 |
|