786c616116[fix] typo in English numeric expression rule
Al
2016-03-02 16:10:52 -05:00
d1f62ddc63[dictionaries] Re-generating address_expansion_data.c
Al
2016-03-02 16:10:30 -05:00
1ddc69d984[fix] var declaration during trie creation
Al
2016-03-02 16:05:28 -05:00
271800bbf9[dictionaries] Adding all single letter tokens to ambiguous dictionaries so libpostal will always include an unexpanded version (e.g. for initials)
Al
2016-03-02 15:52:38 -05:00
9283ea2c4e[dictionaries] Adding single letter abbreviations to ambiguous English dictionaries
Al
2016-03-02 15:29:47 -05:00
94cd9831bfMerge branch 'nyalldawson-aus_data'
Al
2016-03-01 18:37:49 -05:00
122397759d[dictionaries] Re-generating address_expansion_data.c
Al
2016-03-01 18:37:40 -05:00
2eac968107[fix] removing duplicates in English dictionaries
Al
2016-03-01 18:36:20 -05:00
6b13edfe36[dictionaries] A few edits to the AU-English dictionaries
Al
2016-03-01 15:57:43 -05:00
93aac04c17[docs] Adding blog post link and license to the README
Al
2016-02-29 23:16:05 -05:00
d35f97f6f1[fix] All file_read_uint64 calls that use stack variables read into a uint64_t not a size_t so as not to smash the stack under a 32-bit arch (issue #18)
Al
2016-02-29 22:36:00 -05:00
e5b2311253Merge pull request #29 from nyalldawson/misssing_dep
Al Barrentine
2016-02-29 11:11:33 -05:00
7a79b2e759Add curl dependancy for Ubuntu
Nyall Dawson
2016-02-29 09:10:04 +11:00
5f3747e15dAdd some missing AU dictionary entries
Nyall Dawson
2016-02-29 09:08:44 +11:00
3aba78d806Merge pull request #20 from federicomenaquintero/master
Al Barrentine
2016-02-25 16:22:23 -05:00
4eac38c40c[fix] Check the return of malloc() in geonames.c
Federico Mena Quintero
2016-02-25 14:52:12 -06:00
2ae2450db7[fix] Check the return of malloc() in numex.c
Federico Mena Quintero
2016-02-25 14:33:18 -06:00
b172071d3b[fix] Remove superfluous #define; the caller actually uses sizeof(DEFAULT_ALPHABET) itself
Federico Mena Quintero
2016-02-25 11:42:50 -06:00
10c6768b5b[fix] Don't leak the trie if the number of nodes can't be read from a file
Federico Mena Quintero
2016-02-25 11:38:28 -06:00
e60ad47677[fix] Check return of malloc() in trie.c
Federico Mena Quintero
2016-02-25 11:37:00 -06:00
7371a88c53[docs][ci skip] Adding note about PHP binding
Al
2016-02-25 00:16:13 -05:00
52ebc9fc46[fix] Paths relative to the current file in address_dictionaries.py so it can be run from anywhere
Al
2016-02-24 13:10:44 -05:00
87cf63942e[dictionaries] Regenerating address_expansion_data.c
Al
2016-02-22 18:39:38 -05:00
366a4a454aMerge branch 'riordan-NY-SAM-Abbreviations'
Al
2016-02-22 18:26:35 -05:00
f349dbac01[dictionaries] Removing Rue from English street types (addresses using Rue as the street type are actually in French, which is a valid, detectable, regional language for the US in libpostal). Some of the abbreviations also belong in the synonyms or place names dictionaries.
Al
2016-02-22 18:26:02 -05:00
21cf7245a7Added additional street types from NY State Post-Address Types
Riordan
2016-02-22 17:27:14 -05:00
dda204ce71[fix][ci skip] Installation instruction tweaks
Al
2016-02-22 00:55:31 -05:00
fa11dbd181[fix][ci skip] smaller GIF take deux
Al
2016-02-22 00:48:33 -05:00
fe89f5d27a[fix][ci skip] Adding expand_address usage GIF at normal size
Al
2016-02-22 00:30:38 -05:00
724c102259[fix][ci skip] Python example, then C
Al
2016-02-22 00:23:20 -05:00
d95c74ef9e[fix][ci skip] heading for consistency
Al
2016-02-22 00:15:53 -05:00
c19781c724[docs][ci skip] README updates, usage GIFs
Al
2016-02-22 00:12:23 -05:00
82c05cacb1[docs][ci skip] Adding note about the new Java/JNI bindings
Al
2016-02-19 16:43:02 -05:00
1c0237beb7[docs][ci skip] Adding teardown calls to the C examples
Al
2016-02-18 21:10:31 -05:00
37cfe8ab3b[test] Adding automated parser tests to the C library
Al
2016-02-17 17:19:03 -05:00
43f41230b6[docs][ci skip] README fix
Al
2016-02-16 22:12:18 -05:00
ba243a5d58[docs][ci skip] Adding C examples to the README in addition to the Python examples
Al
2016-02-16 22:02:39 -05:00
3925a288c2[docs][ci skip] Adding note about Golang bindings
Al
2016-02-16 21:40:26 -05:00
4349a45b0f[build] Adding pkg-config support
Al
2016-02-16 11:31:02 -05:00
37c09d1ed9[api] Adding function to free expansions from expand_address
Al
2016-02-15 20:13:12 -05:00
98165e89ad[api] Using bools instead of bit fields in the public API
Al
2016-02-15 18:33:39 -05:00
cf2a79bef1[api] Default options accessible through getters, not static structs
Al
2016-02-15 17:16:21 -05:00
f5ae95eb70[docs][ci skip] Adding note about Ruby bindings to README
Al
2016-02-14 22:03:40 -05:00
e5239b00db[docs][ci skip] Moving CI badge back below heading
Al
2016-02-10 09:54:51 -05:00
2b9d502ffa[docs][ci skip] CI badge positioning in README
Al
2016-02-10 09:49:38 -05:00
c5e715f6e5[docs][ci skip] README updates for batch mode, notes about batch testing
Al
2016-02-10 09:46:59 -05:00
166119f5c3[test] Adding numex tests for the concatenated case
Al
2016-02-10 09:26:12 -05:00
98c395d34c[numex] Concatenating a string of numeric expressions with no intervening tokens like Seventeen Eighty or Ten Oh Four
Al
2016-02-10 09:21:31 -05:00
ffde22395c[numex] Adding test cases where stopword should not get swallowed into the numeric expression
Al
2016-02-10 08:30:53 -05:00
59cf5bfc62[numex] Fixing cases with stopwords not attached to a numeric expression
Al
2016-02-10 08:30:01 -05:00
c32ef9ccf8[fix] freeing up iterator in normalize_string
Al
2016-02-09 01:06:51 -05:00
87ef471f8b[test] Adding test cases for tokenized tail search
Al
2016-02-08 17:56:28 -05:00
12c2477359[phrases] Another fix to tail token search
Al
2016-02-08 17:55:21 -05:00
39f162b029[phrases] fix in tokenized tail search when whitespace tokens are preserved
Al
2016-02-08 16:37:52 -05:00
4ed0474f7e[fix] test for normalized output on an expansion with an accented canonical
Al
2016-02-08 12:32:14 -05:00
84d5ba18f0[api] Fixing multi-language expansions with overlapping expansions, whitespace, utf8 normalization of canonical strings
Al
2016-02-08 02:50:34 -05:00
0695738253[fix] cleaning up memory in normalize_string_languages
Al
2016-02-08 02:43:12 -05:00
afd5844f21[normalize] Permuting transliterators only once on the entire string rather than at each script break (so # permutations is bounded and can't get huge). Fixing some spacing issues. Adding method to check for an alpha+numeric token in normalization.
Al
2016-02-08 01:16:47 -05:00
393fd7e0f3[build] Using env var for data dir in geodata build script
Al
2016-02-08 01:11:42 -05:00
aaad213a20[cli] Adding printf while models are being loaded in address parser cli
Al
2016-02-08 01:10:06 -05:00
9ac0379a65[phrases] Case where trie search finds a match, makes progress beyond the next token but has to fall back. Adding trie search test case
Al
2016-02-08 01:07:56 -05:00
3701d8380f[cli] Command-line expansion client now supports piping in stdin, Unix-style
Al
2016-02-03 13:48:51 -05:00
90b9bb097c[docs] Adding German example back to README parser examples
Al
2016-02-02 13:05:42 -05:00
7536fa4647[fix] static inline
Al Barrentine
2016-02-02 00:53:13 -05:00
27d65d6a7e[docs] README example using more likely address with fewer commas
Al Barrentine
2016-02-02 00:11:10 -05:00
8411632829[docs] README title
Al Barrentine
2016-02-02 00:08:30 -05:00
79ce23db9b[fix] capitalization
Al Barrentine
2016-02-01 23:58:39 -05:00
7fd7becf66[docs] shorter title
Al Barrentine
2016-02-01 23:57:42 -05:00
39e0185da1[docs] README changes
Al Barrentine
2016-02-01 23:56:17 -05:00
6dcc71d87b[docs] README changes, code examples
Al
2016-02-01 17:16:48 -05:00
2100b80f81[docs] Adding build status to README
Al
2016-01-30 14:59:45 -05:00
6444647a00[fix] quote
Al
2016-01-30 14:48:14 -05:00
3c53926d0c[build] Using gcc 4.8 in Travis as it compiles faster. travis_wait no longer required
Al
2016-01-30 14:31:10 -05:00
6265ff9fa1[fix] absolute path in travis build
Al
2016-01-30 14:24:17 -05:00
9c787e2b99[build] using local datadir in Travis build
Al
2016-01-30 14:02:04 -05:00
c0b548833b[fix] create data dir if it doesn't exist
Al
2016-01-30 13:40:10 -05:00
1e65fafaaf[fix] char *
Al
2016-01-30 13:39:36 -05:00
f8de9d8e5a[fix] static methods in numex table loading, mallocs instead of stack variables
Al
2016-01-30 13:25:48 -05:00
cfe08c07e2[build] Adding travis_wait on make, as compiling the scanner still takes a while on the older GCC in Ubuntu 12.04
Al
2016-01-30 13:24:30 -05:00
085bfd6ada[fix] static methods for libpostal.c
Al
2016-01-30 02:20:59 -05:00
63d239eef0[tokenization] Using the new re2c 0.16 generates a 75% smaller DFA for scanner, should speed up compile times on gcc
Al
2016-01-30 02:20:01 -05:00
9b3296914a[build] Defining LIBPOSTAL_DATA_DIR at compile time, not configure
Al
2016-01-30 02:18:04 -05:00
45607e659e[fix] AC_HEADER_STDBOOL
Al
2016-01-29 15:55:41 -05:00
b5eaf79fab[build] Running bootstrap before script
Al
2016-01-29 02:45:34 -05:00
b716eabfa6[fix] Trying build with autoconf/automake/libtool
Al
2016-01-29 02:24:41 -05:00
2ad51786e1[fix] Autoconf macros may be missing in Travis build, adding autoconf-archive
Al
2016-01-29 02:17:59 -05:00
50e27ffe35[fix] trying build with pkg-config
Al
2016-01-29 02:14:28 -05:00
d89f2ed844[build] Travis build
Al
2016-01-29 02:10:37 -05:00
a30026fce7[build] shebang line in bootstrap.sh
Al
2016-01-28 18:32:07 -05:00
40fb530c06[docs] Adding note tot he README about running tests
Al
2016-01-28 18:02:38 -05:00
0da3774b1e[tests] Relative includes in tests to fix compilation issues on gcc/Ubuntu
Al
2016-01-28 17:04:02 -05:00
76445c4bf7[fix] Makefile dir
Al
2016-01-28 16:43:17 -05:00
cd76c660d8[fix] French numex
Al
2016-01-28 16:40:50 -05:00
8702ee931c[tests] Main test program and Makefile
Al
2016-01-28 16:37:29 -05:00
d318db15f1[tests] Numeric expression parsing tests
Al
2016-01-28 16:36:20 -05:00
aa4272ed9e[tests] Transliteration tests
Al
2016-01-28 16:36:09 -05:00