Commit Graph

  • 786c616116 [fix] typo in English numeric expression rule Al 2016-03-02 16:10:52 -05:00
  • d1f62ddc63 [dictionaries] Re-generating address_expansion_data.c Al 2016-03-02 16:10:30 -05:00
  • 1ddc69d984 [fix] var declaration during trie creation Al 2016-03-02 16:05:28 -05:00
  • 271800bbf9 [dictionaries] Adding all single letter tokens to ambiguous dictionaries so libpostal will always include an unexpanded version (e.g. for initials) Al 2016-03-02 15:52:38 -05:00
  • 9283ea2c4e [dictionaries] Adding single letter abbreviations to ambiguous English dictionaries Al 2016-03-02 15:29:47 -05:00
  • 94cd9831bf Merge branch 'nyalldawson-aus_data' Al 2016-03-01 18:37:49 -05:00
  • 122397759d [dictionaries] Re-generating address_expansion_data.c Al 2016-03-01 18:37:40 -05:00
  • 2eac968107 [fix] removing duplicates in English dictionaries Al 2016-03-01 18:36:20 -05:00
  • 6b13edfe36 [dictionaries] A few edits to the AU-English dictionaries Al 2016-03-01 15:57:43 -05:00
  • a6167159a6 Merge branch 'aus_data' of https://github.com/nyalldawson/libpostal into nyalldawson-aus_data Al 2016-03-01 13:40:18 -05:00
  • fd1f1a0b74 [fix][ci skip] README link Al 2016-03-01 01:28:42 -05:00
  • ecc8e199d3 [fix] README license link Al 2016-02-29 23:40:35 -05:00
  • d5e211bb97 Merge branch 'master' of https://github.com/openvenues/libpostal Al 2016-02-29 23:16:14 -05:00
  • 93aac04c17 [docs] Adding blog post link and license to the README Al 2016-02-29 23:16:05 -05:00
  • d35f97f6f1 [fix] All file_read_uint64 calls that use stack variables read into a uint64_t not a size_t so as not to smash the stack under a 32-bit arch (issue #18) Al 2016-02-29 22:36:00 -05:00
  • e5b2311253 Merge pull request #29 from nyalldawson/misssing_dep Al Barrentine 2016-02-29 11:11:33 -05:00
  • 7a79b2e759 Add curl dependency for Ubuntu Nyall Dawson 2016-02-29 09:10:04 +11:00
  • 5f3747e15d Add some missing AU dictionary entries Nyall Dawson 2016-02-29 09:08:44 +11:00
  • 3aba78d806 Merge pull request #20 from federicomenaquintero/master Al Barrentine 2016-02-25 16:22:23 -05:00
  • 4eac38c40c [fix] Check the return of malloc() in geonames.c Federico Mena Quintero 2016-02-25 14:52:12 -06:00
  • 2ae2450db7 [fix] Check the return of malloc() in numex.c Federico Mena Quintero 2016-02-25 14:33:18 -06:00
  • b172071d3b [fix] Remove superfluous #define; the caller actually uses sizeof(DEFAULT_ALPHABET) itself Federico Mena Quintero 2016-02-25 11:42:50 -06:00
  • 10c6768b5b [fix] Don't leak the trie if the number of nodes can't be read from a file Federico Mena Quintero 2016-02-25 11:38:28 -06:00
  • e60ad47677 [fix] Check return of malloc() in trie.c Federico Mena Quintero 2016-02-25 11:37:00 -06:00
  • 7371a88c53 [docs][ci skip] Adding note about PHP binding Al 2016-02-25 00:16:13 -05:00
  • 52ebc9fc46 [fix] Paths relative to the current file in address_dictionaries.py so it can be run from anywhere Al 2016-02-24 13:10:44 -05:00
  • 87cf63942e [dictionaries] Regenerating address_expansion_data.c Al 2016-02-22 18:39:38 -05:00
  • 366a4a454a Merge branch 'riordan-NY-SAM-Abbreviations' Al 2016-02-22 18:26:35 -05:00
  • f349dbac01 [dictionaries] Removing Rue from English street types (addresses using Rue as the street type are actually in French, which is a valid, detectable, regional language for the US in libpostal). Some of the abbreviations also belong in the synonyms or place names dictionaries. Al 2016-02-22 18:26:02 -05:00
  • 21cf7245a7 Added additional street types from NY State Post-Address Types Riordan 2016-02-22 17:27:14 -05:00
  • dda204ce71 [fix][ci skip] Installation instruction tweaks Al 2016-02-22 00:55:31 -05:00
  • fa11dbd181 [fix][ci skip] smaller GIF take deux Al 2016-02-22 00:48:33 -05:00
  • fe89f5d27a [fix][ci skip] Adding expand_address usage GIF at normal size Al 2016-02-22 00:30:38 -05:00
  • 724c102259 [fix][ci skip] Python example, then C Al 2016-02-22 00:23:20 -05:00
  • d95c74ef9e [fix][ci skip] heading for consistency Al 2016-02-22 00:15:53 -05:00
  • c19781c724 [docs][ci skip] README updates, usage GIFs Al 2016-02-22 00:12:23 -05:00
  • 82c05cacb1 [docs][ci skip] Adding note about the new Java/JNI bindings Al 2016-02-19 16:43:02 -05:00
  • 1c0237beb7 [docs][ci skip] Adding teardown calls to the C examples Al 2016-02-18 21:10:31 -05:00
  • 37cfe8ab3b [test] Adding automated parser tests to the C library Al 2016-02-17 17:19:03 -05:00
  • 43f41230b6 [docs][ci skip] README fix Al 2016-02-16 22:12:18 -05:00
  • ba243a5d58 [docs][ci skip] Adding C examples to the README in addition to the Python examples Al 2016-02-16 22:02:39 -05:00
  • 3925a288c2 [docs][ci skip] Adding note about Golang bindings Al 2016-02-16 21:40:26 -05:00
  • 4349a45b0f [build] Adding pkg-config support Al 2016-02-16 11:31:02 -05:00
  • 37c09d1ed9 [api] Adding function to free expansions from expand_address Al 2016-02-15 20:13:12 -05:00
  • 98165e89ad [api] Using bools instead of bit fields in the public API Al 2016-02-15 18:33:39 -05:00
  • cf2a79bef1 [api] Default options accessible through getters, not static structs Al 2016-02-15 17:16:21 -05:00
  • f5ae95eb70 [docs][ci skip] Adding note about Ruby bindings to README Al 2016-02-14 22:03:40 -05:00
  • e5239b00db [docs][ci skip] Moving CI badge back below heading Al 2016-02-10 09:54:51 -05:00
  • 2b9d502ffa [docs][ci skip] CI badge positioning in README Al 2016-02-10 09:49:38 -05:00
  • c5e715f6e5 [docs][ci skip] README updates for batch mode, notes about batch testing Al 2016-02-10 09:46:59 -05:00
  • 166119f5c3 [test] Adding numex tests for the concatenated case Al 2016-02-10 09:26:12 -05:00
  • 98c395d34c [numex] Concatenating a string of numeric expressions with no intervening tokens like Seventeen Eighty or Ten Oh Four Al 2016-02-10 09:21:31 -05:00
  • ffde22395c [numex] Adding test cases where stopword should not get swallowed into the numeric expression Al 2016-02-10 08:30:53 -05:00
  • 59cf5bfc62 [numex] Fixing cases with stopwords not attached to a numeric expression Al 2016-02-10 08:30:01 -05:00
  • c32ef9ccf8 [fix] freeing up iterator in normalize_string Al 2016-02-09 01:06:51 -05:00
  • 87ef471f8b [test] Adding test cases for tokenized tail search Al 2016-02-08 17:56:28 -05:00
  • 12c2477359 [phrases] Another fix to tail token search Al 2016-02-08 17:55:21 -05:00
  • 39f162b029 [phrases] fix in tokenized tail search when whitespace tokens are preserved Al 2016-02-08 16:37:52 -05:00
  • 4ed0474f7e [fix] test for normalized output on an expansion with an accented canonical Al 2016-02-08 12:32:14 -05:00
  • 84d5ba18f0 [api] Fixing multi-language expansions with overlapping expansions, whitespace, utf8 normalization of canonical strings Al 2016-02-08 02:50:34 -05:00
  • 0695738253 [fix] cleaning up memory in normalize_string_languages Al 2016-02-08 02:43:12 -05:00
  • afd5844f21 [normalize] Permuting transliterators only once on the entire string rather than at each script break (so # permutations is bounded and can't get huge). Fixing some spacing issues. Adding method to check for an alpha+numeric token in normalization. Al 2016-02-08 01:16:47 -05:00
  • 393fd7e0f3 [build] Using env var for data dir in geodata build script Al 2016-02-08 01:11:42 -05:00
  • aaad213a20 [cli] Adding printf while models are being loaded in address parser cli Al 2016-02-08 01:10:06 -05:00
  • 9ac0379a65 [phrases] Case where trie search finds a match, makes progress beyond the next token but has to fall back. Adding trie search test case Al 2016-02-08 01:07:56 -05:00
  • 3701d8380f [cli] Command-line expansion client now supports piping in stdin, Unix-style Al 2016-02-03 13:48:51 -05:00
  • 90b9bb097c [docs] Adding German example back to README parser examples Al 2016-02-02 13:05:42 -05:00
  • 7536fa4647 [fix] static inline Al Barrentine 2016-02-02 00:53:13 -05:00
  • 27d65d6a7e [docs] README example using more likely address with fewer commas Al Barrentine 2016-02-02 00:11:10 -05:00
  • 8411632829 [docs] README title Al Barrentine 2016-02-02 00:08:30 -05:00
  • 79ce23db9b [fix] capitalization Al Barrentine 2016-02-01 23:58:39 -05:00
  • 7fd7becf66 [docs] shorter title Al Barrentine 2016-02-01 23:57:42 -05:00
  • 39e0185da1 [docs] README changes Al Barrentine 2016-02-01 23:56:17 -05:00
  • 6dcc71d87b [docs] README changes, code examples Al 2016-02-01 17:16:48 -05:00
  • 2100b80f81 [docs] Adding build status to README Al 2016-01-30 14:59:45 -05:00
  • 6444647a00 [fix] quote Al 2016-01-30 14:48:14 -05:00
  • 3c53926d0c [build] Using gcc 4.8 in Travis as it compiles faster. travis_wait no longer required Al 2016-01-30 14:31:10 -05:00
  • 6265ff9fa1 [fix] absolute path in travis build Al 2016-01-30 14:24:17 -05:00
  • 9c787e2b99 [build] using local datadir in Travis build Al 2016-01-30 14:02:04 -05:00
  • c0b548833b [fix] create data dir if it doesn't exist Al 2016-01-30 13:40:10 -05:00
  • 1e65fafaaf [fix] char * Al 2016-01-30 13:39:36 -05:00
  • f8de9d8e5a [fix] static methods in numex table loading, mallocs instead of stack variables Al 2016-01-30 13:25:48 -05:00
  • cfe08c07e2 [build] Adding travis_wait on make, as compiling the scanner still takes a while on the older GCC in Ubuntu 12.04 Al 2016-01-30 13:24:30 -05:00
  • 085bfd6ada [fix] static methods for libpostal.c Al 2016-01-30 02:20:59 -05:00
  • 63d239eef0 [tokenization] Using the new re2c 0.16 generates a 75% smaller DFA for scanner, should speed up compile times on gcc Al 2016-01-30 02:20:01 -05:00
  • 9b3296914a [build] Defining LIBPOSTAL_DATA_DIR at compile time, not configure Al 2016-01-30 02:18:04 -05:00
  • 45607e659e [fix] AC_HEADER_STDBOOL Al 2016-01-29 15:55:41 -05:00
  • b5eaf79fab [build] Running bootstrap before script Al 2016-01-29 02:45:34 -05:00
  • b716eabfa6 [fix] Trying build with autoconf/automake/libtool Al 2016-01-29 02:24:41 -05:00
  • 2ad51786e1 [fix] Autoconf macros may be missing in Travis build, adding autoconf-archive Al 2016-01-29 02:17:59 -05:00
  • 50e27ffe35 [fix] trying build with pkg-config Al 2016-01-29 02:14:28 -05:00
  • d89f2ed844 [build] Travis build Al 2016-01-29 02:10:37 -05:00
  • a30026fce7 [build] shebang line in bootstrap.sh Al 2016-01-28 18:32:07 -05:00
  • 40fb530c06 [docs] Adding note to the README about running tests Al 2016-01-28 18:02:38 -05:00
  • 0da3774b1e [tests] Relative includes in tests to fix compilation issues on gcc/Ubuntu Al 2016-01-28 17:04:02 -05:00
  • 76445c4bf7 [fix] Makefile dir Al 2016-01-28 16:43:17 -05:00
  • cd76c660d8 [fix] French numex Al 2016-01-28 16:40:50 -05:00
  • 8702ee931c [tests] Main test program and Makefile Al 2016-01-28 16:37:29 -05:00
  • d318db15f1 [tests] Numeric expression parsing tests Al 2016-01-28 16:36:20 -05:00
  • aa4272ed9e [tests] Transliteration tests Al 2016-01-28 16:36:09 -05:00