1691 Commits

Author SHA1 Message Date
Al
8a8b4b6ee9 Merge branch 'Jeffrey04-ms-dictionary-expansion' 2016-10-06 20:31:03 -04:00
Al
03d0afb820 [fix] removing level types and given names from synonyms since they're already covered 2016-10-06 20:30:48 -04:00
Al
5f42e66f31 [fix] removing road/rd from the synonyms list for jalan as they're covered by the English dictionaries 2016-10-06 20:29:35 -04:00
Al
c4e147ed20 [fix] separating words that have different roots 2016-10-06 20:29:09 -04:00
Al
2c48acd680 [dictionaries] removing flat/rumah pangsa/pangsapuri from place_names, aliasing gim to gimnasium rather than the other way around, removing duplicate/mixed English + Malay line 2016-10-06 20:28:44 -04:00
Al
244dbbdd4a [fix] separating synonyms that are for different words 2016-10-06 20:27:15 -04:00
jeffrey04
b2305b574d removing english abbr 2016-10-04 11:30:28 +08:00
jeffrey04
57210bd657 each term should be in separate lines 2016-10-04 11:30:09 +08:00
jeffrey04
f5477a7369 each term should be in a separate line 2016-10-04 11:29:28 +08:00
jeffrey04
8ae8340bee remove shopping mall from list 2016-09-30 10:18:04 +08:00
jeffrey04
f43ba7fe63 removing english words from dictionary 2016-09-30 10:14:25 +08:00
jeffrey04
20b87ba5c8 removing ambiguous_expansion(s).txt 2016-09-30 10:01:13 +08:00
jeffrey04
2bae8075b0 initial commit of malay words 2016-09-28 18:41:15 +08:00
Al
01afbf80ef [data] Each curl process will retry the chunk up to 3 times 2016-08-25 23:18:39 -04:00
Travis
de1255af00 [auto][ci skip] Adding data files from Travis build #161 2016-08-23 22:48:20 +00:00
Al Barrentine
f03df6aab8 Merge pull request #108 from petacat/patch-5
Update toponyms.txt
2016-08-23 18:38:08 -04:00
Travis
f19c9852aa [auto][ci skip] Adding data files from Travis build #160 2016-08-23 22:24:19 +00:00
Travis
d797d6c863 [auto][ci skip] Adding data files from Travis build #159 2016-08-23 22:14:07 +00:00
Al Barrentine
d1991848a3 Merge pull request #106 from petacat/patch-3
Update place_names.txt
2016-08-23 18:09:47 -04:00
Al Barrentine
964b440380 Merge pull request #104 from petacat/patch-1
Update directionals.txt
2016-08-23 17:49:36 -04:00
Thomas Rosen
a787c25cdf Update toponyms.txt 2016-08-23 23:09:32 +02:00
Thomas Rosen
7e258f2d87 Update place_names.txt 2016-08-23 23:03:31 +02:00
Thomas Rosen
bd109dc9ca Update directionals.txt 2016-08-23 22:56:56 +02:00
Al
757a7ee15f [docs][ci skip] Moving parser examples up so they come before normalization 2016-08-10 01:16:07 -04:00
Al
7ff8e1a5cb [docs][ci skip] Moving OpenCollective folks to the top of the README 2016-08-10 01:14:45 -04:00
Al Barrentine
a277096c96 Merge pull request #72 from piamancini/patch-1
Added backers and sponsors from OpenCollective
2016-08-09 23:05:45 -04:00
Al Barrentine
3e3950b37a Merge pull request #98 from uberbaud/posix_sh
Use posix `sh` for systems without `bash`
2016-07-27 18:44:11 -04:00
Tom Davis
18c8e90eb3 Use xargs to start workers as soon as possible 2016-07-27 17:46:44 -04:00
Tom Davis
11abf6cb22 Use posix sh for systems without bash 2016-07-26 20:17:18 -04:00
Al Barrentine
65c4688f89 Merge pull request #97 from uberbaud/multipart_edgecase
Don't call `download_multipart` for 1 chunk
2016-07-24 00:03:51 -04:00
Travis
3f0eff228e [auto][ci skip] Adding data files from Travis build #145 2016-07-23 22:28:32 +00:00
Al
bedfd34363 [fix] small change to dictionary so generated file rebuilds 2016-07-23 18:18:36 -04:00
Al
e8beca0971 [fix] ReEscape backslash when escaping dictionary files 2016-07-23 18:16:44 -04:00
Tom Davis
2991ffd193 Don't call download_multipart for 1 chunk
Previously, where a file was larger than `$LARGE_FILE_SIZE` but smaller
than `$CHUNK_SIZE*2`, `download_multipart` would be called but would
only download one (1) chunk that was the whole file.

This fix keeps the same download performance as before but optimizes
processing chunks out.
2016-07-23 16:41:04 -04:00
Al Barrentine
e3eaa9efaf Merge pull request #93 from uberbaud/no_seq
Remove call to `seq` which may not exist
2016-07-23 01:16:57 -04:00
Tom Davis
24e0314e71 Remove call to seq which may not exist 2016-07-23 01:03:15 -04:00
Al Barrentine
e02c6adc85 Merge pull request #91 from uberbaud/openbsd
Add support for OpenBSD
2016-07-20 19:47:18 -04:00
Al Barrentine
08e4284f68 Merge pull request #89 from uberbaud/alt_build_path
Call libpostal_data in source path, not build path
2016-07-20 18:33:00 -04:00
Tom Davis
c0366147e8 Add support for OpenBSD 2016-07-20 18:19:31 -04:00
Tom Davis
a8bb798ce0 Call libpostal_data in source path, not build path
This fix updates Makefile to find the actual libpostal_data file when
`configure` is called from another directory, which it uses as the build
directory.
2016-07-20 17:31:52 -04:00
Travis
a0f6e100f1 [auto][ci skip] Adding data files from Travis build #133 2016-07-17 19:13:46 +00:00
Al
12d50aac12 Merge branch 'master' of https://github.com/openvenues/libpostal 2016-07-17 15:03:52 -04:00
Al
83381e9d8a [expand] Adding exception for a few types of special punctuation (ampersand, plus, pound sign) which should be left in the original string and separated by whitespace. Closes #84. Closes #85 2016-07-17 15:02:47 -04:00
Al
d8a0e19a32 [dictionaries] Adding hyphenated version of rr-row, should always normalize 2016-07-17 15:01:32 -04:00
Travis
2fb677ca73 [auto][ci skip] Adding data files from Travis build #132 2016-07-17 18:47:28 +00:00
Al
22c02151ea Merge branch 'master' of https://github.com/openvenues/libpostal 2016-07-17 14:37:02 -04:00
Al
ce7fb82fcc [dictionaries] RR-ROW for #87 and new abbreviation for right-of-way generally 2016-07-17 14:32:22 -04:00
Al Barrentine
7c4593f08e Merge pull request #86 from dnmfarrell/master
don't error on multiple setup_parser()
2016-07-17 13:35:48 -04:00
David Farrell
a7a9708d2b don't error on multiple setup_parser() 2016-07-17 11:25:03 -04:00
Al
c05c54c9f6 Merge branch 'null_options' 2016-07-17 01:57:10 -04:00