Commit Graph

876 Commits

Author SHA1 Message Date
Al
c790a2b87f [fix] spoken/official 2015-10-02 19:50:11 -04:00
Al
db3364be30 [geonames] Using official country languages in GeoNames 2015-10-01 02:21:14 -04:00
Al
01856dd36d [fix] acronyms 2015-10-01 00:24:04 -04:00
Al
562aeb497d [tokenization] Regenerating scanner.c 2015-09-30 11:32:38 -04:00
Al
689b830ad2 [tokenization] Acronym vs abbreviation 2015-09-30 04:10:04 -04:00
Al
7dfbcce9ec [languages] options for get_country_languages 2015-09-30 04:09:07 -04:00
Al
86e9166ae8 [doc] doumentation for country_names module, fixing variable name 2015-09-30 03:08:04 -04:00
Al
42e77cb570 [countries] Making country official names align better with OSM/Wikipedia, plugging holes 2015-09-30 01:03:03 -04:00
Al
0cedc68a97 [languages] Changing Arabic to default in North African countries with two official languages. Making Danish secondary in the US Virgin Islands 2015-09-30 01:01:42 -04:00
Al
40cf247655 [formatting] Constants for field names, a few options in format_address 2015-09-29 23:03:37 -04:00
Al
22e8178a97 [countries] Adding module for getting official country names in every language from CLDR + a dictionary of local language names 2015-09-29 21:10:38 -04:00
Al
c3c6a18df8 [geodb] Renaming geodb 2015-09-29 13:07:50 -04:00
Al
8ca22247f9 [fix] labels in averaged perceptron trainer 2015-09-29 13:07:07 -04:00
Al
6666f0baf8 [fix] Labels in averaged perceptron tagger 2015-09-29 13:06:34 -04:00
Al
05da2ee6bd [dictionaries] Adding commonly used colon form No: for Turkish addresses 2015-09-28 17:48:19 -04:00
Al
daad1a1313 [geonames] Removing alternate names from geonames data set which are digits-only (most are not legitimate) 2015-09-28 17:46:53 -04:00
Al
12816d0e95 [api] Setting global objects to NULL on teardown 2015-09-28 17:27:57 -04:00
Al
abfa744d59 [build] Adding libpostal_data script for downloading data from S3, Makefile uses that now as part of the all-local target. Can be run periodically after install 2015-09-28 17:26:15 -04:00
Al
f29f2f091b [fix] PEBCAK 2015-09-27 22:49:27 -04:00
Al
93b3110a49 [fix] only commas and hyphens need to be eliminated at the end of phrases in untagged address formatting 2015-09-27 19:25:34 -04:00
Al
d3bfaf6b43 [osm/formatting] Fixing formatting tagged addresses with comma separated fields 2015-09-27 03:19:23 -04:00
Al
d512201e2c [fix] removing space from tokens in address formatting 2015-09-27 02:18:34 -04:00
Al
a3214b7914 [readme] Readme fixes and additions 2015-09-26 23:32:19 -04:00
Al
5b829cd5a7 [fix] blank values containing punctuation in formatting 2015-09-26 21:49:28 -04:00
Al
dac0440be8 [fix] rsplit 2015-09-26 21:07:54 -04:00
Al
e255ae0e09 [dictionaries] Luxembourgish dictionaries 2015-09-26 18:31:07 -04:00
Al
3fe56d029d [dictionaries] German Swiss dictionaries 2015-09-26 18:30:55 -04:00
Al
ae93552455 [osm/formatting] Moving back to openvenues repo pending resolution of the Turkish address issue 2015-09-26 03:56:52 -04:00
Al
0c792a2cc3 [osm/formatting] Changing the way the formatter elimiates inter-component separators, changing repo back to OpenCageData after pull request merge 2015-09-26 03:21:26 -04:00
Al
856198a352 [tokenization] Regenerated scanner.c 2015-09-26 02:27:45 -04:00
Al
07f1f361e2 [transliteration] Regenerating transliteration data with new categories 2015-09-26 00:07:39 -04:00
Al
172263af58 [tokenization] Adding updated token classes to scanner.re 2015-09-26 00:05:23 -04:00
Al
5417b4e602 [unicode] Downloading latest UnicodeData.txt instead of using builtin Python module (out of date) e.g. for getting unicode codepoint categories 2015-09-25 23:59:38 -04:00
Al
8fe791a14a [fix] ensure_dir in file downloads 2015-09-25 17:05:22 -04:00
Al
646b9f7248 [osm/formatting] Continuing to use openvenues formatter for the India fix 2015-09-25 13:36:24 -04:00
Al
5a6b47d0fd [api] Adding LIBPOSTAL_DEFAULT_OPTIONS to libpostal.h 2015-09-25 01:53:29 -04:00
Al
f5bb72c6f5 [readme] missed a dictionary type 2015-09-24 23:32:36 -04:00
Al
f243b9cfa6 [fix] phrasing 2015-09-24 23:30:03 -04:00
Al
dc31019604 [readme] Heading 2015-09-24 23:20:23 -04:00
Al
cfef3059bb [readme] Moving paragraph 2015-09-24 23:19:53 -04:00
Al
f62cfb9551 [readme] README changes 2015-09-24 23:16:07 -04:00
Al
3e256404b9 [readme] More informative README 2015-09-24 23:02:09 -04:00
Al
9901dd2aac [fix] Switching address formatter back to OpenCageData repo 2015-09-24 18:42:17 -04:00
Al
accd8a57e7 [expansion] Regenerating expansion data 2015-09-24 16:38:20 -04:00
Al
fa320defb7 [dictionaries] Afrikaans dictionaries for better disambiguatin in South Africa 2015-09-24 16:37:16 -04:00
Al
050a850fb9 [dictionaries] Dutch directionals, separating out the west vs westen forms 2015-09-24 16:36:52 -04:00
Al
fe5d665533 [dictionaries] Arc in English needn't always expand to Arcade 2015-09-24 16:36:21 -04:00
Al
bcac6a41be [dictionaries] Separating out Austrian toponym abbreviations 2015-09-24 16:35:56 -04:00
Al
3ce1669c30 [fix] import 2015-09-24 01:25:00 -04:00
Al
c85ce0b11d [osm/formatting] Tagging separators as well in tagged output of the address formatter 2015-09-24 01:22:49 -04:00