Commit Graph

49 Commits

Author SHA1 Message Date
Al
5af6dc77d1 [dictionaries] Adding a few additional abbreviated names of political leaders that come up, a missing abbreviation 2015-10-06 15:09:50 -04:00
Al
ed51fce291 [fix] Safe to assume Bokmål for Norwegian street addresses 2015-10-04 11:19:43 -04:00
Al
0cedc68a97 [languages] Changing Arabic to default in North African countries with two official languages. Making Danish secondary in the US Virgin Islands 2015-09-30 01:01:42 -04:00
Al
05da2ee6bd [dictionaries] Adding commonly used colon form No: for Turkish addresses 2015-09-28 17:48:19 -04:00
Al
e255ae0e09 [dictionaries] Luxembourgish dictionaries 2015-09-26 18:31:07 -04:00
Al
3fe56d029d [dictionaries] German Swiss dictionaries 2015-09-26 18:30:55 -04:00
Al
fa320defb7 [dictionaries] Afrikaans dictionaries for better disambiguatin in South Africa 2015-09-24 16:37:16 -04:00
Al
050a850fb9 [dictionaries] Dutch directionals, separating out the west vs westen forms 2015-09-24 16:36:52 -04:00
Al
fe5d665533 [dictionaries] Arc in English needn't always expand to Arcade 2015-09-24 16:36:21 -04:00
Al
bcac6a41be [dictionaries] Separating out Austrian toponym abbreviations 2015-09-24 16:35:56 -04:00
Al
22c16b43cf [languages] Italian is also the regional default in Valle D'Aosta and Trentino-Alto Adige 2015-09-10 11:09:13 -07:00
Al
c1da2fa94b [dictionaries] Adding 'Rang' to French dictionaries 2015-09-09 17:21:26 -07:00
Al
d13d4d7d28 [dictionaries] Adding English gazetteers as non-default to Georgia 2015-09-03 20:25:42 -04:00
Al
90f333b16c [languages] Adding English non-default dictionaries to a number of countries where English can be found in OSM 2015-08-24 02:49:49 -04:00
Al
c1ce91abbf [languages] Better handling of non-default langauge canonicals in default langauge text 2015-08-24 01:26:17 -04:00
Al
9f6f4feea1 [dictionaries/languages] Adding English gazetteers for Bahrain, pas abbreviation for paseo 2015-08-23 23:32:34 -04:00
Al
d14be57e73 [dictionaries] Adding exit as an English street type 2015-08-23 22:51:22 -04:00
Al
e26776a5e9 [dictionaries] Occitan stopwords for disambiguating from French 2015-08-23 16:35:46 -04:00
Al
43178747f8 [languages] Using stopwords only to account for how ambiguous a phrase is, not for disambiguation 2015-08-23 04:28:44 -04:00
Al
9c176961ff [dictionaries] Norwegian street types from the suffix dictionary 2015-08-23 02:32:44 -04:00
Al
122a81b610 [languages] non-default languages can still be labeled from > 1 char abbreviations if there's no evidence of other languages in the string. Adding Python version of get_string_script from the C lib 2015-08-23 02:26:06 -04:00
Al
a419dad630 [languages] Adding canonical back in to language disambiguation (for prefixes/suffixes too), using non-canonicals/abbreviations in non-default languages if there are no other abbreviations found, adding in stopwords dictionaries 2015-08-23 00:43:37 -04:00
Al
5c15c4a99f [languages] Adding non-default Spanish and French gazetteers to the US, and giving the country of Jersey shared English/French defaults instead of just English 2015-08-22 15:21:04 -04:00
Al
cc43409b72 [languages] Adding English gazetteers to many countries where the default language is Arabic but the road signs may be in English 2015-08-22 13:42:31 -04:00
Al
330002197a [fix] via in English is a stopword, not a street type 2015-08-18 16:00:48 -04:00
Al
089a197155 [dictionaries] Updates to Galician and Catalan where they overlap with Spanish 2015-08-18 13:14:21 -04:00
Al
faf3435ffc [fix] English dictionaries 2015-08-18 12:40:09 -04:00
Al
9183ba4e01 [dictionaries] Accented Gran Via for Catalan 2015-08-18 12:39:40 -04:00
Al
07b43e524e [dictionaries] A few more Catalan terms that are the same as in Spanish 2015-08-18 12:23:11 -04:00
Al
3b55b51ef1 [fix] English dictionary 2015-08-18 11:34:18 -04:00
Al
fb7f2999e5 [dictionaries] Moving a few terms in German dictionaries 2015-08-18 11:06:53 -04:00
Al
c5d14e9c4d [dictionaries] A few new terms in Dutch dictionaries to help distinguish from German 2015-08-18 11:06:10 -04:00
Al
4d115fdd88 [dictionaries] Better categorization of French dictionaries 2015-08-18 11:05:39 -04:00
Al
0f883a8872 [dictionaries] A few English dictionary terms that came up in language detection tests 2015-08-18 11:04:53 -04:00
Al
db7ffa7cab [dictionaries] Updating Catalan dictionaries with place types to help distinguish from Spanish 2015-08-18 11:03:44 -04:00
Al
a1d8d3bf5f [dictionaries] Fixes to Spanish dictionaries 2015-08-18 11:03:01 -04:00
Al
b8fbbb1917 [languages] Removing the Belarusian override as Russian appears to be used often in street signs and there are generally good name:ru/name:be tags 2015-08-17 04:20:39 -04:00
Al
453aa7c633 [dictionaries] Adding French as equally likely language for Guernesey, which will effectively exclude it from the language training data (doesn't matter since there's already enough English/French addresses). 2015-08-17 02:04:29 -04:00
Al
133ce9e5b1 [languages] Bonaire admin1 as well as country code 2015-08-14 21:42:13 -04:00
Al
191c0e3ce5 [languages] Changing Bonaire's default road sign language to Papiamento to help distinguish from Dutch 2015-08-14 21:06:16 -04:00
Al
ee982cd872 [dictionaries] Removing dictionaries/all/personal_suffixes, can add to languages as needed 2015-08-08 23:13:09 -04:00
Al
90cde298dd [dictionaries] condensed forms of sin numero in various languages 2015-08-02 21:19:55 -06:00
Al
6bf563ca89 [dictionaries] Italian abbreviations for strada 2015-07-28 19:15:30 -04:00
Al
3dc6115a4e [dictionaries] Updates to English and Spanish dictionaries on looking through a data set of real test addresses 2015-07-27 16:42:09 -04:00
Al
0ab1434f20 [numex] Making all languages except the ideographic writing systems (CJK) whole_tokens_only for numex. Otherwise non-number prefixes may accidentally get converted into numbers. May add some more options around this in the future. 2015-07-27 01:52:44 -04:00
Al
42f6be7434 [fix] county road 2015-07-25 14:19:38 -04:00
Al
cff72a0cb3 [dictionaries] Adding a few versions of the phrase "centro commerical" in French, Spanish and Italian after a review of addresses in those languages 2015-07-24 16:07:34 -04:00
Al
caf714f06f [fix] typo and frivolous key 2015-07-24 15:22:57 -04:00
Al
64a63fdf51 [mv] Moving all repo data files to a resources dir, data is only for runtime files 2015-07-21 18:11:36 -04:00