Commit Graph

74 Commits

Author SHA1 Message Date
Al
f349dbac01 [dictionaries] Removing Rue from English street types (addresses using Rue as the street type are actually in French, which is a valid, detectable, regional language for the US in libpostal). Some of the abbreviations also belong in the synonyms or place names dictionaries. 2016-02-22 18:26:02 -05:00
Riordan
21cf7245a7 Added additional street types from NY State Post-Address Types
NY State [posted their state address database](http://gis.ny.gov/gisdata/inventories/details.cfm?DSID=921) along with the metadata standards for consuming it. Included in this metadata is a [guide to abbreviations](http://gis.ny.gov/gisdata/supportfiles/Updated-Deliverable-Schema.pdf), incorporated in this commit.
2016-02-22 17:27:14 -05:00
Al
cd76c660d8 [fix] French numex 2016-01-28 16:40:50 -05:00
Al
299998d8b5 [languages] Making Basque the only default in the Basque region. 2016-01-24 19:35:03 -05:00
Al
b735c79326 [languages] Adding Spanish in as a secondary default in Spain to supplement regional language defaults so we're more careful in disambiguation 2016-01-24 16:34:23 -05:00
Al
87aff60a7e [dictionaries] Gulch 2016-01-24 03:23:40 -05:00
Al
cb914ae85b [dictionaries] Adding a few terms to English dictionaries for automated disambiguation in the US/Canada 2016-01-24 03:15:10 -05:00
Al
89aa039692 [dictionaries] Adding some Italian month abbreviations 2016-01-21 15:12:46 -05:00
Al
5385cb71d6 [languages] Adding English dictionaries to Indonesia 2016-01-17 22:08:06 -05:00
Al
24b4a680c3 [languages] Adding English dictionaries for Bangladesh 2016-01-14 13:36:07 -05:00
Al
edebdf73e0 [dictionaries] Using long forms as canonical for English degrees as new language models may do some auto-abbreviating 2016-01-14 13:35:41 -05:00
Al
81624f8b6d [dictionaries] All professional suffixes should use the abbreviated form as the canonical 2015-12-31 13:14:29 -05:00
Al
7906f5542d [dictionaries] ulitsa is the proper transliteration for Russian 2015-12-31 03:49:51 -05:00
Al
cc89b768d8 [dictionaries] New Japanese abbreviations from the OSM wiki 2015-12-31 01:32:42 -05:00
Al
ffe9c2a971 [dictionaries] Santi/SS in Italian 2015-12-31 01:32:21 -05:00
Al
ecfdbc3ec2 [dictionaries] New German toponym abbreviations from the OSM wiki 2015-12-31 01:32:00 -05:00
Al
a6f7924f12 [dictionaries] Adding service road to English 2015-12-31 01:31:27 -05:00
Al
684c238ca0 [dictionaries] Adding no to English ambiguous 2015-12-31 01:31:01 -05:00
Al
2c254ebc5e [fix] Belgium cities again 2015-12-08 23:09:28 -05:00
Al
f252869671 [dictionaries] adding ste to English dictionaries 2015-12-08 22:29:52 -05:00
Al
35db855819 [fix] canonical index in address expansion data, should be -1 for all canonical phrases 2015-12-08 15:09:51 -05:00
Al
bfc517ae42 [fix] Belgium districts 2015-12-07 22:11:11 -05:00
Al
4dba0c54e4 [dictionaries] Adding state abbreviations for US, CA and AU into dictionaries 2015-12-06 16:47:36 -05:00
Al
470bd17c07 [formatting] Adding configs for a few dozen countries mapping OSM admin level to an address formatter field 2015-11-19 18:07:54 -05:00
Al
094a5bf5f4 [dictionaries] adding Jnr and Snr forms for generational suffixes 2015-10-28 00:00:34 -04:00
Al
5af6dc77d1 [dictionaries] Adding a few additional abbreviated names of political leaders that come up, a missing abbreviation 2015-10-06 15:09:50 -04:00
Al
ed51fce291 [fix] Safe to assume Bokmål for Norwegian street addresses 2015-10-04 11:19:43 -04:00
Al
0cedc68a97 [languages] Changing Arabic to default in North African countries with two official languages. Making Danish secondary in the US Virgin Islands 2015-09-30 01:01:42 -04:00
Al
05da2ee6bd [dictionaries] Adding commonly used colon form No: for Turkish addresses 2015-09-28 17:48:19 -04:00
Al
e255ae0e09 [dictionaries] Luxembourgish dictionaries 2015-09-26 18:31:07 -04:00
Al
3fe56d029d [dictionaries] German Swiss dictionaries 2015-09-26 18:30:55 -04:00
Al
fa320defb7 [dictionaries] Afrikaans dictionaries for better disambiguatin in South Africa 2015-09-24 16:37:16 -04:00
Al
050a850fb9 [dictionaries] Dutch directionals, separating out the west vs westen forms 2015-09-24 16:36:52 -04:00
Al
fe5d665533 [dictionaries] Arc in English needn't always expand to Arcade 2015-09-24 16:36:21 -04:00
Al
bcac6a41be [dictionaries] Separating out Austrian toponym abbreviations 2015-09-24 16:35:56 -04:00
Al
22c16b43cf [languages] Italian is also the regional default in Valle D'Aosta and Trentino-Alto Adige 2015-09-10 11:09:13 -07:00
Al
c1da2fa94b [dictionaries] Adding 'Rang' to French dictionaries 2015-09-09 17:21:26 -07:00
Al
d13d4d7d28 [dictionaries] Adding English gazetteers as non-default to Georgia 2015-09-03 20:25:42 -04:00
Al
90f333b16c [languages] Adding English non-default dictionaries to a number of countries where English can be found in OSM 2015-08-24 02:49:49 -04:00
Al
c1ce91abbf [languages] Better handling of non-default langauge canonicals in default langauge text 2015-08-24 01:26:17 -04:00
Al
9f6f4feea1 [dictionaries/languages] Adding English gazetteers for Bahrain, pas abbreviation for paseo 2015-08-23 23:32:34 -04:00
Al
d14be57e73 [dictionaries] Adding exit as an English street type 2015-08-23 22:51:22 -04:00
Al
e26776a5e9 [dictionaries] Occitan stopwords for disambiguating from French 2015-08-23 16:35:46 -04:00
Al
43178747f8 [languages] Using stopwords only to account for how ambiguous a phrase is, not for disambiguation 2015-08-23 04:28:44 -04:00
Al
9c176961ff [dictionaries] Norwegian street types from the suffix dictionary 2015-08-23 02:32:44 -04:00
Al
122a81b610 [languages] non-default languages can still be labeled from > 1 char abbreviations if there's no evidence of other languages in the string. Adding Python version of get_string_script from the C lib 2015-08-23 02:26:06 -04:00
Al
a419dad630 [languages] Adding canonical back in to language disambiguation (for prefixes/suffixes too), using non-canonicals/abbreviations in non-default languages if there are no other abbreviations found, adding in stopwords dictionaries 2015-08-23 00:43:37 -04:00
Al
5c15c4a99f [languages] Adding non-default Spanish and French gazetteers to the US, and giving the country of Jersey shared English/French defaults instead of just English 2015-08-22 15:21:04 -04:00
Al
cc43409b72 [languages] Adding English gazetteers to many countries where the default language is Arabic but the road signs may be in English 2015-08-22 13:42:31 -04:00
Al
330002197a [fix] via in English is a stopword, not a street type 2015-08-18 16:00:48 -04:00