2a10dd16d5[fix] Afrikaans expansion
Al
2016-06-28 13:17:29 -04:00
fcfd28f23a[fix] Fixes to address configs
Al
2016-06-28 13:16:59 -04:00
0b119becaf[numex] Estonian ordinal indicators are just .
Al
2016-06-28 13:12:08 -04:00
8023c1e86a[numex] Finnish ordinals can also use .
Al
2016-06-28 13:11:44 -04:00
4412ba1177[test] Adding tests for address configs
Al
2016-06-28 13:10:47 -04:00
d3a6a032ab[fix] a few errors with non-numbers in numeric_phrase
Al
2016-06-28 13:08:38 -04:00
be5fd79a48[expansion] Prefix/suffix expansions by default can apply to ADDRESS_ANY but also inherit the types of any dictionary that lists their canonical form (so we can add suffixes without worrying about whether they're for streets or place names, etc.)
Al
2016-06-28 02:37:38 -04:00
8072b01023[dictionaries] Adding concatenated suffixes to street types, adding universitat as a suffix
Al
2016-06-28 01:31:04 -04:00
96d4c64ebd[addresses] Use bostad in Swedish addresses in Finland
Al
2016-06-28 01:29:10 -04:00
2505afa2b9[addresses] Adding new configs
Al
2016-06-27 03:06:54 -04:00
dfd29911fd[addresses] Implementing Roman numerals and cardinal/ordinal number spellout in numbering base class
Al
2016-06-27 03:06:33 -04:00
79bc859692[addresses] Italian address config
Al
2016-06-27 03:05:50 -04:00
46a82aef89[dictionaries] Italian dictionaries to support sub-building config
Al
2016-06-27 03:05:06 -04:00
579dafc6e0[addresses] Slovak address config
Al
2016-06-27 03:04:41 -04:00
540e4be7b2[dictionaries] Slovak dictionaries to support sub-building config
Al
2016-06-27 03:04:16 -04:00
02f19c4df0[addresses] Czech sub-building config
Al
2016-06-27 03:03:57 -04:00
c16bd2768a[dictionaries] Czech dictionaries to support sub-building config
Al
2016-06-27 03:03:47 -04:00
078aa20930[numex] ordinal suffixes for Czech/Slovak
Al
2016-06-27 03:03:24 -04:00
6ab8041618[dictionaries] Ampersand in Polish/Russian
Al
2016-06-27 03:02:39 -04:00
faed055803[dictionaries] Numero sign in Italian
Al
2016-06-27 03:02:02 -04:00
efa75919e6[dictionaries] numero sign in French
Al
2016-06-27 03:01:43 -04:00
ee71d94e85[addresses] Adding Roman numerals to the Polish config for floor numbers
Al
2016-06-27 03:01:10 -04:00
11c6564783[addresses] Russian address config
Al
2016-06-26 01:24:00 -04:00
7bc459f1a9[dictionaries] Russian dictionaries to support address configs
Al
2016-06-26 01:23:47 -04:00
53052e6d25[addresses] Polish address config and dictionary updates
Al
2016-06-25 20:36:07 -04:00
558d643042[numex] Portuguese ordinals fix
Al
2016-06-25 20:32:31 -04:00
b15675f8cb[addresses/dictionaries] Adding rez-de-chaussée bas and rez-de-chaussée haut in French
Al
2016-06-25 20:32:03 -04:00
d89e9dcd04[dictionaries] Variations on sin numero for Spanish
Al
2016-06-25 20:30:02 -04:00
ee27dc5ea1[addresses/dictionaries] Updates to Portuguese configs, variations for Brasil
Al
2016-06-25 20:29:36 -04:00
8a5dd26dbf[numex] Adding method to do cardinal number spellout by hundreds e.g. twenty-three seventeen instead of two thousand three three hundred seventeen
Al
2016-06-25 13:36:10 -04:00
eee68d1ca5[numex] Ordinal spellout using the numex configs
Al
2016-06-25 13:35:03 -04:00
c628b9bee8[dictionaries] English cross streets
Al
2016-06-24 16:12:33 -04:00
8383d5bb12[numex] Adding numeric expression spellout in the Python geodata module for generating training data
Al
2016-06-24 16:06:59 -04:00
53ea1c139a[osm/addresses] using new is_numeric in AddressComponents expansion and removing venue names that are identical to the house number
Al
2016-06-23 13:59:40 -04:00
8926293063[parser/cli] Using NFC normalization on the output in the parser client (closes#30). Optional command-line arg for parser output dir, useful for spot-checking different experiments
Al
2016-06-22 11:56:35 -04:00
44908ff95a[parser] No digit normalization in training data-derived parser phrases (for postcodes, etc.), phrases include the new island type, house number phrases if any are valid. Adjacent words are now full phrases if they are part of a multiword token like a city name. For hyphenated names like Carmel-by-the-Sea, adding a version to the phrase dictionary where the hyphens are replaced with spaces
Al
2016-06-22 11:50:42 -04:00
41ae742285[fix] tokenized trie search when falling off the trie at the start of a valid phrase
Al
2016-06-21 15:48:43 -04:00
6e60b3bbda[fix] semicolon in #define
Al
2016-06-21 15:16:14 -04:00
0f76c8c631[dictionaries] Portuguese abbreviations
Al
2016-06-16 19:18:02 +02:00
b8aba86471[addresses] Implementing unit types which use concatenated floors with offsets for basement (e.g. Norway)
Al
2016-06-16 01:45:43 +02:00
c29d1ad947[addresses] Implementing number_min_abs_value, number_max_abs_value outside of number_abs_value constraint
Al
2016-06-16 01:44:12 +02:00
589497cb16[addresses] Adding Portuguese sub-building config
Al
2016-06-16 01:43:03 +02:00
2be41732f8[dictionaries] Portuguese dictionaries to support sub-building config
Al
2016-06-16 01:42:21 +02:00
1bd62313f4[dictionaries] Adding e/ to ambiguous in Spanish dictionaries
Al
2016-06-16 01:41:54 +02:00
6b7e4f8515[dictionaries] Adding No to Germanic-language number synonyms
Al
2016-06-16 01:41:06 +02:00
619127e4b1[fix] adding back staircase in Swedish sub-building config
Al
2016-06-15 23:39:16 +02:00
bc70a54b09[addresses] Swedish address config
Al
2016-06-15 16:32:24 +02:00
b622315d0f[addresses] Lower probability of null phrase in Norwegian configs
Al
2016-06-15 16:29:53 +02:00
ac22f270bb[dictionaries] Swedish dictionaries to support sub-building config
Al
2016-06-15 16:29:26 +02:00
d8ddae362f[addresses] venstre in Norway rather than igjen
Al
2016-06-15 14:22:25 +02:00
cd9b33983a[addresses] Adding parterre for ground floor in Switzerland
Al
2016-06-15 14:20:42 +02:00
a61d9b1548[dictionaries] adding phrases meaning 'near' or 'in' for Norwegian to the dictionaries
Al
2016-06-15 03:13:28 +02:00
541fe6c5ac[dictionaries] no standalone level types for Norway
Al
2016-06-15 03:12:54 +02:00
06fdf1c532[fix] /underetasje/hovedetasje/ in Norwegian and translating category phrases from Danish
Al
2016-06-15 03:12:24 +02:00
0222049b88[addresses] Danish level/unit and entrance/unit combinations
Al
2016-06-15 02:55:25 +02:00
03b9825390[addresses/units] Adding special handling for floor phrase + unit concatenation in the unit field (handles bruksenhetsnummer/bolignummer-style addresses in Norway)
Al
2016-06-14 22:02:14 +02:00
9d7239d0ad[addresses] Adding null-phrase/null-phrase-alpha-only handling and zero padding to numbered components in sub-building configs
Al
2016-06-14 21:53:43 +02:00
420b169d48[addresses] adding nb.yaml to valid configs
Al
2016-06-14 21:52:11 +02:00
d50495f609[addresses] null_phrase_alpha_only for phrases like 3o B in Spain
Al
2016-06-14 21:51:47 +02:00
52db502929[addresses] Norwegian address configs
Al
2016-06-14 21:36:32 +02:00
2831b70747[dictionaries] Norwegian sub-building dictionaries
Al
2016-06-14 21:35:23 +02:00
b5d4dd6f37[tokenization] Including full-width numbers in numeric tokens
Al
2016-06-14 01:28:25 +02:00
02d40c23a6[numex] Norwegian ordinal indicators
Al
2016-06-13 16:46:50 +02:00
0136c88629[addresses] Updates to Danish sub-building config
Al
2016-06-13 16:46:25 +02:00
5834f6b8ed[dictionaries] Updates to Danish sub-building dictionaries
Al
2016-06-13 16:45:45 +02:00
23736f2650[fix] return None if there are no ordinal suffixes for a given language
Al
2016-06-13 16:17:26 +02:00
a6da72a831[fix] addr:place=
Al
2016-06-09 16:17:21 +02:00
ca88ff7f73[osm] Adding railway stations to venues/addresses data sets
Al
2016-06-09 14:59:37 +02:00
b22d30cb52[addresses] Adding Danish config to parsed configs
Al
2016-06-07 18:04:24 -04:00
003c95f9eb[formatting] Adding Danish config to formatter and adjusting continental European template insertions
Al
2016-06-07 18:03:41 -04:00
b8ae1ad61d[addresses] Danish address config
Al
2016-06-07 18:01:46 -04:00
6f5b0e16a1[dictionaries] Danish sub-building dictionaries
Al
2016-06-07 18:01:30 -04:00
1d09060012[fix] adjusting a few probabilities for German
Al
2016-06-07 17:58:22 -04:00
6861c09caa[addresses/dictionaries] Adding Catalan address config
Al
2016-06-02 21:06:29 -04:00
4fa8c2aa8e[addresses] Dutch cross streets
Al
2016-06-02 12:26:12 -04:00
6e4ca716df[fix] Adding sampling for French intersections
Al
2016-06-02 12:22:25 -04:00
38e17bd1b2[fix] adding sampling to Spanish intersections
Al
2016-06-02 12:21:18 -04:00
72e647902d[fix] name
Al
2016-06-02 12:17:40 -04:00
03be909a60[fix] name
Al
2016-06-02 03:05:31 -04:00
45e069be6a[dictionaries] Adding suite to Spanish dictionaries, used sometimes in Latin America, removing entre from stopwords as it's part of the intersections dictionary
Al
2016-06-02 00:31:40 -04:00
127883facc[addresses] Spanish intersections, suite
Al
2016-06-02 00:26:11 -04:00
14f08e5991[formatting] Adding aliases in formatting config, so e.g. most of the Francophone world shares France's config without needing to be the case for every French address (e.g. Belgium), generic config for continental Europe, etc.
Al
2016-06-01 17:12:35 -04:00
75e9d94684[dictionaries] Adding case postale to French dictionaries
Al
2016-06-01 17:10:28 -04:00
ad7ef082a5[dictionaries] extended Dutch dictionaries
Al
2016-06-01 16:48:48 -04:00
b8a9d15d41[addresses] Dutch address config
Al
2016-06-01 16:47:57 -04:00
88762a7778[addresses] German address config numbered units
Al
2016-06-01 16:46:39 -04:00
a456262aca[addresses] German categories and cross streets
Al
2016-06-01 16:44:12 -04:00
dd7ef6fabf[dictionaries] Making new component for near/nearby prepositions
Al
2016-06-01 15:32:23 -04:00
755976bc16[dictionaries] Adding new dictionary for prepositions like near/nearby
Al
2016-06-01 15:31:20 -04:00
ec44fdaf79[addresses] case postale for Canada/Switzerland
Al
2016-06-01 15:25:32 -04:00
ca39272d18[addresses] German address config
Al
2016-06-01 12:36:24 -04:00
22be892635[dictionaries] Updates to German dictionaries
Al
2016-06-01 12:35:48 -04:00
0bbced4966[fix] subdir config in OpenAddresses formatter
Al
2016-06-01 12:17:08 -04:00
fdba7b138d[addresses] Fixes for English/French Canadian apartment numbers
Al
2016-06-01 11:43:42 -04:00
7d5d54bd29[formatting] Territories use parent country's template insertion probabilities
Al
2016-06-01 11:42:11 -04:00
77a4476b8e[openaddresses] CLDR country names for OpenAddresses training set
Al
2016-05-31 18:54:34 -04:00
7d62a3a762[fix] gauche
Al
2016-05-31 18:52:13 -04:00
afa58e6edb[openaddresses] Removing New Zealand city as the field is not specific enough and may conflict with OSM names, needs to be reverse geocoded. Adding cldr country probabilities so we can add localized names/codes given the country
Al
2016-05-31 18:29:07 -04:00
e91b318121[addresses] French address levels alphanumeric
Al
2016-05-31 16:07:48 -04:00
9059c2af60[addresses] Don't generate sub-building components at all if there's no house number
Al
2016-05-31 16:02:55 -04:00