13e10adc13[addresses/dictionaries] Adding rez-de-chaussée bas and rez-de-chaussée haut in French
Al
2016-06-25 20:32:03 -04:00
41898c3a86[dictionaries] Variations on sin numero for Spanish
Al
2016-06-25 20:30:02 -04:00
414c2e9820[addresses/dictionaries] Updates to Portuguese configs, variations for Brasil
Al
2016-06-25 20:29:36 -04:00
2b752de6a7[numex] Adding method to do cardinal number spellout by hundreds e.g. twenty-three seventeen instead of two thousand three three hundred seventeen
Al
2016-06-25 13:36:10 -04:00
b8bc8a33d5[numex] Ordinal spellout using the numex configs
Al
2016-06-25 13:35:03 -04:00
9c43a6fdf8[dictionaries] English cross streets
Al
2016-06-24 16:12:33 -04:00
e2a9a57269[numex] Adding numeric expression spellout in the Python geodata module for generating training data
Al
2016-06-24 16:06:59 -04:00
cf2ed2b299[osm/addresses] using new is_numeric in AddressComponents expansion and removing venue names that are identical to the house number
Al
2016-06-23 13:59:40 -04:00
a2350281feAdded backers and sponsors from OpenCollective
Pia Mancini
2016-06-22 15:33:02 -07:00
106dfa80c3[parser/cli] Using NFC normalization on the output in the parser client (closes#30). Optional command-line arg for parser output dir, useful for spot-checking different experiments
Al
2016-06-22 11:56:35 -04:00
e19bc86c5a[parser] No digit normalization in training data-derived parser phrases (for postcodes, etc.), phrases include the new island type, house number phrases if any are valid. Adjacent words are now full phrases if they are part of a multiword token like a city name. For hyphenated names like Carmel-by-the-Sea, adding a version to the phrase dictionary where the hyphens are replaced with spaces
Al
2016-06-22 11:50:42 -04:00
3ff2f726d0[fix] tokenized trie search when falling off the trie at the start of a valid phrase
Al
2016-06-21 15:48:43 -04:00
935a31df07[fix] semicolon in #define
Al
2016-06-21 15:16:14 -04:00
83eef9bd4dMerge pull request #70 from djui/patch-1
Al Barrentine
2016-06-17 11:31:05 +02:00
e03bd77be8Do not run Homebrew's brew under sudo
Uwe Dauernheim
2016-06-17 09:50:22 +09:00
b90239206f[dictionaries] Portuguese abbreviations
Al
2016-06-16 19:18:02 +02:00
082dbe6dd2[addresses] Implementing unit types which use concatenated floors with offsets for basement (e.g. Norway)
Al
2016-06-16 01:45:43 +02:00
1f08cce1a7[addresses] Implementing number_min_abs_value, number_max_abs_value outside of number_abs_value constraint
Al
2016-06-16 01:44:12 +02:00
c76e7ab776[addresses] Adding Portuguese sub-building config
Al
2016-06-16 01:43:03 +02:00
68db871a33[dictionaries] Portuguese dictionaries to support sub-building config
Al
2016-06-16 01:42:21 +02:00
cb0c913c34[dictionaries] Adding e/ to ambiguous in Spanish dictionaries
Al
2016-06-16 01:41:54 +02:00
f22fcb7932[dictionaries] Adding No to Germanic-language number synonyms
Al
2016-06-16 01:41:06 +02:00
a576e32371[fix] adding back staircase in Swedish sub-building config
Al
2016-06-15 23:39:16 +02:00
5d7fabaa19[addresses] Swedish address config
Al
2016-06-15 16:32:24 +02:00
d680f400d5[addresses] Lower probability of null phrase in Norwegian configs
Al
2016-06-15 16:29:53 +02:00
2854621f2e[dictionaries] Swedish dictionaries to support sub-building config
Al
2016-06-15 16:29:26 +02:00
8d31acbe17[addresses] venstre in Norway rather than igjen
Al
2016-06-15 14:22:25 +02:00
145786a4f3[addresses] Adding parterre for ground floor in Switzerland
Al
2016-06-15 14:20:42 +02:00
715686954a[dictionaries] adding phrases meaning 'near' or 'in' for Norwegian to the dictionaries
Al
2016-06-15 03:13:28 +02:00
4e9f9fbac0[dictionaries] no standalone level types for Norway
Al
2016-06-15 03:12:54 +02:00
8e27dd2554[fix] /underetasje/hovedetasje/ in Norwegian and translating category phrases from Danish
Al
2016-06-15 03:12:24 +02:00
b07a594f79[addresses] Danish level/unit and entrance/unit combinations
Al
2016-06-15 02:55:25 +02:00
ccd1d4825c[addresses/units] Adding special handling for floor phrase + unit concatenation in the unit field (handles bruksenhetsnummer/bolignummer-style addresses in Norway)
Al
2016-06-14 22:02:14 +02:00
f02d393b90[addresses] Adding null-phrase/null-phrase-alpha-only handling and zero padding to numbered components in sub-building configs
Al
2016-06-14 21:53:43 +02:00
e6ac8062d8[addresses] adding nb.yaml to valid configs
Al
2016-06-14 21:52:11 +02:00
4e192d0c2a[addresses] null_phrase_alpha_only for phrases like 3o B in Spain
Al
2016-06-14 21:51:47 +02:00
699b882a31[addresses] Norwegian address configs
Al
2016-06-14 21:36:32 +02:00
2c0bdd9afe[dictionaries] Norwegian sub-building dictionaries
Al
2016-06-14 21:35:23 +02:00
eb1b410d63[tokenization] Including full-width numbers in numeric tokens
Al
2016-06-14 01:28:25 +02:00
faf7ccbddd[numex] Norwegian ordinal indicators
Al
2016-06-13 16:46:50 +02:00
e79ef340ba[addresses] Updates to Danish sub-building config
Al
2016-06-13 16:46:25 +02:00
3557a2313c[dictionaries] Updates to Danish sub-building dictionaries
Al
2016-06-13 16:45:45 +02:00
e1cb8b4bbb[fix] return None if there are no ordinal suffixes for a given language
Al
2016-06-13 16:17:26 +02:00
1f7186d9f2[fix] addr:place=
Al
2016-06-09 16:17:21 +02:00
e0306b2147[osm] Adding railway stations to venues/addresses data sets
Al
2016-06-09 14:59:37 +02:00
791b298b6d[docs][ci skip] Adding Posty McPostFace to the official bindings list (to be maintained in lockstep with this repo)
Al
2016-06-09 13:03:40 +02:00
89c09fb8aa[addresses] Adding Danish config to parsed configs
Al
2016-06-07 18:04:24 -04:00
95842a0a8d[formatting] Adding Danish config to formatter and adjusting continental European template insertions
Al
2016-06-07 18:03:41 -04:00
085ba945e2[addresses] Danish address config
Al
2016-06-07 18:01:46 -04:00
30e1114e6e[dictionaries] Danish sub-building dictionaries
Al
2016-06-07 18:01:30 -04:00
135d50827d[fix] adjusting a few probabilities for German
Al
2016-06-07 17:58:22 -04:00
8854c372ac[addresses/dictionaries] Adding Catalan address config
Al
2016-06-02 21:06:29 -04:00
d947af8152[addresses] Dutch cross streets
Al
2016-06-02 12:26:12 -04:00
a5ae40f7ee[fix] Adding sampling for French intersections
Al
2016-06-02 12:22:25 -04:00
18c25fd4fc[fix] adding sampling to Spanish intersections
Al
2016-06-02 12:21:18 -04:00
3b0712ef41[fix] name
Al
2016-06-02 12:17:40 -04:00
24b84dd503[fix] name
Al
2016-06-02 03:05:31 -04:00
2958cbfacb[dictionaries] Adding suite to Spanish dictionaries, used sometimes in Latin America, removing entre from stopwords as it's part of the intersections dictionary
Al
2016-06-02 00:31:40 -04:00
a05eb0fd51[addresses] Spanish intersections, suite
Al
2016-06-02 00:26:11 -04:00
a32820835c[formatting] Adding aliases in formatting config, so e.g. most of the Francophone world shares France's config without needing to be the case for every French address (e.g. Belgium), generic config for continental Europe, etc.
Al
2016-06-01 17:12:35 -04:00
c386e765ef[dictionaries] Adding case postale to French dictionaries
Al
2016-06-01 17:10:28 -04:00
815c0cf69c[dictionaries] extended Dutch dictionaries
Al
2016-06-01 16:48:48 -04:00
118bd95fed[addresses] Dutch address config
Al
2016-06-01 16:47:57 -04:00
69680a4a0d[addresses] German address config numbered units
Al
2016-06-01 16:46:39 -04:00
6cce045e92[addresses] German categories and cross streets
Al
2016-06-01 16:44:12 -04:00
1e295ea8e9[dictionaries] Making new component for near/nearby prepositions
Al
2016-06-01 15:32:23 -04:00
9e83c2cf22[dictionaries] Adding new dictionary for prepositions like near/nearby
Al
2016-06-01 15:31:20 -04:00
42566eced3[addresses] case postale for Canada/Switzerland
Al
2016-06-01 15:25:32 -04:00
7b1d141fdc[addresses] German address config
Al
2016-06-01 12:36:24 -04:00
010d03b55b[dictionaries] Updates to German dictionaries
Al
2016-06-01 12:35:48 -04:00
b6fe41451f[fix] subdir config in OpenAddresses formatter
Al
2016-06-01 12:17:08 -04:00
a02692713c[addresses] Fixes for English/French Canadian apartment numbers
Al
2016-06-01 11:43:42 -04:00
3c51a5a052[formatting] Territories use parent country's template insertion probabilities
Al
2016-06-01 11:42:11 -04:00
012d174fdc[openaddresses] CLDR country names for OpenAddresses training set
Al
2016-05-31 18:54:34 -04:00
5ae990cd43[fix] gauche
Al
2016-05-31 18:52:13 -04:00
0d2e8387e6[openaddresses] Removing New Zealand city as the field is not specific enough and may conflict with OSM names, needs to be reverse geocoded. Adding cldr country probabilities so we can add localized names/codes given the country
Al
2016-05-31 18:29:07 -04:00
5af6546569[addresses] French address levels alphanumeric
Al
2016-05-31 16:07:48 -04:00
a9c65af75e[addresses] Don't generate sub-building components at all if there's no house number
Al
2016-05-31 16:02:55 -04:00
9bf6018018[addresses] Topological sort of address component dependencies so they get checked/removed in order
Al
2016-05-31 16:01:49 -04:00
2fe379c547[states] State abbreviations for Brazil and Mexico
Al
2016-05-31 15:53:40 -04:00
9fcc04e440[parser] road has no dependencies
Al
2016-05-31 15:52:24 -04:00
bc28f69875[openaddresses] Country code for Belgium, removing Flanders as it has encoding issues, removing region from New Zealand formats as it appears to be conflated with districts
Al
2016-05-31 12:11:42 -04:00
e8647c4701[fix] unused var
Al
2016-05-31 11:01:37 -04:00
e362851459[addresses] French address config
Al
2016-05-31 03:36:07 -04:00
d7ddefbfaf[addresses] Spanish PO box probabilities
Al
2016-05-31 03:35:49 -04:00
3bbbc741d4[openaddresses] OpenAddresses training script
Al
2016-05-31 02:33:32 -04:00
d98eeb08e8[openaddresses] Only adding units for Australia, as they're known to contain both designator and number. US units seem to often have simple numbers/letters for the unit field
Al
2016-05-31 02:20:28 -04:00
7e2b87f10b[openaddresses] Added components via OA config
Al
2016-05-31 02:12:41 -04:00
0efab434f7[openaddresses] Adding abbreviated unit
Al
2016-05-31 02:11:52 -04:00
29406aaa2f[openaddresses] Adding unit by default (only for files that have been vetted)
Al
2016-05-31 02:06:52 -04:00
cf26e6a1bd[fix] OpenAddresses formatting
Al
2016-05-31 02:04:06 -04:00
31bd94191e[fix] condition
Al
2016-05-31 02:00:33 -04:00
ededf6373e[fix] validators
Al
2016-05-31 01:59:05 -04:00
9ace2fb4e7[fix] method name
Al
2016-05-31 01:57:34 -04:00
52fd241235[fix] return value
Al
2016-05-31 01:56:14 -04:00
30f23703ac[fix] import again
Al
2016-05-31 01:53:17 -04:00
86e3cb95f8[fix] import
Al
2016-05-31 01:51:16 -04:00
b793b6e0af[fix] directory structure
Al
2016-05-31 01:48:45 -04:00
dd561ba5b2[fix] import
Al
2016-05-31 01:42:27 -04:00
913d448db1[openaddresses] OpenAddresses address formatter, using the config
Al
2016-05-31 01:41:16 -04:00