Commit Graph

  • 13e10adc13 [addresses/dictionaries] Adding rez-de-chaussée bas and rez-de-chaussée haut in French Al 2016-06-25 20:32:03 -04:00
  • 41898c3a86 [dictionaries] Variations on sin numero for Spanish Al 2016-06-25 20:30:02 -04:00
  • 414c2e9820 [addresses/dictionaries] Updates to Portuguese configs, variations for Brasil Al 2016-06-25 20:29:36 -04:00
  • 2b752de6a7 [numex] Adding method to do cardinal number spellout by hundreds e.g. twenty-three seventeen instead of two thousand three hundred seventeen Al 2016-06-25 13:36:10 -04:00
  • b8bc8a33d5 [numex] Ordinal spellout using the numex configs Al 2016-06-25 13:35:03 -04:00
  • 9c43a6fdf8 [dictionaries] English cross streets Al 2016-06-24 16:12:33 -04:00
  • e2a9a57269 [numex] Adding numeric expression spellout in the Python geodata module for generating training data Al 2016-06-24 16:06:59 -04:00
  • cf2ed2b299 [osm/addresses] using new is_numeric in AddressComponents expansion and removing venue names that are identical to the house number Al 2016-06-23 13:59:40 -04:00
  • a2350281fe Added backers and sponsors from OpenCollective Pia Mancini 2016-06-22 15:33:02 -07:00
  • 106dfa80c3 [parser/cli] Using NFC normalization on the output in the parser client (closes #30). Optional command-line arg for parser output dir, useful for spot-checking different experiments Al 2016-06-22 11:56:35 -04:00
  • e19bc86c5a [parser] No digit normalization in training data-derived parser phrases (for postcodes, etc.), phrases include the new island type, house number phrases if any are valid. Adjacent words are now full phrases if they are part of a multiword token like a city name. For hyphenated names like Carmel-by-the-Sea, adding a version to the phrase dictionary where the hyphens are replaced with spaces Al 2016-06-22 11:50:42 -04:00
  • 3ff2f726d0 [fix] tokenized trie search when falling off the trie at the start of a valid phrase Al 2016-06-21 15:48:43 -04:00
  • 935a31df07 [fix] semicolon in #define Al 2016-06-21 15:16:14 -04:00
  • 83eef9bd4d Merge pull request #70 from djui/patch-1 Al Barrentine 2016-06-17 11:31:05 +02:00
  • e03bd77be8 Do not run Homebrew's brew under sudo Uwe Dauernheim 2016-06-17 09:50:22 +09:00
  • b90239206f [dictionaries] Portuguese abbreviations Al 2016-06-16 19:18:02 +02:00
  • 082dbe6dd2 [addresses] Implementing unit types which use concatenated floors with offsets for basement (e.g. Norway) Al 2016-06-16 01:45:43 +02:00
  • 1f08cce1a7 [addresses] Implementing number_min_abs_value, number_max_abs_value outside of number_abs_value constraint Al 2016-06-16 01:44:12 +02:00
  • c76e7ab776 [addresses] Adding Portuguese sub-building config Al 2016-06-16 01:43:03 +02:00
  • 68db871a33 [dictionaries] Portuguese dictionaries to support sub-building config Al 2016-06-16 01:42:21 +02:00
  • cb0c913c34 [dictionaries] Adding e/ to ambiguous in Spanish dictionaries Al 2016-06-16 01:41:54 +02:00
  • f22fcb7932 [dictionaries] Adding No to Germanic-language number synonyms Al 2016-06-16 01:41:06 +02:00
  • a576e32371 [fix] adding back staircase in Swedish sub-building config Al 2016-06-15 23:39:16 +02:00
  • 5d7fabaa19 [addresses] Swedish address config Al 2016-06-15 16:32:24 +02:00
  • d680f400d5 [addresses] Lower probability of null phrase in Norwegian configs Al 2016-06-15 16:29:53 +02:00
  • 2854621f2e [dictionaries] Swedish dictionaries to support sub-building config Al 2016-06-15 16:29:26 +02:00
  • 8d31acbe17 [addresses] venstre in Norway rather than igjen Al 2016-06-15 14:22:25 +02:00
  • 145786a4f3 [addresses] Adding parterre for ground floor in Switzerland Al 2016-06-15 14:20:42 +02:00
  • 715686954a [dictionaries] adding phrases meaning 'near' or 'in' for Norwegian to the dictionaries Al 2016-06-15 03:13:28 +02:00
  • 4e9f9fbac0 [dictionaries] no standalone level types for Norway Al 2016-06-15 03:12:54 +02:00
  • 8e27dd2554 [fix] /underetasje/hovedetasje/ in Norwegian and translating category phrases from Danish Al 2016-06-15 03:12:24 +02:00
  • b07a594f79 [addresses] Danish level/unit and entrance/unit combinations Al 2016-06-15 02:55:25 +02:00
  • ccd1d4825c [addresses/units] Adding special handling for floor phrase + unit concatenation in the unit field (handles bruksenhetsnummer/bolignummer-style addresses in Norway) Al 2016-06-14 22:02:14 +02:00
  • f02d393b90 [addresses] Adding null-phrase/null-phrase-alpha-only handling and zero padding to numbered components in sub-building configs Al 2016-06-14 21:53:43 +02:00
  • e6ac8062d8 [addresses] adding nb.yaml to valid configs Al 2016-06-14 21:52:11 +02:00
  • 4e192d0c2a [addresses] null_phrase_alpha_only for phrases like 3o B in Spain Al 2016-06-14 21:51:47 +02:00
  • 699b882a31 [addresses] Norwegian address configs Al 2016-06-14 21:36:32 +02:00
  • 2c0bdd9afe [dictionaries] Norwegian sub-building dictionaries Al 2016-06-14 21:35:23 +02:00
  • eb1b410d63 [tokenization] Including full-width numbers in numeric tokens Al 2016-06-14 01:28:25 +02:00
  • faf7ccbddd [numex] Norwegian ordinal indicators Al 2016-06-13 16:46:50 +02:00
  • e79ef340ba [addresses] Updates to Danish sub-building config Al 2016-06-13 16:46:25 +02:00
  • 3557a2313c [dictionaries] Updates to Danish sub-building dictionaries Al 2016-06-13 16:45:45 +02:00
  • e1cb8b4bbb [fix] return None if there are no ordinal suffixes for a given language Al 2016-06-13 16:17:26 +02:00
  • 1f7186d9f2 [fix] addr:place= Al 2016-06-09 16:17:21 +02:00
  • e0306b2147 [osm] Adding railway stations to venues/addresses data sets Al 2016-06-09 14:59:37 +02:00
  • 791b298b6d [docs][ci skip] Adding Posty McPostFace to the official bindings list (to be maintained in lockstep with this repo) Al 2016-06-09 13:03:40 +02:00
  • 89c09fb8aa [addresses] Adding Danish config to parsed configs Al 2016-06-07 18:04:24 -04:00
  • 95842a0a8d [formatting] Adding Danish config to formatter and adjusting continental European template insertions Al 2016-06-07 18:03:41 -04:00
  • 085ba945e2 [addresses] Danish address config Al 2016-06-07 18:01:46 -04:00
  • 30e1114e6e [dictionaries] Danish sub-building dictionaries Al 2016-06-07 18:01:30 -04:00
  • 135d50827d [fix] adjusting a few probabilities for German Al 2016-06-07 17:58:22 -04:00
  • 8854c372ac [addresses/dictionaries] Adding Catalan address config Al 2016-06-02 21:06:29 -04:00
  • d947af8152 [addresses] Dutch cross streets Al 2016-06-02 12:26:12 -04:00
  • a5ae40f7ee [fix] Adding sampling for French intersections Al 2016-06-02 12:22:25 -04:00
  • 18c25fd4fc [fix] adding sampling to Spanish intersections Al 2016-06-02 12:21:18 -04:00
  • 3b0712ef41 [fix] name Al 2016-06-02 12:17:40 -04:00
  • 24b84dd503 [fix] name Al 2016-06-02 03:05:31 -04:00
  • 2958cbfacb [dictionaries] Adding suite to Spanish dictionaries, used sometimes in Latin America, removing entre from stopwords as it's part of the intersections dictionary Al 2016-06-02 00:31:40 -04:00
  • a05eb0fd51 [addresses] Spanish intersections, suite Al 2016-06-02 00:26:11 -04:00
  • a32820835c [formatting] Adding aliases in formatting config, so e.g. most of the Francophone world shares France's config without needing to be the case for every French address (e.g. Belgium), generic config for continental Europe, etc. Al 2016-06-01 17:12:35 -04:00
  • c386e765ef [dictionaries] Adding case postale to French dictionaries Al 2016-06-01 17:10:28 -04:00
  • 815c0cf69c [dictionaries] extended Dutch dictionaries Al 2016-06-01 16:48:48 -04:00
  • 118bd95fed [addresses] Dutch address config Al 2016-06-01 16:47:57 -04:00
  • 69680a4a0d [addresses] German address config numbered units Al 2016-06-01 16:46:39 -04:00
  • 6cce045e92 [addresses] German categories and cross streets Al 2016-06-01 16:44:12 -04:00
  • 1e295ea8e9 [dictionaries] Making new component for near/nearby prepositions Al 2016-06-01 15:32:23 -04:00
  • 9e83c2cf22 [dictionaries] Adding new dictionary for prepositions like near/nearby Al 2016-06-01 15:31:20 -04:00
  • 42566eced3 [addresses] case postale for Canada/Switzerland Al 2016-06-01 15:25:32 -04:00
  • 7b1d141fdc [addresses] German address config Al 2016-06-01 12:36:24 -04:00
  • 010d03b55b [dictionaries] Updates to German dictionaries Al 2016-06-01 12:35:48 -04:00
  • b6fe41451f [fix] subdir config in OpenAddresses formatter Al 2016-06-01 12:17:08 -04:00
  • a02692713c [addresses] Fixes for English/French Canadian apartment numbers Al 2016-06-01 11:43:42 -04:00
  • 3c51a5a052 [formatting] Territories use parent country's template insertion probabilities Al 2016-06-01 11:42:11 -04:00
  • 012d174fdc [openaddresses] CLDR country names for OpenAddresses training set Al 2016-05-31 18:54:34 -04:00
  • 5ae990cd43 [fix] gauche Al 2016-05-31 18:52:13 -04:00
  • 0d2e8387e6 [openaddresses] Removing New Zealand city as the field is not specific enough and may conflict with OSM names, needs to be reverse geocoded. Adding cldr country probabilities so we can add localized names/codes given the country Al 2016-05-31 18:29:07 -04:00
  • 5af6546569 [addresses] French address levels alphanumeric Al 2016-05-31 16:07:48 -04:00
  • a9c65af75e [addresses] Don't generate sub-building components at all if there's no house number Al 2016-05-31 16:02:55 -04:00
  • 9bf6018018 [addresses] Topological sort of address component dependencies so they get checked/removed in order Al 2016-05-31 16:01:49 -04:00
  • 2fe379c547 [states] State abbreviations for Brazil and Mexico Al 2016-05-31 15:53:40 -04:00
  • 9fcc04e440 [parser] road has no dependencies Al 2016-05-31 15:52:24 -04:00
  • bc28f69875 [openaddresses] Country code for Belgium, removing Flanders as it has encoding issues, removing region from New Zealand formats as it appears to be conflated with districts Al 2016-05-31 12:11:42 -04:00
  • e8647c4701 [fix] unused var Al 2016-05-31 11:01:37 -04:00
  • e362851459 [addresses] French address config Al 2016-05-31 03:36:07 -04:00
  • d7ddefbfaf [addresses] Spanish PO box probabilities Al 2016-05-31 03:35:49 -04:00
  • 3bbbc741d4 [openaddresses] OpenAddresses training script Al 2016-05-31 02:33:32 -04:00
  • d98eeb08e8 [openaddresses] Only adding units for Australia, as they're known to contain both designator and number. US units seem to often have simple numbers/letters for the unit field Al 2016-05-31 02:20:28 -04:00
  • 7e2b87f10b [openaddresses] Added components via OA config Al 2016-05-31 02:12:41 -04:00
  • 0efab434f7 [openaddresses] Adding abbreviated unit Al 2016-05-31 02:11:52 -04:00
  • 29406aaa2f [openaddresses] Adding unit by default (only for files that have been vetted) Al 2016-05-31 02:06:52 -04:00
  • cf26e6a1bd [fix] OpenAddresses formatting Al 2016-05-31 02:04:06 -04:00
  • 31bd94191e [fix] condition Al 2016-05-31 02:00:33 -04:00
  • ededf6373e [fix] validators Al 2016-05-31 01:59:05 -04:00
  • 9ace2fb4e7 [fix] method name Al 2016-05-31 01:57:34 -04:00
  • 52fd241235 [fix] return value Al 2016-05-31 01:56:14 -04:00
  • 30f23703ac [fix] import again Al 2016-05-31 01:53:17 -04:00
  • 86e3cb95f8 [fix] import Al 2016-05-31 01:51:16 -04:00
  • b793b6e0af [fix] directory structure Al 2016-05-31 01:48:45 -04:00
  • dd561ba5b2 [fix] import Al 2016-05-31 01:42:27 -04:00
  • 913d448db1 [openaddresses] OpenAddresses address formatter, using the config Al 2016-05-31 01:41:16 -04:00