Commit Graph

  • 2a10dd16d5 [fix] Afrikaans expansion Al 2016-06-28 13:17:29 -04:00
  • fcfd28f23a [fix] Fixes to address configs Al 2016-06-28 13:16:59 -04:00
  • 0b119becaf [numex] Estonian ordinal indicators are just . Al 2016-06-28 13:12:08 -04:00
  • 8023c1e86a [numex] Finnish ordinals can also use . Al 2016-06-28 13:11:44 -04:00
  • 4412ba1177 [test] Adding tests for address configs Al 2016-06-28 13:10:47 -04:00
  • d3a6a032ab [fix] a few errors with non-numbers in numeric_phrase Al 2016-06-28 13:08:38 -04:00
  • be5fd79a48 [expansion] Prefix/suffix expansions by default can apply to ADDRESS_ANY but also inherit the types of any dictionary that lists their canonical form (so we can add suffixes without worrying about whether they're for streets or place names, etc.) Al 2016-06-28 02:37:38 -04:00
  • 8072b01023 [dictionaries] Adding concatenated suffixes to street types, adding universitat as a suffix Al 2016-06-28 01:31:04 -04:00
  • 96d4c64ebd [addresses] Use bostad in Swedish addresses in Finland Al 2016-06-28 01:29:10 -04:00
  • 2505afa2b9 [addresses] Adding new configs Al 2016-06-27 03:06:54 -04:00
  • dfd29911fd [addresses] Implementing Roman numerals and cardinal/ordinal number spellout in numbering base class Al 2016-06-27 03:06:33 -04:00
  • 79bc859692 [addresses] Italian address config Al 2016-06-27 03:05:50 -04:00
  • 46a82aef89 [dictionaries] Italian dictionaries to support sub-building config Al 2016-06-27 03:05:06 -04:00
  • 579dafc6e0 [addresses] Slovak address config Al 2016-06-27 03:04:41 -04:00
  • 540e4be7b2 [dictionaries] Slovak dictionaries to support sub-building config Al 2016-06-27 03:04:16 -04:00
  • 02f19c4df0 [addresses] Czech sub-building config Al 2016-06-27 03:03:57 -04:00
  • c16bd2768a [dictionaries] Czech dictionaries to support sub-building config Al 2016-06-27 03:03:47 -04:00
  • 078aa20930 [numex] ordinal suffixes for Czech/Slovak Al 2016-06-27 03:03:24 -04:00
  • 6ab8041618 [dictionaries] Ampersand in Polish/Russian Al 2016-06-27 03:02:39 -04:00
  • faed055803 [dictionaries] Numero sign in Italian Al 2016-06-27 03:02:02 -04:00
  • efa75919e6 [dictionaries] numero sign in French Al 2016-06-27 03:01:43 -04:00
  • ee71d94e85 [addresses] Adding Roman numerals to the Polish config for floor numbers Al 2016-06-27 03:01:10 -04:00
  • 11c6564783 [addresses] Russian address config Al 2016-06-26 01:24:00 -04:00
  • 7bc459f1a9 [dictionaries] Russian dictionaries to support address configs Al 2016-06-26 01:23:47 -04:00
  • 53052e6d25 [addresses] Polish address config and dictionary updates Al 2016-06-25 20:36:07 -04:00
  • 558d643042 [numex] Portuguese ordinals fix Al 2016-06-25 20:32:31 -04:00
  • b15675f8cb [addresses/dictionaries] Adding rez-de-chaussée bas and rez-de-chaussée haut in French Al 2016-06-25 20:32:03 -04:00
  • d89e9dcd04 [dictionaries] Variations on sin numero for Spanish Al 2016-06-25 20:30:02 -04:00
  • ee27dc5ea1 [addresses/dictionaries] Updates to Portuguese configs, variations for Brasil Al 2016-06-25 20:29:36 -04:00
  • 8a5dd26dbf [numex] Adding method to do cardinal number spellout by hundreds, e.g. twenty-three seventeen instead of two thousand three hundred seventeen Al 2016-06-25 13:36:10 -04:00
  • eee68d1ca5 [numex] Ordinal spellout using the numex configs Al 2016-06-25 13:35:03 -04:00
  • c628b9bee8 [dictionaries] English cross streets Al 2016-06-24 16:12:33 -04:00
  • 8383d5bb12 [numex] Adding numeric expression spellout in the Python geodata module for generating training data Al 2016-06-24 16:06:59 -04:00
  • 53ea1c139a [osm/addresses] using new is_numeric in AddressComponents expansion and removing venue names that are identical to the house number Al 2016-06-23 13:59:40 -04:00
  • 8926293063 [parser/cli] Using NFC normalization on the output in the parser client (closes #30). Optional command-line arg for parser output dir, useful for spot-checking different experiments Al 2016-06-22 11:56:35 -04:00
  • 44908ff95a [parser] No digit normalization in training data-derived parser phrases (for postcodes, etc.), phrases include the new island type, house number phrases if any are valid. Adjacent words are now full phrases if they are part of a multiword token like a city name. For hyphenated names like Carmel-by-the-Sea, adding a version to the phrase dictionary where the hyphens are replaced with spaces Al 2016-06-22 11:50:42 -04:00
  • 41ae742285 [fix] tokenized trie search when falling off the trie at the start of a valid phrase Al 2016-06-21 15:48:43 -04:00
  • 6e60b3bbda [fix] semicolon in #define Al 2016-06-21 15:16:14 -04:00
  • 0f76c8c631 [dictionaries] Portuguese abbreviations Al 2016-06-16 19:18:02 +02:00
  • b8aba86471 [addresses] Implementing unit types which use concatenated floors with offsets for basement (e.g. Norway) Al 2016-06-16 01:45:43 +02:00
  • c29d1ad947 [addresses] Implementing number_min_abs_value, number_max_abs_value outside of number_abs_value constraint Al 2016-06-16 01:44:12 +02:00
  • 589497cb16 [addresses] Adding Portuguese sub-building config Al 2016-06-16 01:43:03 +02:00
  • 2be41732f8 [dictionaries] Portuguese dictionaries to support sub-building config Al 2016-06-16 01:42:21 +02:00
  • 1bd62313f4 [dictionaries] Adding e/ to ambiguous in Spanish dictionaries Al 2016-06-16 01:41:54 +02:00
  • 6b7e4f8515 [dictionaries] Adding No to Germanic-language number synonyms Al 2016-06-16 01:41:06 +02:00
  • 619127e4b1 [fix] adding back staircase in Swedish sub-building config Al 2016-06-15 23:39:16 +02:00
  • bc70a54b09 [addresses] Swedish address config Al 2016-06-15 16:32:24 +02:00
  • b622315d0f [addresses] Lower probability of null phrase in Norwegian configs Al 2016-06-15 16:29:53 +02:00
  • ac22f270bb [dictionaries] Swedish dictionaries to support sub-building config Al 2016-06-15 16:29:26 +02:00
  • d8ddae362f [addresses] venstre in Norway rather than igjen Al 2016-06-15 14:22:25 +02:00
  • cd9b33983a [addresses] Adding parterre for ground floor in Switzerland Al 2016-06-15 14:20:42 +02:00
  • a61d9b1548 [dictionaries] adding phrases meaning 'near' or 'in' for Norwegian to the dictionaries Al 2016-06-15 03:13:28 +02:00
  • 541fe6c5ac [dictionaries] no standalone level types for Norway Al 2016-06-15 03:12:54 +02:00
  • 06fdf1c532 [fix] /underetasje/hovedetasje/ in Norwegian and translating category phrases from Danish Al 2016-06-15 03:12:24 +02:00
  • 0222049b88 [addresses] Danish level/unit and entrance/unit combinations Al 2016-06-15 02:55:25 +02:00
  • 03b9825390 [addresses/units] Adding special handling for floor phrase + unit concatenation in the unit field (handles bruksenhetsnummer/bolignummer-style addresses in Norway) Al 2016-06-14 22:02:14 +02:00
  • 9d7239d0ad [addresses] Adding null-phrase/null-phrase-alpha-only handling and zero padding to numbered components in sub-building configs Al 2016-06-14 21:53:43 +02:00
  • 420b169d48 [addresses] adding nb.yaml to valid configs Al 2016-06-14 21:52:11 +02:00
  • d50495f609 [addresses] null_phrase_alpha_only for phrases like 3o B in Spain Al 2016-06-14 21:51:47 +02:00
  • 52db502929 [addresses] Norwegian address configs Al 2016-06-14 21:36:32 +02:00
  • 2831b70747 [dictionaries] Norwegian sub-building dictionaries Al 2016-06-14 21:35:23 +02:00
  • b5d4dd6f37 [tokenization] Including full-width numbers in numeric tokens Al 2016-06-14 01:28:25 +02:00
  • 02d40c23a6 [numex] Norwegian ordinal indicators Al 2016-06-13 16:46:50 +02:00
  • 0136c88629 [addresses] Updates to Danish sub-building config Al 2016-06-13 16:46:25 +02:00
  • 5834f6b8ed [dictionaries] Updates to Danish sub-building dictionaries Al 2016-06-13 16:45:45 +02:00
  • 23736f2650 [fix] return None if there are no ordinal suffixes for a given language Al 2016-06-13 16:17:26 +02:00
  • a6da72a831 [fix] addr:place= Al 2016-06-09 16:17:21 +02:00
  • ca88ff7f73 [osm] Adding railway stations to venues/addresses data sets Al 2016-06-09 14:59:37 +02:00
  • b22d30cb52 [addresses] Adding Danish config to parsed configs Al 2016-06-07 18:04:24 -04:00
  • 003c95f9eb [formatting] Adding Danish config to formatter and adjusting continental European template insertions Al 2016-06-07 18:03:41 -04:00
  • b8ae1ad61d [addresses] Danish address config Al 2016-06-07 18:01:46 -04:00
  • 6f5b0e16a1 [dictionaries] Danish sub-building dictionaries Al 2016-06-07 18:01:30 -04:00
  • 1d09060012 [fix] adjusting a few probabilities for German Al 2016-06-07 17:58:22 -04:00
  • 6861c09caa [addresses/dictionaries] Adding Catalan address config Al 2016-06-02 21:06:29 -04:00
  • 4fa8c2aa8e [addresses] Dutch cross streets Al 2016-06-02 12:26:12 -04:00
  • 6e4ca716df [fix] Adding sampling for French intersections Al 2016-06-02 12:22:25 -04:00
  • 38e17bd1b2 [fix] adding sampling to Spanish intersections Al 2016-06-02 12:21:18 -04:00
  • 72e647902d [fix] name Al 2016-06-02 12:17:40 -04:00
  • 03be909a60 [fix] name Al 2016-06-02 03:05:31 -04:00
  • 45e069be6a [dictionaries] Adding suite to Spanish dictionaries, used sometimes in Latin America, removing entre from stopwords as it's part of the intersections dictionary Al 2016-06-02 00:31:40 -04:00
  • 127883facc [addresses] Spanish intersections, suite Al 2016-06-02 00:26:11 -04:00
  • 14f08e5991 [formatting] Adding aliases in formatting config, so e.g. most of the Francophone world shares France's config without needing to be the case for every French address (e.g. Belgium), generic config for continental Europe, etc. Al 2016-06-01 17:12:35 -04:00
  • 75e9d94684 [dictionaries] Adding case postale to French dictionaries Al 2016-06-01 17:10:28 -04:00
  • ad7ef082a5 [dictionaries] extended Dutch dictionaries Al 2016-06-01 16:48:48 -04:00
  • b8a9d15d41 [addresses] Dutch address config Al 2016-06-01 16:47:57 -04:00
  • 88762a7778 [addresses] German address config numbered units Al 2016-06-01 16:46:39 -04:00
  • a456262aca [addresses] German categories and cross streets Al 2016-06-01 16:44:12 -04:00
  • dd7ef6fabf [dictionaries] Making new component for near/nearby prepositions Al 2016-06-01 15:32:23 -04:00
  • 755976bc16 [dictionaries] Adding new dictionary for prepositions like near/nearby Al 2016-06-01 15:31:20 -04:00
  • ec44fdaf79 [addresses] case postale for Canada/Switzerland Al 2016-06-01 15:25:32 -04:00
  • ca39272d18 [addresses] German address config Al 2016-06-01 12:36:24 -04:00
  • 22be892635 [dictionaries] Updates to German dictionaries Al 2016-06-01 12:35:48 -04:00
  • 0bbced4966 [fix] subdir config in OpenAddresses formatter Al 2016-06-01 12:17:08 -04:00
  • fdba7b138d [addresses] Fixes for English/French Canadian apartment numbers Al 2016-06-01 11:43:42 -04:00
  • 7d5d54bd29 [formatting] Territories use parent country's template insertion probabilities Al 2016-06-01 11:42:11 -04:00
  • 77a4476b8e [openaddresses] CLDR country names for OpenAddresses training set Al 2016-05-31 18:54:34 -04:00
  • 7d62a3a762 [fix] gauche Al 2016-05-31 18:52:13 -04:00
  • afa58e6edb [openaddresses] Removing New Zealand city as the field is not specific enough and may conflict with OSM names, needs to be reverse geocoded. Adding cldr country probabilities so we can add localized names/codes given the country Al 2016-05-31 18:29:07 -04:00
  • e91b318121 [addresses] French address levels alphanumeric Al 2016-05-31 16:07:48 -04:00
  • 9059c2af60 [addresses] Don't generate sub-building components at all if there's no house number Al 2016-05-31 16:02:55 -04:00
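
Note on 8a5dd26dbf (cardinal spellout by hundreds): the commit message describes reading a four-digit number as two two-digit groups. The sketch below illustrates that idea for English only. It is not the code from the commit, which works from the numex configs; the names `spell_two_digits` and `spell_by_hundreds`, the hard-coded word lists, and the handling of round hundreds and single-digit remainders are all assumptions made for illustration.

```python
# Hypothetical English-only sketch; the real implementation is driven by
# per-language numex configs rather than hard-coded word lists.
ONES = [
    "zero", "one", "two", "three", "four", "five", "six", "seven",
    "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
    "fifteen", "sixteen", "seventeen", "eighteen", "nineteen",
]
TENS = [
    "", "", "twenty", "thirty", "forty", "fifty",
    "sixty", "seventy", "eighty", "ninety",
]


def spell_two_digits(n):
    """Spell out an integer in the range 0..99."""
    if not 0 <= n <= 99:
        raise ValueError("expected a value between 0 and 99")
    if n < 20:
        return ONES[n]
    tens, ones = divmod(n, 10)
    if ones == 0:
        return TENS[tens]
    return "{}-{}".format(TENS[tens], ONES[ones])


def spell_by_hundreds(n):
    """Spell a four-digit number as two two-digit groups."""
    if not 1000 <= n <= 9999:
        raise ValueError("expected a four-digit number")
    high, low = divmod(n, 100)
    if low == 0:
        # Assumed convention for round hundreds, e.g. 1900.
        return "{} hundred".format(spell_two_digits(high))
    if low < 10:
        # Assumed convention for a single-digit remainder, e.g. 2305.
        return "{} oh {}".format(spell_two_digits(high), ONES[low])
    return "{} {}".format(spell_two_digits(high), spell_two_digits(low))


print(spell_by_hundreds(2317))  # twenty-three seventeen
```

This style of reading is common for house numbers, which is presumably why it is useful when generating address training data; other languages would need their own rules for the two groups.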