Commit Graph

  • 2fac4c955b [addresses] field combinations, Roman numerals and spellout for Russian config Al 2016-07-07 03:38:06 -04:00
  • 6de9002c9e [addresses] Adding combinations and Roman numeral floor numbers into Serbian config Al 2016-07-06 18:06:04 -04:00
  • bdbcbb8d0a [test] Printing invalid phrases in address config tests Al 2016-07-06 15:40:54 -04:00
  • dfeea0906c [osm] Adding OSM file for places stored as nodes. Adding a general venue definition accessible from the geodata Python package. OSM definitions expand simple variables so can reuse/combine definitions in the bash script Al 2016-07-06 15:40:04 -04:00
  • e816b4f77e [parser] Ignore language/country options explicitly in the parser. The purpose of these options is not to be able to create language-specific/country-specific models at some point, shouldn't be used in the global model Al 2016-07-06 14:56:46 -04:00
  • 9ba55100e2 [addresses] Serbian address config Al 2016-07-05 15:48:27 -04:00
  • f4c4f2b8f7 [dictionaries] Serbian dictionaries to support new address config Al 2016-07-05 15:28:59 -04:00
  • e43c299113 [numex] Serbian numex Al 2016-07-05 00:53:18 -04:00
  • a9c19056c9 [fix] no upper casing for Pinyin and Romanized Korean Al 2016-07-04 23:34:01 -04:00
  • d8d3eaf79e [fix] rename zh_pinyin Al 2016-07-04 18:55:33 -04:00
  • 985ed3996e [addresses] Adding Latvian address config Al 2016-07-04 18:13:57 -04:00
  • f409a84ccc [numex] Adding ordinal suffixes for Latvian and Lithuanian Al 2016-07-04 18:13:33 -04:00
  • 455384b230 [dictionaries] Latvian dictionary updates to support the new address config Al 2016-07-04 18:10:42 -04:00
  • 2f4548664f [addresses] Implementing list-based field combinations Al 2016-07-04 18:09:50 -04:00
  • d09e0ca966 [addresses] Implementing whitespace_probability and ordinal_suffix probability for Roman numerals Al 2016-07-04 18:08:55 -04:00
  • 2810e2638d [addresses] Lithuanian address config Al 2016-07-04 15:20:18 -04:00
  • 86a1190e0a [dictionaries] Lithuanian dictionaries to support new address config Al 2016-07-04 15:19:59 -04:00
  • baf8b086b0 [dictionaries] Adding 'no' form to languages that also use '№' Al 2016-07-04 14:51:13 -04:00
  • 318b7fe35b [addresses] Romanized Korean address config Al 2016-07-04 13:58:22 -04:00
  • 83634f110a [addresses] Korean address config Al 2016-07-04 13:58:06 -04:00
  • 09c333cbc1 [dictionaries] Korean dictionaries to support new address config Al 2016-07-04 13:57:54 -04:00
  • f27f7d3046 [dictionaries] Updates to Russian dictionaries Al 2016-07-04 13:57:16 -04:00
  • a893b6de40 [addresses] Chinese Pinyin config Al 2016-07-04 13:56:46 -04:00
  • 5f0cde16fe [addresses] Ukrainian address config Al 2016-07-04 13:56:26 -04:00
  • fab727d98f [dictionaries] Ukrainian dictionaries to support new address config Al 2016-07-04 13:56:05 -04:00
  • 52951cd335 [boundaries] boundary mappings for South Korea Al 2016-07-04 13:47:20 -04:00
  • b7cc9bd857 [addresses] Adding digit spellout and the list form of field combinations to existing configs Al 2016-07-04 13:46:19 -04:00
  • df530b8f4a [tokenization] Re-generating scanner Al 2016-07-03 23:51:29 -04:00
  • 3cbb1b3976 [tokenization] Hyphens, etc. between non-ASCII digits (e.g. Unicode full-width numbers) should be single tokens Al 2016-07-03 23:51:13 -04:00
  • ad50e44c12 [osm] Japanese addresses only use named valid venues, not just anything with a name Al 2016-07-03 23:43:32 -04:00
  • ce2f5be564 [fix] ordinal spellout for numbers which map directly to a simple rule Al 2016-07-03 23:42:40 -04:00
  • adb2d30438 [fix] alternatives lists in config utils Al 2016-07-03 23:42:13 -04:00
  • ba6ec40748 [addresses] Sample from higher floors in buildings higher than 10 stories since those are relatively rare and we get enough lower numbered floors from random sampling Al 2016-07-03 23:41:49 -04:00
  • 1c45163411 [addresses] Handling digit rewrites (spellout, Roman numerals, etc.) in the base class Al 2016-07-03 23:39:37 -04:00
  • 24c0622bce [addresses] Removing temporary file list and allowing any file ending in .yaml in resources/addresses to be parsed/imported Al 2016-07-03 23:38:15 -04:00
  • 085cae3407 [fix] components Al 2016-07-03 23:36:27 -04:00
  • 8fc3f8b925 [addresses] Chinese address config with variations for Hong Kong, Taiwan, etc. Al 2016-07-03 13:48:44 -04:00
  • 0f7d3f6373 [dictionaries] Chinese dictionaries to support new address config Al 2016-07-03 13:48:00 -04:00
  • 31e5808413 [dictionaries] A few more Japanese dictionaries to support the address config Al 2016-07-03 13:47:45 -04:00
  • 1dc90aab1f [dictionaries] código postal also used in some Portuguese-speaking countries e.g. Angola, Mozambique Al 2016-07-03 13:46:35 -04:00
  • 31a9a82e3c [dictionaries] Abbreviation for apartado in Portugal to match Spanish Al 2016-07-03 13:45:45 -04:00
  • ba0e3c0408 [dictionaries] Adding railway station token to Japanese qualifiers Al 2016-07-03 13:45:16 -04:00
  • e4240d9096 [addresses] Japanese Romaji address config Al 2016-07-02 04:32:39 -04:00
  • 5723b11130 [addresses] Japanese address config Al 2016-07-02 04:32:26 -04:00
  • 51524b7d71 [dictionaries] Updates to Japanese qualifiers Al 2016-07-02 04:31:39 -04:00
  • 203980fe0f [addresses] Using Digits.rewrite in unit generation as well as adding a new config option for generating positive numbers only Al 2016-07-02 04:27:55 -04:00
  • 94b5d055f7 [addresses] Using Digits.rewrite for entrance, staircase, floor numbers, and PO boxes Al 2016-07-02 04:26:21 -04:00
  • 28f49f3eb7 [addresses] Adding Digits, which allows for replacing numbers with their unicode full-width equivalents or doing number spellout Al 2016-07-02 04:25:29 -04:00
  • 22524f7822 [addresses] Adding some of the new configs and returning None if no phrase alternatives exist Al 2016-07-02 04:24:07 -04:00
  • 5579156320 [addresses] Fixes for standalone components, conditional adds, and allowing generated unit numbers to use known floor number Al 2016-07-02 04:22:34 -04:00
  • 58a5dbe7e0 [logging] Logging the value of LIBPOSTAL_DATA_DIR when a setup error occurs Al 2016-07-01 14:51:04 -04:00
  • ad9dfb46bd [build] Using a process pool with 64MB chunks (similar to aws cli) for S3 downloads. Setting the max concurrent requeests to 10, also the default in aws cli. Al 2016-07-01 02:15:19 -04:00
  • 1b42d29129 [dictionaries] gebouw as concatenated inseparable suffix in Dutch (helps with identifying unknown words) Al 2016-06-30 17:46:34 -04:00
  • 02e2417edd [addresses] New structure for blocks (placeholder, not implemented as random phrases yet) Al 2016-06-30 17:46:04 -04:00
  • 8103d2d220 [addresses/dictionaries] Netherlands config update, moving verdiep abbreviation to Belgian Flemish Al 2016-06-30 17:43:30 -04:00
  • ef2a01ed7f [dictionaries] Dutch abbreviations for standalone levels like begane grond Al 2016-06-30 17:41:13 -04:00
  • df7f64439b [docs][ci skip] Adding package dependencies for CentOS Al 2016-06-30 17:38:14 -04:00
  • ade190f8c7 [osm] Since most streets in Japan do not have names, define a separate set of valid address constraints and merge the files into planet-addresses.osm Al 2016-06-30 02:33:46 -04:00
  • dfcc1ab9ee [addresses] Making house number phrase (e.g. Calle Foobar nº 2) slightly more common in Spanish-speaking world (and even more likely in Colombia) Al 2016-06-29 16:00:49 -04:00
  • b7bbd486cd [addresses] Hungarian address config Al 2016-06-29 15:59:27 -04:00
  • e7ad848464 [numex] adding '.' for Hungarian ordinal indicator (Roman numerals handled in address config) Al 2016-06-29 15:58:34 -04:00
  • b5f28eca28 [dictionaries] Hungarian dictionaries to support address config Al 2016-06-29 15:57:40 -04:00
  • 171a2c9b2f [addresses] Adding ability to determine unit numbers using a known floor number Al 2016-06-29 15:56:45 -04:00
  • 5b17a3a3ce [addresses] Roman numerals can be returned by Floor.random, relaxing the Zipfian distribution on floors so we get higher floors Al 2016-06-28 19:47:23 -04:00
  • a613bdbf74 [dictionaries] Adding another level type for Russian Al 2016-06-28 19:28:38 -04:00
  • e808d0c4a3 [dictionaries] A few more Spanish additions Al 2016-06-28 19:17:56 -04:00
  • aad9231bc5 [dictionaries] & to Swedish cross streets Al 2016-06-28 13:21:23 -04:00
  • a260acf0af [addresses] Romanian address config Al 2016-06-28 13:20:54 -04:00
  • a3f6eb68be [dictionaries] Romanian dictionaries to support address config Al 2016-06-28 13:20:39 -04:00
  • 07d94c0fe7 [dictionaries] Bostad in Swedish (used in Finland) Al 2016-06-28 13:20:19 -04:00
  • 03004a2967 [addresses] Finnish address config Al 2016-06-28 13:19:14 -04:00
  • 69ea3b98dc [dictionaries] Finnish dictionaries to support address config Al 2016-06-28 13:18:46 -04:00
  • e154f98ac1 [addresses] Estonian address config Al 2016-06-28 13:18:23 -04:00
  • a2e70f453d [dictionaries] Estonian dictionaries to support address config Al 2016-06-28 13:18:05 -04:00
  • 887091495d [fix] Afrikaans expansion Al 2016-06-28 13:17:29 -04:00
  • 99d3bd0244 [fix] Fixes to address configs Al 2016-06-28 13:16:59 -04:00
  • a4c02f7031 [numex] Estonian ordinal indicators are just . Al 2016-06-28 13:12:08 -04:00
  • 5b5e13a178 [numex] Finnish ordinals can also use . Al 2016-06-28 13:11:44 -04:00
  • 15059c76a6 [test] Adding tests for address configs Al 2016-06-28 13:10:47 -04:00
  • 5e78f72fc7 [fix] a few errors with non-numbers in numeric_phrase Al 2016-06-28 13:08:38 -04:00
  • d79189c501 [expansion] Prefix/suffix expansions by default can apply to ADDRESS_ANY but also inherit the types of any dictionary that lists their canonical form (so we can add suffixes without worrying about whether they're for streets or place names, etc.) Al 2016-06-28 02:37:38 -04:00
  • 6da69d837e [dictionaries] Adding concatenated suffixes to street types, adding universitat as a suffix Al 2016-06-28 01:31:04 -04:00
  • a39c3fa0d5 [addresses] Use bostad in Swedish addresses in Finland Al 2016-06-28 01:29:10 -04:00
  • 4bbc52372c [addresses] Adding new configs Al 2016-06-27 03:06:54 -04:00
  • 0e7348abba [addresses] Implementing Roman numerals and cardinal/ordinal number spellout in numbering base class Al 2016-06-27 03:06:33 -04:00
  • 9a2730103d [addresses] Italian address config Al 2016-06-27 03:05:50 -04:00
  • 6fcdd81ca8 [dictionaries] Italian dictionaries to support sub-building config Al 2016-06-27 03:05:06 -04:00
  • cb4ea1fb71 [addresses] Slovak address config Al 2016-06-27 03:04:41 -04:00
  • ff3e43cea9 [dictionaries] Slovak dictionaries to support sub-building config Al 2016-06-27 03:04:16 -04:00
  • c2f017fd66 [addresses] Czech sub-building config Al 2016-06-27 03:03:57 -04:00
  • cf0ad9ab50 [dictionaries] Czech dictionaries to support sub-building config Al 2016-06-27 03:03:47 -04:00
  • de754f4557 [numex] ordinal suffixes for Czech/Slovak Al 2016-06-27 03:03:24 -04:00
  • 699dae6c56 [dictionaries] Ampersand in Polish/Russian Al 2016-06-27 03:02:39 -04:00
  • ec1ca339fe [dictionaries] Numero sign in Italian Al 2016-06-27 03:02:02 -04:00
  • c1dce672fb [dictionaries] numero sign in French Al 2016-06-27 03:01:43 -04:00
  • cea667598b [addresses] Adding Roman numerals to the Polish config for floor numbers Al 2016-06-27 03:01:10 -04:00
  • ee1326b15c [addresses] Russian address config Al 2016-06-26 01:24:00 -04:00
  • aac2042ba9 [dictionaries] Russian dictionaries to support address configs Al 2016-06-26 01:23:47 -04:00
  • 975ce0f8f8 [addresses] Polish address config and dictionary updates Al 2016-06-25 20:36:07 -04:00
  • 1014056c48 [numex] Portuguese ordinals fix Al 2016-06-25 20:32:31 -04:00