Commit Graph

2266 Commits

Author SHA1 Message Date
Al 5f0cde16fe [addresses] Ukrainian address config 2016-07-04 13:56:26 -04:00
Al fab727d98f [dictionaries] Ukrainian dictionaries to support new address config 2016-07-04 13:56:05 -04:00
Al 52951cd335 [boundaries] boundary mappings for South Korea 2016-07-04 13:47:20 -04:00
Al b7cc9bd857 [addresses] Adding digit spellout and the list form of field combinations to existing configs 2016-07-04 13:46:47 -04:00
Al df530b8f4a [tokenization] Re-generating scanner 2016-07-03 23:51:29 -04:00
Al 3cbb1b3976 [tokenization] Hyphens, etc. between non-ASCII digits (e.g. Unicode full-width numbers) should be single tokens 2016-07-03 23:51:13 -04:00
Al ad50e44c12 [osm] Japanese addresses only use named valid venues, not just anything with a name 2016-07-03 23:43:32 -04:00
Al ce2f5be564 [fix] ordinal spellout for numbers which map directly to a simple rule 2016-07-03 23:42:40 -04:00
Al adb2d30438 [fix] alternatives lists in config utils 2016-07-03 23:42:13 -04:00
Al ba6ec40748 [addresses] Sample from higher floors in buildings higher than 10 stories since those are relatively rare and we get enough lower numbered floors from random sampling 2016-07-03 23:41:49 -04:00
Al 1c45163411 [addresses] Handling digit rewrites (spellout, Roman numerals, etc.) in the base class 2016-07-03 23:40:50 -04:00
Al 24c0622bce [addresses] Removing temporary file list and allowing any file ending in .yaml in resources/addresses to be parsed/imported 2016-07-03 23:38:15 -04:00
Al 085cae3407 [fix] components 2016-07-03 23:36:27 -04:00
Al 8fc3f8b925 [addresses] Chinese address config with variations for Hong Kong, Taiwan, etc. 2016-07-03 13:48:44 -04:00
Al 0f7d3f6373 [dictionaries] Chinese dictionaries to support new address config 2016-07-03 13:48:00 -04:00
Al 31e5808413 [dictionaries] A few more Japanese dictionaries to support the address config 2016-07-03 13:47:45 -04:00
Al 1dc90aab1f [dictionaries] código postal also used in some Portuguese-speaking countries e.g. Angola, Mozambique 2016-07-03 13:46:35 -04:00
Al 31a9a82e3c [dictionaries] Abbreviation for apartado in Portugal to match Spanish 2016-07-03 13:45:45 -04:00
Al ba0e3c0408 [dictionaries] Adding railway station token to Japanese qualifiers 2016-07-03 13:45:16 -04:00
Al e4240d9096 [addresses] Japanese Romaji address config 2016-07-02 04:32:39 -04:00
Al 5723b11130 [addresses] Japanese address config 2016-07-02 04:32:26 -04:00
Al 51524b7d71 [dictionaries] Updates to Japanese qualifiers 2016-07-02 04:31:39 -04:00
Al 203980fe0f [addresses] Using Digits.rewrite in unit generation as well as adding a new config option for generating positive numbers only 2016-07-02 04:27:55 -04:00
Al 94b5d055f7 [addresses] Using Digits.rewrite for entrance, staircase, floor numbers, and PO boxes 2016-07-02 04:26:40 -04:00
Al 28f49f3eb7 [addresses] Adding Digits, which allows for replacing numbers with their unicode full-width equivalents or doing number spellout 2016-07-02 04:25:29 -04:00
Al 22524f7822 [addresses] Adding some of the new configs and returning None if no phrase alternatives exist 2016-07-02 04:24:07 -04:00
Al 5579156320 [addresses] Fixes for standalone components, conditional adds, and allowing generated unit numbers to use known floor number 2016-07-02 04:22:34 -04:00
Al 1b42d29129 [dictionaries] gebouw as concatenated inseparable suffix in Dutch (helps with identifying unknown words) 2016-06-30 17:46:37 -04:00
Al 02e2417edd [addresses] New structure for blocks (placeholder, not implemented as random phrases yet) 2016-06-30 17:46:04 -04:00
Al 8103d2d220 [addresses/dictionaries] Netherlands config update, moving verdiep abbreviation to Belgian Flemish 2016-06-30 17:43:41 -04:00
Al ef2a01ed7f [dictionaries] Dutch abbreviations for standalone levels like begane grond 2016-06-30 17:41:13 -04:00
Al ade190f8c7 [osm] Since most streets in Japan do not have names, define a separate set of valid address constraints and merge the files into planet-addresses.osm 2016-06-30 02:34:03 -04:00
Al dfcc1ab9ee [addresses] Making house number phrase (e.g. Calle Foobar nº 2) slightly more common in Spanish-speaking world (and even more likely in Colombia) 2016-06-29 16:00:55 -04:00
Al b7bbd486cd [addresses] Hungarian address config 2016-06-29 15:59:27 -04:00
Al e7ad848464 [numex] adding '.' for Hungarian ordinal indicator (Roman numerals handled in address config) 2016-06-29 15:58:37 -04:00
Al b5f28eca28 [dictionaries] Hungarian dictionaries to support address config 2016-06-29 15:57:40 -04:00
Al 171a2c9b2f [addresses] Adding ability to determine unit numbers using a known floor number 2016-06-29 15:57:10 -04:00
Al 5b17a3a3ce [addresses] Roman numerals can be returned by Floor.random, relaxing the Zipfian distribution on floors so we get higher floors 2016-06-28 19:47:23 -04:00
Al a613bdbf74 [dictionaries] Adding another level type for Russian 2016-06-28 19:28:38 -04:00
Al e808d0c4a3 [dictionaries] A few more Spanish additions 2016-06-28 19:17:56 -04:00
Al aad9231bc5 [dictionaries] & to Swedish cross streets 2016-06-28 13:21:23 -04:00
Al a260acf0af [addresses] Romanian address config 2016-06-28 13:20:54 -04:00
Al a3f6eb68be [dictionaries] Romanian dictionaries to support address config 2016-06-28 13:20:39 -04:00
Al 07d94c0fe7 [dictionaries] Bostad in Swedish (used in Finland) 2016-06-28 13:20:19 -04:00
Al 03004a2967 [addresses] Finnish address config 2016-06-28 13:19:14 -04:00
Al 69ea3b98dc [dictionaries] Finnish dictionaries to support address config 2016-06-28 13:18:46 -04:00
Al e154f98ac1 [addresses] Estonian address config 2016-06-28 13:18:23 -04:00
Al a2e70f453d [dictionaries] Estonian dictionaries to support address config 2016-06-28 13:18:05 -04:00
Al 887091495d [fix] Afrikaans expansion 2016-06-28 13:17:29 -04:00
Al 99d3bd0244 [fix] Fixes to address configs 2016-06-28 13:16:59 -04:00