Al
|
df530b8f4a
|
[tokenization] Re-generating scanner
|
2016-07-03 23:51:29 -04:00 |
|
Al
|
3cbb1b3976
|
[tokenization] Hyphens, etc. between non-ASCII digits (e.g. Unicode full-width numbers) should be single tokens
|
2016-07-03 23:51:13 -04:00 |
|
Al
|
ad50e44c12
|
[osm] Japanese addresses only use named valid venues, not just anything with a name
|
2016-07-03 23:43:32 -04:00 |
|
Al
|
ce2f5be564
|
[fix] ordinal spellout for numbers which map directly to a simple rule
|
2016-07-03 23:42:40 -04:00 |
|
Al
|
adb2d30438
|
[fix] alternatives lists in config utils
|
2016-07-03 23:42:13 -04:00 |
|
Al
|
ba6ec40748
|
[addresses] Sample from higher floors in buildings higher than 10 stories since those are relatively rare and we get enough lower numbered floors from random sampling
|
2016-07-03 23:41:49 -04:00 |
|
Al
|
1c45163411
|
[addresses] Handling digit rewrites (spellout, Roman numerals, etc.) in the base class
|
2016-07-03 23:40:50 -04:00 |
|
Al
|
24c0622bce
|
[addresses] Removing temporary file list and allowing any file ending in .yaml in resources/addresses to be parsed/imported
|
2016-07-03 23:38:15 -04:00 |
|
Al
|
085cae3407
|
[fix] components
|
2016-07-03 23:36:27 -04:00 |
|
Al
|
8fc3f8b925
|
[addresses] Chinese address config with variations for Hong Kong, Taiwan, etc.
|
2016-07-03 13:48:44 -04:00 |
|
Al
|
0f7d3f6373
|
[dictionaries] Chinese dictionaries to support new address config
|
2016-07-03 13:48:00 -04:00 |
|
Al
|
31e5808413
|
[dictionaries] A few more Japanese dictionaries to support the address config
|
2016-07-03 13:47:45 -04:00 |
|
Al
|
1dc90aab1f
|
[dictionaries] código postal also used in some Portuguese-speaking countries e.g. Angola, Mozambique
|
2016-07-03 13:46:35 -04:00 |
|
Al
|
31a9a82e3c
|
[dictionaries] Abbreviation for apartado in Portugal to match Spanish
|
2016-07-03 13:45:45 -04:00 |
|
Al
|
ba0e3c0408
|
[dictionaries] Adding railway station token to Japanese qualifiers
|
2016-07-03 13:45:16 -04:00 |
|
Al
|
e4240d9096
|
[addresses] Japanese Romaji address config
|
2016-07-02 04:32:39 -04:00 |
|
Al
|
5723b11130
|
[addresses] Japanese address config
|
2016-07-02 04:32:26 -04:00 |
|
Al
|
51524b7d71
|
[dictionaries] Updates to Japanese qualifiers
|
2016-07-02 04:31:39 -04:00 |
|
Al
|
203980fe0f
|
[addresses] Using Digits.rewrite in unit generation as well as adding a new config option for generating positive numbers only
|
2016-07-02 04:27:55 -04:00 |
|
Al
|
94b5d055f7
|
[addresses] Using Digits.rewrite for entrance, staircase, floor numbers, and PO boxes
|
2016-07-02 04:26:40 -04:00 |
|
Al
|
28f49f3eb7
|
[addresses] Adding Digits, which allows for replacing numbers with their unicode full-width equivalents or doing number spellout
|
2016-07-02 04:25:29 -04:00 |
|
Al
|
22524f7822
|
[addresses] Adding some of the new configs and returning None if no phrase alternatives exist
|
2016-07-02 04:24:07 -04:00 |
|
Al
|
5579156320
|
[addresses] Fixes for standalone components, conditional adds, and allowing generated unit numbers to use known floor number
|
2016-07-02 04:22:34 -04:00 |
|
Al
|
1b42d29129
|
[dictionaries] gebouw as concatenated inseparable suffix in Dutch (helps with identifying unknown words)
|
2016-06-30 17:46:37 -04:00 |
|
Al
|
02e2417edd
|
[addresses] New structure for blocks (placeholder, not implemented as random phrases yet)
|
2016-06-30 17:46:04 -04:00 |
|
Al
|
8103d2d220
|
[addresses/dictionaries] Netherlands config update, moving verdiep abbreviation to Belgian Flemish
|
2016-06-30 17:43:41 -04:00 |
|
Al
|
ef2a01ed7f
|
[dictionaries] Dutch abbreviations for standalone levels like begane grond
|
2016-06-30 17:41:13 -04:00 |
|
Al
|
ade190f8c7
|
[osm] Since most streets in Japan do not have names, define a separate set of valid address constraints and merge the files into planet-addresses.osm
|
2016-06-30 02:34:03 -04:00 |
|
Al
|
dfcc1ab9ee
|
[addresses] Making house number phrase (e.g. Calle Foobar nº 2) slightly more common in Spanish-speaking world (and even more likely in Colombia)
|
2016-06-29 16:00:55 -04:00 |
|
Al
|
b7bbd486cd
|
[addresses] Hungarian address config
|
2016-06-29 15:59:27 -04:00 |
|
Al
|
e7ad848464
|
[numex] adding '.' for Hungarian ordinal indicator (Roman numerals handled in address config)
|
2016-06-29 15:58:37 -04:00 |
|
Al
|
b5f28eca28
|
[dictionaries] Hungarian dictionaries to support address config
|
2016-06-29 15:57:40 -04:00 |
|
Al
|
171a2c9b2f
|
[addresses] Adding ability to determine unit numbers using a known floor number
|
2016-06-29 15:57:10 -04:00 |
|
Al
|
5b17a3a3ce
|
[addresses] Roman numerals can be returned by Floor.random, relaxing the Zipfian distribution on floors so we get higher floors
|
2016-06-28 19:47:23 -04:00 |
|
Al
|
a613bdbf74
|
[dictionaries] Adding another level type for Russian
|
2016-06-28 19:28:38 -04:00 |
|
Al
|
e808d0c4a3
|
[dictionaries] A few more Spanish additions
|
2016-06-28 19:17:56 -04:00 |
|
Al
|
aad9231bc5
|
[dictionaries] & to Swedish cross streets
|
2016-06-28 13:21:23 -04:00 |
|
Al
|
a260acf0af
|
[addresses] Romanian address config
|
2016-06-28 13:20:54 -04:00 |
|
Al
|
a3f6eb68be
|
[dictionaries] Romanian dictionaries to support address config
|
2016-06-28 13:20:39 -04:00 |
|
Al
|
07d94c0fe7
|
[dictionaries] Bostad in Swedish (used in Finland)
|
2016-06-28 13:20:19 -04:00 |
|
Al
|
03004a2967
|
[addresses] Finnish address config
|
2016-06-28 13:19:14 -04:00 |
|
Al
|
69ea3b98dc
|
[dictionaries] Finnish dictionaries to support address config
|
2016-06-28 13:18:46 -04:00 |
|
Al
|
e154f98ac1
|
[addresses] Estonian address config
|
2016-06-28 13:18:23 -04:00 |
|
Al
|
a2e70f453d
|
[dictionaries] Estonian dictionaries to support address config
|
2016-06-28 13:18:05 -04:00 |
|
Al
|
887091495d
|
[fix] Afrikaans expansion
|
2016-06-28 13:17:29 -04:00 |
|
Al
|
99d3bd0244
|
[fix] Fixes to address configs
|
2016-06-28 13:16:59 -04:00 |
|
Al
|
a4c02f7031
|
[numex] Estonian ordinal indicators are just .
|
2016-06-28 13:12:08 -04:00 |
|
Al
|
5b5e13a178
|
[numex] Finnish ordinals can also use .
|
2016-06-28 13:11:44 -04:00 |
|
Al
|
15059c76a6
|
[test] Adding tests for address configs
|
2016-06-28 13:10:47 -04:00 |
|
Al
|
5e78f72fc7
|
[fix] a few errors with non-numbers in numeric_phrase
|
2016-06-28 13:08:38 -04:00 |
|