Commit Graph

2292 Commits

Author SHA1 Message Date
Al aea22eebe8 [dictionaries] More Ukrainian qualifiers 2016-07-07 03:41:02 -04:00
Al 21c1ec3b9b [dictionaries] Number abbreviations for Russian and Ukrainian 2016-07-07 03:40:16 -04:00
Al 3001dcf3e9 [dictionaries] Croatian dictionary additions to support new config 2016-07-07 03:39:15 -04:00
Al c7900e572d [addresses] house_number/level combination and Roman numerals for Dutch config 2016-07-07 03:38:41 -04:00
Al 2fac4c955b [addresses] field combinations, Roman numerals and spellout for Russian config 2016-07-07 03:38:06 -04:00
Al 6de9002c9e [addresses] Adding combinations and Roman numeral floor numbers into Serbian config 2016-07-06 18:06:07 -04:00
Al bdbcbb8d0a [test] Printing invalid phrases in address config tests 2016-07-06 15:40:54 -04:00
Al dfeea0906c [osm] Adding OSM file for places stored as nodes. Adding a general venue definition accessible from the geodata Python package. OSM definitions expand simple variables so can reuse/combine definitions in the bash script 2016-07-06 15:40:04 -04:00
Al 9ba55100e2 [addresses] Serbian address config 2016-07-05 15:48:27 -04:00
Al f4c4f2b8f7 [dictionaries] Serbian dictionaries to support new address config 2016-07-05 15:28:59 -04:00
Al e43c299113 [numex] Serbian numex 2016-07-05 00:53:18 -04:00
Al a9c19056c9 [fix] no upper casing for Pinyin and Romanized Korean 2016-07-04 23:34:01 -04:00
Al d8d3eaf79e [fix] rename zh_pinyin 2016-07-04 18:55:33 -04:00
Al 985ed3996e [addresses] Adding Latvian address config 2016-07-04 18:13:57 -04:00
Al f409a84ccc [numex] Adding ordinal suffixes for Latvian and Lithuanian 2016-07-04 18:13:33 -04:00
Al 455384b230 [dictionaries] Latvian dictionary updates to support the new address config 2016-07-04 18:10:42 -04:00
Al 2f4548664f [addresses] Implementing list-based field combinations 2016-07-04 18:09:50 -04:00
Al d09e0ca966 [addresses] Implementing whitespace_probability and ordinal_suffix probability for Roman numerals 2016-07-04 18:09:26 -04:00
Al 2810e2638d [addresses] Lithuanian address config 2016-07-04 15:20:18 -04:00
Al 86a1190e0a [dictionaries] Lithuanian dictionaries to support new address config 2016-07-04 15:19:59 -04:00
Al baf8b086b0 [dictionaries] Adding 'no' form to languages that also use '№' 2016-07-04 14:51:13 -04:00
Al 318b7fe35b [addresses] Romanized Korean address config 2016-07-04 13:58:22 -04:00
Al 83634f110a [addresses] Korean address config 2016-07-04 13:58:06 -04:00
Al 09c333cbc1 [dictionaries] Korean dictionaries to support new address config 2016-07-04 13:57:54 -04:00
Al f27f7d3046 [dictionaries] Updates to Russian dictionaries 2016-07-04 13:57:16 -04:00
Al a893b6de40 [addresses] Chinese Pinyin config 2016-07-04 13:56:50 -04:00
Al 5f0cde16fe [addresses] Ukrainian address config 2016-07-04 13:56:26 -04:00
Al fab727d98f [dictionaries] Ukrainian dictionaries to support new address config 2016-07-04 13:56:05 -04:00
Al 52951cd335 [boundaries] boundary mappings for South Korea 2016-07-04 13:47:20 -04:00
Al b7cc9bd857 [addresses] Adding digit spellout and the list form of field combinations to existing configs 2016-07-04 13:46:47 -04:00
Al df530b8f4a [tokenization] Re-generating scanner 2016-07-03 23:51:29 -04:00
Al 3cbb1b3976 [tokenization] Hyphens, etc. between non-ASCII digits (e.g. Unicode full-width numbers) should be single tokens 2016-07-03 23:51:13 -04:00
Al ad50e44c12 [osm] Japanese addresses only use named valid venues, not just anything with a name 2016-07-03 23:43:32 -04:00
Al ce2f5be564 [fix] ordinal spellout for numbers which map directly to a simple rule 2016-07-03 23:42:40 -04:00
Al adb2d30438 [fix] alternatives lists in config utils 2016-07-03 23:42:13 -04:00
Al ba6ec40748 [addresses] Sample from higher floors in buildings higher than 10 stories since those are relatively rare and we get enough lower numbered floors from random sampling 2016-07-03 23:41:49 -04:00
Al 1c45163411 [addresses] Handling digit rewrites (spellout, Roman numerals, etc.) in the base class 2016-07-03 23:40:50 -04:00
Al 24c0622bce [addresses] Removing temporary file list and allowing any file ending in .yaml in resources/addresses to be parsed/imported 2016-07-03 23:38:15 -04:00
Al 085cae3407 [fix] components 2016-07-03 23:36:27 -04:00
Al 8fc3f8b925 [addresses] Chinese address config with variations for Hong Kong, Taiwan, etc. 2016-07-03 13:48:44 -04:00
Al 0f7d3f6373 [dictionaries] Chinese dictionaries to support new address config 2016-07-03 13:48:00 -04:00
Al 31e5808413 [dictionaries] A few more Japanese dictionaries to support the address config 2016-07-03 13:47:45 -04:00
Al 1dc90aab1f [dictionaries] código postal also used in some Portuguese-speaking countries e.g. Angola, Mozambique 2016-07-03 13:46:35 -04:00
Al 31a9a82e3c [dictionaries] Abbreviation for apartado in Portugal to match Spanish 2016-07-03 13:45:45 -04:00
Al ba0e3c0408 [dictionaries] Adding railway station token to Japanese qualifiers 2016-07-03 13:45:16 -04:00
Al e4240d9096 [addresses] Japanese Romaji address config 2016-07-02 04:32:39 -04:00
Al 5723b11130 [addresses] Japanese address config 2016-07-02 04:32:26 -04:00
Al 51524b7d71 [dictionaries] Updates to Japanese qualifiers 2016-07-02 04:31:39 -04:00
Al 203980fe0f [addresses] Using Digits.rewrite in unit generation as well as adding a new config option for generating positive numbers only 2016-07-02 04:27:55 -04:00
Al 94b5d055f7 [addresses] Using Digits.rewrite for entrance, staircase, floor numbers, and PO boxes 2016-07-02 04:26:40 -04:00