Commit Graph

2303 Commits

Author SHA1 Message Date
Al
0de8649702 [dictionaries] Bulgarian dictionary updates to support new address config 2016-07-07 16:07:16 -04:00
Al
352b7254e3 [fix] /severno/sever/ in Slovenian 2016-07-07 16:06:27 -04:00
Al
9fe8611d47 [fix] Serbian config 2016-07-07 16:05:54 -04:00
Al
1ac4ab182e [addresses] Implementing alphabet_probability, so may still use the Latin alphabet in some cases 2016-07-07 15:26:51 -04:00
Al
5ed09f3843 [numex] Croatian ordinal suffixes 2016-07-07 03:43:22 -04:00
Al
73dc2961b8 [numex] Bosnian numex 2016-07-07 03:42:58 -04:00
Al
ce0d2f6c8e [addresses] Bosnian address config 2016-07-07 03:42:41 -04:00
Al
4db633b02c [dictionaries] Bosnian dictionaries to support new address config 2016-07-07 03:42:27 -04:00
Al
30e75639d9 [addresses] Slovenian address config 2016-07-07 03:42:06 -04:00
Al
1a349706e2 [dictionaries] Slovenian dictionary additions to support new address config 2016-07-07 03:41:55 -04:00
Al
d6d6de0444 [addresses] Croatian address config 2016-07-07 03:41:31 -04:00
Al
aea22eebe8 [dictionaries] More Ukrainian qualifiers 2016-07-07 03:41:02 -04:00
Al
21c1ec3b9b [dictionaries] Number abbreviations for Russian and Ukrainian 2016-07-07 03:40:16 -04:00
Al
3001dcf3e9 [dictionaries] Croatian dictionary additions to support new config 2016-07-07 03:39:15 -04:00
Al
c7900e572d [addresses] house_number/level combination and Roman numerals for Dutch config 2016-07-07 03:38:41 -04:00
Al
2fac4c955b [addresses] field combinations, Roman numerals and spellout for Russian config 2016-07-07 03:38:06 -04:00
Al
6de9002c9e [addresses] Adding combinations and Roman numeral floor numbers into Serbian config 2016-07-06 18:06:07 -04:00
Al
bdbcbb8d0a [test] Printing invalid phrases in address config tests 2016-07-06 15:40:54 -04:00
Al
dfeea0906c [osm] Adding OSM file for places stored as nodes. Adding a general venue definition accessible from the geodata Python package. OSM definitions expand simple variables so can reuse/combine definitions in the bash script 2016-07-06 15:40:04 -04:00
Al
9ba55100e2 [addresses] Serbian address config 2016-07-05 15:48:27 -04:00
Al
f4c4f2b8f7 [dictionaries] Serbian dictionaries to support new address config 2016-07-05 15:28:59 -04:00
Al
e43c299113 [numex] Serbian numex 2016-07-05 00:53:18 -04:00
Al
a9c19056c9 [fix] no upper casing for Pinyin and Romanized Korean 2016-07-04 23:34:01 -04:00
Al
d8d3eaf79e [fix] rename zh_pinyin 2016-07-04 18:55:33 -04:00
Al
985ed3996e [addresses] Adding Latvian address config 2016-07-04 18:13:57 -04:00
Al
f409a84ccc [numex] Adding ordinal suffixes for Latvian and Lithuanian 2016-07-04 18:13:33 -04:00
Al
455384b230 [dictionaries] Latvian dictionary updates to support the new address config 2016-07-04 18:10:42 -04:00
Al
2f4548664f [addresses] Implementing list-based field combinations 2016-07-04 18:09:50 -04:00
Al
d09e0ca966 [addresses] Implementing whitespace_probability and ordinal_suffix probability for Roman numerals 2016-07-04 18:09:26 -04:00
Al
2810e2638d [addresses] Lithuanian address config 2016-07-04 15:20:18 -04:00
Al
86a1190e0a [dictionaries] Lithuanian dictionaries to support new address config 2016-07-04 15:19:59 -04:00
Al
baf8b086b0 [dictionaries] Adding 'no' form to languages that also use '№' 2016-07-04 14:51:13 -04:00
Al
318b7fe35b [addresses] Romanized Korean address config 2016-07-04 13:58:22 -04:00
Al
83634f110a [addresses] Korean address config 2016-07-04 13:58:06 -04:00
Al
09c333cbc1 [dictionaries] Korean dictionaries to support new address config 2016-07-04 13:57:54 -04:00
Al
f27f7d3046 [dictionaries] Updates to Russian dictionaries 2016-07-04 13:57:16 -04:00
Al
a893b6de40 [addresses] Chinese Pinyin config 2016-07-04 13:56:50 -04:00
Al
5f0cde16fe [addresses] Ukrainian address config 2016-07-04 13:56:26 -04:00
Al
fab727d98f [dictionaries] Ukrainian dictionaries to support new address config 2016-07-04 13:56:05 -04:00
Al
52951cd335 [boundaries] boundary mappings for South Korea 2016-07-04 13:47:20 -04:00
Al
b7cc9bd857 [addresses] Adding digit spellout and the list form of field combinations to existing configs 2016-07-04 13:46:47 -04:00
Al
df530b8f4a [tokenization] Re-generating scanner 2016-07-03 23:51:29 -04:00
Al
3cbb1b3976 [tokenization] Hyphens, etc. between non-ASCII digits (e.g. Unicode full-width numbers) should be single tokens 2016-07-03 23:51:13 -04:00
Al
ad50e44c12 [osm] Japanese addresses only use named valid venues, not just anything with a name 2016-07-03 23:43:32 -04:00
Al
ce2f5be564 [fix] ordinal spellout for numbers which map directly to a simple rule 2016-07-03 23:42:40 -04:00
Al
adb2d30438 [fix] alternatives lists in config utils 2016-07-03 23:42:13 -04:00
Al
ba6ec40748 [addresses] Sample from higher floors in buildings higher than 10 stories since those are relatively rare and we get enough lower numbered floors from random sampling 2016-07-03 23:41:49 -04:00
Al
1c45163411 [addresses] Handling digit rewrites (spellout, Roman numerals, etc.) in the base class 2016-07-03 23:40:50 -04:00
Al
24c0622bce [addresses] Removing temporary file list and allowing any file ending in .yaml in resources/addresses to be parsed/imported 2016-07-03 23:38:15 -04:00
Al
085cae3407 [fix] components 2016-07-03 23:36:27 -04:00