2fac4c955b[addresses] field combinations, Roman numerals and spellout for Russian config
Al
2016-07-07 03:38:06 -04:00
6de9002c9e[addresses] Adding combinations and Roman numeral floor numbers into Serbian config
Al
2016-07-06 18:06:04 -04:00
bdbcbb8d0a[test] Printing invalid phrases in address config tests
Al
2016-07-06 15:40:54 -04:00
dfeea0906c[osm] Adding OSM file for places stored as nodes. Adding a general venue definition accessible from the geodata Python package. OSM definitions expand simple variables so can reuse/combine definitions in the bash script
Al
2016-07-06 15:40:04 -04:00
e816b4f77e[parser] Ignore language/country options explicitly in the parser. The purpose of these options is not to be able to create language-specific/country-specific models at some point, shouldn't be used in the global model
Al
2016-07-06 14:56:46 -04:00
9ba55100e2[addresses] Serbian address config
Al
2016-07-05 15:48:27 -04:00
f4c4f2b8f7[dictionaries] Serbian dictionaries to support new address config
Al
2016-07-05 15:28:59 -04:00
e43c299113[numex] Serbian numex
Al
2016-07-05 00:53:18 -04:00
a9c19056c9[fix] no upper casing for Pinyin and Romanized Korean
Al
2016-07-04 23:34:01 -04:00
d8d3eaf79e[fix] rename zh_pinyin
Al
2016-07-04 18:55:33 -04:00
985ed3996e[addresses] Adding Latvian address config
Al
2016-07-04 18:13:57 -04:00
f409a84ccc[numex] Adding ordinal suffixes for Latvian and Lithuanian
Al
2016-07-04 18:13:33 -04:00
455384b230[dictionaries] Latvian dictionary updates to support the new address config
Al
2016-07-04 18:10:42 -04:00
2f4548664f[addresses] Implementing list-based field combinations
Al
2016-07-04 18:09:50 -04:00
d09e0ca966[addresses] Implementing whitespace_probability and ordinal_suffix probability for Roman numerals
Al
2016-07-04 18:08:55 -04:00
2810e2638d[addresses] Lithuanian address config
Al
2016-07-04 15:20:18 -04:00
86a1190e0a[dictionaries] Lithuanian dictionaries to support new address config
Al
2016-07-04 15:19:59 -04:00
baf8b086b0[dictionaries] Adding 'no' form to languages that also use '№'
Al
2016-07-04 14:51:13 -04:00
318b7fe35b[addresses] Romanized Korean address config
Al
2016-07-04 13:58:22 -04:00
83634f110a[addresses] Korean address config
Al
2016-07-04 13:58:06 -04:00
09c333cbc1[dictionaries] Korean dictionaries to support new address config
Al
2016-07-04 13:57:54 -04:00
f27f7d3046[dictionaries] Updates to Russian dictionaries
Al
2016-07-04 13:57:16 -04:00
a893b6de40[addresses] Chinese Pinyin config
Al
2016-07-04 13:56:46 -04:00
5f0cde16fe[addresses] Ukrainian address config
Al
2016-07-04 13:56:26 -04:00
fab727d98f[dictionaries] Ukrainian dictionaries to support new address config
Al
2016-07-04 13:56:05 -04:00
52951cd335[boundaries] boundary mappings for South Korea
Al
2016-07-04 13:47:20 -04:00
b7cc9bd857[addresses] Adding digit spellout and the list form of field combinations to existing configs
Al
2016-07-04 13:46:19 -04:00
df530b8f4a[tokenization] Re-generating scanner
Al
2016-07-03 23:51:29 -04:00
3cbb1b3976[tokenization] Hyphens, etc. between non-ASCII digits (e.g. Unicode full-width numbers) should be single tokens
Al
2016-07-03 23:51:13 -04:00
ad50e44c12[osm] Japanese addresses only use named valid venues, not just anything with a name
Al
2016-07-03 23:43:32 -04:00
ce2f5be564[fix] ordinal spellout for numbers which map directly to a simple rule
Al
2016-07-03 23:42:40 -04:00
adb2d30438[fix] alternatives lists in config utils
Al
2016-07-03 23:42:13 -04:00
ba6ec40748[addresses] Sample from higher floors in buildings higher than 10 stories since those are relatively rare and we get enough lower numbered floors from random sampling
Al
2016-07-03 23:41:49 -04:00
1c45163411[addresses] Handling digit rewrites (spellout, Roman numerals, etc.) in the base class
Al
2016-07-03 23:39:37 -04:00
24c0622bce[addresses] Removing temporary file list and allowing any file ending in .yaml in resources/addresses to be parsed/imported
Al
2016-07-03 23:38:15 -04:00
085cae3407[fix] components
Al
2016-07-03 23:36:27 -04:00
8fc3f8b925[addresses] Chinese address config with variations for Hong Kong, Taiwan, etc.
Al
2016-07-03 13:48:44 -04:00
0f7d3f6373[dictionaries] Chinese dictionaries to support new address config
Al
2016-07-03 13:48:00 -04:00
31e5808413[dictionaries] A few more Japanese dictionaries to support the address config
Al
2016-07-03 13:47:45 -04:00
1dc90aab1f[dictionaries] código postal also used in some Portuguese-speaking countries e.g. Angola, Mozambique
Al
2016-07-03 13:46:35 -04:00
31a9a82e3c[dictionaries] Abbreviation for apartado in Portugal to match Spanish
Al
2016-07-03 13:45:45 -04:00
ba0e3c0408[dictionaries] Adding railway station token to Japanese qualifiers
Al
2016-07-03 13:45:16 -04:00
e4240d9096[addresses] Japanese Romaji address config
Al
2016-07-02 04:32:39 -04:00
5723b11130[addresses] Japanese address config
Al
2016-07-02 04:32:26 -04:00
51524b7d71[dictionaries] Updates to Japanese qualifiers
Al
2016-07-02 04:31:39 -04:00
203980fe0f[addresses] Using Digits.rewrite in unit generation as well as adding a new config option for generating positive numbers only
Al
2016-07-02 04:27:55 -04:00
94b5d055f7[addresses] Using Digits.rewrite for entrance, staircase, floor numbers, and PO boxes
Al
2016-07-02 04:26:21 -04:00
28f49f3eb7[addresses] Adding Digits, which allows for replacing numbers with their unicode full-width equivalents or doing number spellout
Al
2016-07-02 04:25:29 -04:00
22524f7822[addresses] Adding some of the new configs and returning None if no phrase alternatives exist
Al
2016-07-02 04:24:07 -04:00
5579156320[addresses] Fixes for standalone components, conditional adds, and allowing generated unit numbers to use known floor number
Al
2016-07-02 04:22:34 -04:00
58a5dbe7e0[logging] Logging the value of LIBPOSTAL_DATA_DIR when a setup error occurs
Al
2016-07-01 14:51:04 -04:00
ad9dfb46bd[build] Using a process pool with 64MB chunks (similar to aws cli) for S3 downloads. Setting the max concurrent requeests to 10, also the default in aws cli.
Al
2016-07-01 02:15:19 -04:00
1b42d29129[dictionaries] gebouw as concatenated inseparable suffix in Dutch (helps with identifying unknown words)
Al
2016-06-30 17:46:34 -04:00
02e2417edd[addresses] New structure for blocks (placeholder, not implemented as random phrases yet)
Al
2016-06-30 17:46:04 -04:00
8103d2d220[addresses/dictionaries] Netherlands config update, moving verdiep abbreviation to Belgian Flemish
Al
2016-06-30 17:43:30 -04:00
ef2a01ed7f[dictionaries] Dutch abbreviations for standalone levels like begane grond
Al
2016-06-30 17:41:13 -04:00
df7f64439b[docs][ci skip] Adding package dependencies for CentOS
Al
2016-06-30 17:38:14 -04:00
ade190f8c7[osm] Since most streets in Japan do not have names, define a separate set of valid address constraints and merge the files into planet-addresses.osm
Al
2016-06-30 02:33:46 -04:00
dfcc1ab9ee[addresses] Making house number phrase (e.g. Calle Foobar nº 2) slightly more common in Spanish-speaking world (and even more likely in Colombia)
Al
2016-06-29 16:00:49 -04:00
b7bbd486cd[addresses] Hungarian address config
Al
2016-06-29 15:59:27 -04:00
e7ad848464[numex] adding '.' for Hungarian ordinal indicator (Roman numerals handled in address config)
Al
2016-06-29 15:58:34 -04:00
b5f28eca28[dictionaries] Hungarian dictionaries to support address config
Al
2016-06-29 15:57:40 -04:00
171a2c9b2f[addresses] Adding ability to determine unit numbers using a known floor number
Al
2016-06-29 15:56:45 -04:00
5b17a3a3ce[addresses] Roman numerals can be returned by Floor.random, relaxing the Zipfian distribution on floors so we get higher floors
Al
2016-06-28 19:47:23 -04:00
a613bdbf74[dictionaries] Adding another level type for Russian
Al
2016-06-28 19:28:38 -04:00
e808d0c4a3[dictionaries] A few more Spanish additions
Al
2016-06-28 19:17:56 -04:00
aad9231bc5[dictionaries] & to Swedish cross streets
Al
2016-06-28 13:21:23 -04:00
a260acf0af[addresses] Romanian address config
Al
2016-06-28 13:20:54 -04:00
a3f6eb68be[dictionaries] Romanian dictionaries to support address config
Al
2016-06-28 13:20:39 -04:00
07d94c0fe7[dictionaries] Bostad in Swedish (used in Finland)
Al
2016-06-28 13:20:19 -04:00
03004a2967[addresses] Finnish address config
Al
2016-06-28 13:19:14 -04:00
69ea3b98dc[dictionaries] Finnish dictionaries to support address config
Al
2016-06-28 13:18:46 -04:00
e154f98ac1[addresses] Estonian address config
Al
2016-06-28 13:18:23 -04:00
a2e70f453d[dictionaries] Estonian dictionaries to support address config
Al
2016-06-28 13:18:05 -04:00
887091495d[fix] Afrikaans expansion
Al
2016-06-28 13:17:29 -04:00
99d3bd0244[fix] Fixes to address configs
Al
2016-06-28 13:16:59 -04:00
a4c02f7031[numex] Estonian ordinal indicators are just .
Al
2016-06-28 13:12:08 -04:00
5b5e13a178[numex] Finnish ordinals can also use .
Al
2016-06-28 13:11:44 -04:00
15059c76a6[test] Adding tests for address configs
Al
2016-06-28 13:10:47 -04:00
5e78f72fc7[fix] a few errors with non-numbers in numeric_phrase
Al
2016-06-28 13:08:38 -04:00
d79189c501[expansion] Prefix/suffix expansions by default can apply to ADDRESS_ANY but also inherit the types of any dictionary that lists their canonical form (so we can add suffixes without worrying about whether they're for streets or place names, etc.)
Al
2016-06-28 02:37:38 -04:00
6da69d837e[dictionaries] Adding concatenated suffixes to street types, adding universitat as a suffix
Al
2016-06-28 01:31:04 -04:00
a39c3fa0d5[addresses] Use bostad in Swedish addresses in Finland
Al
2016-06-28 01:29:10 -04:00
4bbc52372c[addresses] Adding new configs
Al
2016-06-27 03:06:54 -04:00
0e7348abba[addresses] Implementing Roman numerals and cardinal/ordinal number spellout in numbering base class
Al
2016-06-27 03:06:33 -04:00
9a2730103d[addresses] Italian address config
Al
2016-06-27 03:05:50 -04:00
6fcdd81ca8[dictionaries] Italian dictionaries to support sub-building config
Al
2016-06-27 03:05:06 -04:00
cb4ea1fb71[addresses] Slovak address config
Al
2016-06-27 03:04:41 -04:00
ff3e43cea9[dictionaries] Slovak dictionaries to support sub-building config
Al
2016-06-27 03:04:16 -04:00
c2f017fd66[addresses] Czech sub-building config
Al
2016-06-27 03:03:57 -04:00
cf0ad9ab50[dictionaries] Czech dictionaries to support sub-building config
Al
2016-06-27 03:03:47 -04:00
de754f4557[numex] ordinal suffixes for Czech/Slovak
Al
2016-06-27 03:03:24 -04:00
699dae6c56[dictionaries] Ampersand in Polish/Russian
Al
2016-06-27 03:02:39 -04:00
ec1ca339fe[dictionaries] Numero sign in Italian
Al
2016-06-27 03:02:02 -04:00
c1dce672fb[dictionaries] numero sign in French
Al
2016-06-27 03:01:43 -04:00
cea667598b[addresses] Adding Roman numerals to the Polish config for floor numbers
Al
2016-06-27 03:01:10 -04:00
ee1326b15c[addresses] Russian address config
Al
2016-06-26 01:24:00 -04:00
aac2042ba9[dictionaries] Russian dictionaries to support address configs
Al
2016-06-26 01:23:47 -04:00
975ce0f8f8[addresses] Polish address config and dictionary updates
Al
2016-06-25 20:36:07 -04:00
1014056c48[numex] Portuguese ordinals fix
Al
2016-06-25 20:32:31 -04:00