883 Commits

Author | SHA1 | Message | Date
Al | ad50e44c12 | [osm] Japanese addresses only use named valid venues, not just anything with a name | 2016-07-03 23:43:32 -04:00
Al | ce2f5be564 | [fix] ordinal spellout for numbers which map directly to a simple rule | 2016-07-03 23:42:40 -04:00
Al | adb2d30438 | [fix] alternatives lists in config utils | 2016-07-03 23:42:13 -04:00
Al | ba6ec40748 | [addresses] Sample from higher floors in buildings higher than 10 stories since those are relatively rare and we get enough lower numbered floors from random sampling | 2016-07-03 23:41:49 -04:00
Al | 1c45163411 | [addresses] Handling digit rewrites (spellout, Roman numerals, etc.) in the base class | 2016-07-03 23:40:50 -04:00
Al | 24c0622bce | [addresses] Removing temporary file list and allowing any file ending in .yaml in resources/addresses to be parsed/imported | 2016-07-03 23:38:15 -04:00
Al | 085cae3407 | [fix] components | 2016-07-03 23:36:27 -04:00
Al | 203980fe0f | [addresses] Using Digits.rewrite in unit generation as well as adding a new config option for generating positive numbers only | 2016-07-02 04:27:55 -04:00
Al | 94b5d055f7 | [addresses] Using Digits.rewrite for entrance, staircase, floor numbers, and PO boxes | 2016-07-02 04:26:40 -04:00
Al | 28f49f3eb7 | [addresses] Adding Digits, which allows for replacing numbers with their unicode full-width equivalents or doing number spellout | 2016-07-02 04:25:29 -04:00
Al | 22524f7822 | [addresses] Adding some of the new configs and returning None if no phrase alternatives exist | 2016-07-02 04:24:07 -04:00
Al | 5579156320 | [addresses] Fixes for standalone components, conditional adds, and allowing generated unit numbers to use known floor number | 2016-07-02 04:22:34 -04:00
Al | ade190f8c7 | [osm] Since most streets in Japan do not have names, define a separate set of valid address constraints and merge the files into planet-addresses.osm | 2016-06-30 02:34:03 -04:00
Al | 171a2c9b2f | [addresses] Adding ability to determine unit numbers using a known floor number | 2016-06-29 15:57:10 -04:00
Al | 5b17a3a3ce | [addresses] Roman numerals can be returned by Floor.random, relaxing the Zipfian distribution on floors so we get higher floors | 2016-06-28 19:47:23 -04:00
Al | 15059c76a6 | [test] Adding tests for address configs | 2016-06-28 13:10:47 -04:00
Al | 5e78f72fc7 | [fix] a few errors with non-numbers in numeric_phrase | 2016-06-28 13:08:38 -04:00
Al | 4bbc52372c | [addresses] Adding new configs | 2016-06-27 03:06:54 -04:00
Al | 0e7348abba | [addresses] Implementing Roman numerals and cardinal/ordinal number spellout in numbering base class | 2016-06-27 03:06:33 -04:00
Al | ee1326b15c | [addresses] Russian address config | 2016-06-26 01:25:27 -04:00
Al | 414c2e9820 | [addresses/dictionaries] Updates to Portuguese configs, variations for Brasil | 2016-06-25 20:29:36 -04:00
Al | 2b752de6a7 | [numex] Adding method to do cardinal number spellout by hundreds e.g. twenty-three seventeen instead of two thousand three hundred seventeen | 2016-06-25 13:36:10 -04:00
Al | b8bc8a33d5 | [numex] Ordinal spellout using the numex configs | 2016-06-25 13:35:03 -04:00
Al | e2a9a57269 | [numex] Adding numeric expression spellout in the Python geodata module for generating training data | 2016-06-24 16:10:36 -04:00
Al | cf2ed2b299 | [osm/addresses] using new is_numeric in AddressComponents expansion and removing venue names that are identical to the house number | 2016-06-23 13:59:40 -04:00
Al | 082dbe6dd2 | [addresses] Implementing unit types which use concatenated floors with offsets for basement (e.g. Norway) | 2016-06-16 01:45:43 +02:00
Al | 1f08cce1a7 | [addresses] Implementing number_min_abs_value, number_max_abs_value outside of number_abs_value constraint | 2016-06-16 01:44:12 +02:00
Al | c76e7ab776 | [addresses] Adding Portuguese sub-building config | 2016-06-16 01:43:03 +02:00
Al | ccd1d4825c | [addresses/units] Adding special handling for floor phrase + unit concatenation in the unit field (handles bruksenhetsnummer/bolignummer-style addresses in Norway) | 2016-06-14 22:02:14 +02:00
Al | f02d393b90 | [addresses] Adding null-phrase/null-phrase-alpha-only handling and zero padding to numbered components in sub-building configs | 2016-06-14 21:53:43 +02:00
Al | e6ac8062d8 | [addresses] adding nb.yaml to valid configs | 2016-06-14 21:52:11 +02:00
Al | e1cb8b4bbb | [fix] return None if there are no ordinal suffixes for a given language | 2016-06-13 16:17:26 +02:00
Al | 1f7186d9f2 | [fix] addr:place= | 2016-06-09 16:17:21 +02:00
Al | e0306b2147 | [osm] Adding railway stations to venues/addresses data sets | 2016-06-09 14:59:37 +02:00
Al | 89c09fb8aa | [addresses] Adding Danish config to parsed configs | 2016-06-07 18:04:24 -04:00
Al | 3b0712ef41 | [fix] name | 2016-06-02 12:17:40 -04:00
Al | 24b84dd503 | [fix] name | 2016-06-02 03:05:31 -04:00
Al | 118bd95fed | [addresses] Dutch address config | 2016-06-01 16:47:57 -04:00
Al | 1e295ea8e9 | [dictionaries] Making new component for near/nearby prepositions | 2016-06-01 15:32:23 -04:00
Al | 010d03b55b | [dictionaries] Updates to German dictionaries | 2016-06-01 12:35:48 -04:00
Al | b6fe41451f | [fix] subdir config in OpenAddresses formatter | 2016-06-01 12:17:08 -04:00
Al | 3c51a5a052 | [formatting] Territories use parent country's template insertion probabilities | 2016-06-01 11:42:11 -04:00
Al | 012d174fdc | [openaddresses] CLDR country names for OpenAddresses training set | 2016-05-31 18:54:37 -04:00
Al | 5af6546569 | [addresses] French address levels alphanumeric | 2016-05-31 16:07:58 -04:00
Al | a9c65af75e | [addresses] Don't generate sub-building components at all if there's no house number | 2016-05-31 16:02:55 -04:00
Al | 9bf6018018 | [addresses] Topological sort of address component dependencies so they get checked/removed in order | 2016-05-31 16:01:49 -04:00
Al | e8647c4701 | [fix] unused var | 2016-05-31 11:01:37 -04:00
Al | 3bbbc741d4 | [openaddresses] OpenAddresses training script | 2016-05-31 02:33:32 -04:00
Al | 7e2b87f10b | [openaddresses] Added components via OA config | 2016-05-31 02:12:41 -04:00
Al | 0efab434f7 | [openaddresses] Adding abbreviated unit | 2016-05-31 02:11:52 -04:00