Commit Graph

875 Commits

Author SHA1 Message Date
Al
eca6fc7de3 [addresses] Implementing whitespace_probability and ordinal_suffix probability for Roman numerals 2016-07-21 17:04:57 -04:00
Al
e4d8faab73 [osm] Japanese addresses only use named valid venues, not just anything with a name 2016-07-21 17:04:57 -04:00
Al
068e24a206 [fix] ordinal spellout for numbers which map directly to a simple rule 2016-07-21 17:04:57 -04:00
Al
d6c44a0c09 [fix] alternatives lists in config utils 2016-07-21 17:04:57 -04:00
Al
793671d0b9 [addresses] Sample from higher floors in buildings higher than 10 stories since those are relatively rare and we get enough lower numbered floors from random sampling 2016-07-21 17:04:57 -04:00
Al
47f926c4b6 [addresses] Handling digit rewrites (spellout, Roman numerals, etc.) in the base class 2016-07-21 17:04:57 -04:00
Al
d97b00b4c1 [addresses] Removing temporary file list and allowing any file ending in .yaml in resources/addresses to be parsed/imported 2016-07-21 17:04:57 -04:00
Al
1e79f31649 [fix] components 2016-07-21 17:04:57 -04:00
Al
2d35b89345 [addresses] Using Digits.rewrite in unit generation as well as adding a new config option for generating positive numbers only 2016-07-21 17:04:57 -04:00
Al
bbeb9a14ca [addresses] Using Digits.rewrite for entrance, staircase, floor numbers, and PO boxes 2016-07-21 17:04:57 -04:00
Al
4d0506a295 [addresses] Adding Digits, which allows for replacing numbers with their unicode full-width equivalents or doing number spellout 2016-07-21 17:04:57 -04:00
Al
ed77ceead3 [addresses] Adding some of the new configs and returning None if no phrase alternatives exist 2016-07-21 17:04:57 -04:00
Al
2d2e2489ff [addresses] Fixes for standalone components, conditional adds, and allowing generated unit numbers to use known floor number 2016-07-21 17:04:57 -04:00
Al
fe2bb06ac2 [osm] Since most streets in Japan do not have names, define a separate set of valid address constraints and merge the files into planet-addresses.osm 2016-07-21 17:04:57 -04:00
Al
9efc2d4d79 [addresses] Adding ability to determine unit numbers using a known floor number 2016-07-21 17:04:57 -04:00
Al
6fc18b9adb [addresses] Roman numerals can be returned by Floor.random, relaxing the Zipfian distribution on floors so we get higher floors 2016-07-21 17:04:57 -04:00
Al
4412ba1177 [test] Adding tests for address configs 2016-07-21 17:04:57 -04:00
Al
d3a6a032ab [fix] a few errors with non-numbers in numeric_phrase 2016-07-21 17:04:57 -04:00
Al
2505afa2b9 [addresses] Adding new configs 2016-07-21 17:04:57 -04:00
Al
dfd29911fd [addresses] Implementing Roman numerals and cardinal/ordinal number spellout in numbering base class 2016-07-21 17:04:57 -04:00
Al
11c6564783 [addresses] Russian address config 2016-07-21 17:04:57 -04:00
Al
ee27dc5ea1 [addresses/dictionaries] Updates to Portuguese configs, variations for Brasil 2016-07-21 17:04:57 -04:00
Al
8a5dd26dbf [numex] Adding method to do cardinal number spellout by hundreds, e.g. twenty-three seventeen instead of two thousand three hundred seventeen 2016-07-21 17:04:57 -04:00
Al
eee68d1ca5 [numex] Ordinal spellout using the numex configs 2016-07-21 17:04:57 -04:00
Al
8383d5bb12 [numex] Adding numeric expression spellout in the Python geodata module for generating training data 2016-07-21 17:04:57 -04:00
Al
53ea1c139a [osm/addresses] using new is_numeric in AddressComponents expansion and removing venue names that are identical to the house number 2016-07-21 17:04:57 -04:00
Al
b8aba86471 [addresses] Implementing unit types which use concatenated floors with offsets for basement (e.g. Norway) 2016-07-21 17:04:57 -04:00
Al
c29d1ad947 [addresses] Implementing number_min_abs_value, number_max_abs_value outside of number_abs_value constraint 2016-07-21 17:04:57 -04:00
Al
589497cb16 [addresses] Adding Portuguese sub-building config 2016-07-21 17:04:57 -04:00
Al
03b9825390 [addresses/units] Adding special handling for floor phrase + unit concatenation in the unit field (handles bruksenhetsnummer/bolignummer-style addresses in Norway) 2016-07-21 17:04:57 -04:00
Al
9d7239d0ad [addresses] Adding null-phrase/null-phrase-alpha-only handling and zero padding to numbered components in sub-building configs 2016-07-21 17:04:57 -04:00
Al
420b169d48 [addresses] adding nb.yaml to valid configs 2016-07-21 17:04:57 -04:00
Al
23736f2650 [fix] return None if there are no ordinal suffixes for a given language 2016-07-21 17:04:57 -04:00
Al
a6da72a831 [fix] addr:place= 2016-07-21 17:04:57 -04:00
Al
ca88ff7f73 [osm] Adding railway stations to venues/addresses data sets 2016-07-21 17:04:57 -04:00
Al
b22d30cb52 [addresses] Adding Danish config to parsed configs 2016-07-21 17:04:57 -04:00
Al
72e647902d [fix] name 2016-07-21 17:04:57 -04:00
Al
03be909a60 [fix] name 2016-07-21 17:04:57 -04:00
Al
b8a9d15d41 [addresses] Dutch address config 2016-07-21 17:04:57 -04:00
Al
dd7ef6fabf [dictionaries] Making new component for near/nearby prepositions 2016-07-21 17:04:57 -04:00
Al
22be892635 [dictionaries] Updates to German dictionaries 2016-07-21 17:04:57 -04:00
Al
0bbced4966 [fix] subdir config in OpenAddresses formatter 2016-07-21 17:04:57 -04:00
Al
7d5d54bd29 [formatting] Territories use parent country's template insertion probabilities 2016-07-21 17:04:57 -04:00
Al
77a4476b8e [openaddresses] CLDR country names for OpenAddresses training set 2016-07-21 17:04:57 -04:00
Al
e91b318121 [addresses] French address levels alphanumeric 2016-07-21 17:04:57 -04:00
Al
9059c2af60 [addresses] Don't generate sub-building components at all if there's no house number 2016-07-21 17:04:57 -04:00
Al
9c090302f7 [addresses] Topological sort of address component dependencies so they get checked/removed in order 2016-07-21 17:04:57 -04:00
Al
419f5961a5 [fix] unused var 2016-07-21 17:04:57 -04:00
Al
a57ace0be0 [openaddresses] OpenAddresses training script 2016-07-21 17:04:57 -04:00
Al
584a4e0ee8 [openaddresses] Added components via OA config 2016-07-21 17:04:57 -04:00
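
Note on 8a5dd26dbf (cardinal spellout by hundreds): the idea in that commit message can be illustrated with a small sketch. This is not the numex implementation, which is driven by per-language configs. It is an English-only illustration, and the names `spell_under_100` and `spell_by_hundreds`, the restriction to four-digit numbers, and the "oh" convention for a low pair below ten are all assumptions made for this example.

```python
# Illustrative sketch only: English-only, hypothetical helper names,
# not the numex implementation referenced in the commit log.

ONES = [
    "zero", "one", "two", "three", "four", "five", "six", "seven",
    "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
    "fifteen", "sixteen", "seventeen", "eighteen", "nineteen",
]
TENS = [
    "", "", "twenty", "thirty", "forty", "fifty",
    "sixty", "seventy", "eighty", "ninety",
]


def spell_under_100(n: int) -> str:
    """Spell out 0..99 as an English cardinal."""
    if not 0 <= n < 100:
        raise ValueError("expected a value in 0..99")
    if n < 20:
        return ONES[n]
    tens, ones = divmod(n, 10)
    return TENS[tens] if ones == 0 else f"{TENS[tens]}-{ONES[ones]}"


def spell_by_hundreds(n: int) -> str:
    """Spell a four-digit number as two two-digit groups.

    2317 becomes "twenty-three seventeen" rather than
    "two thousand three hundred seventeen".
    """
    if not 1000 <= n <= 9999:
        raise ValueError("this sketch only handles four-digit numbers")
    high, low = divmod(n, 100)  # split into the leading and trailing pairs
    if low == 0:
        # Assumption: round hundreds read as "<high> hundred".
        return f"{spell_under_100(high)} hundred"
    if low < 10:
        # Assumption: a single-digit low pair is read with "oh".
        return f"{spell_under_100(high)} oh {ONES[low]}"
    return f"{spell_under_100(high)} {spell_under_100(low)}"


if __name__ == "__main__":
    print(spell_by_hundreds(2317))
```

Splitting with `divmod(n, 100)` keeps the two groups independent, so each one can reuse the ordinary two-digit spellout. A config-driven version would replace the hard-coded English word lists with per-language rules.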