Commit Graph

129 Commits

Author SHA1 Message Date
Al
6d0e5359e7 [addresses] Implementing list-based field combinations 2016-07-21 17:04:57 -04:00
Al
eca6fc7de3 [addresses] Implementing whitespace_probability and ordinal_suffix probability for Roman numerals 2016-07-21 17:04:57 -04:00
Al
793671d0b9 [addresses] Sample from higher floors in buildings higher than 10 stories since those are relatively rare and we get enough lower numbered floors from random sampling 2016-07-21 17:04:57 -04:00
Al
47f926c4b6 [addresses] Handling digit rewrites (spellout, Roman numerals, etc.) in the base class 2016-07-21 17:04:57 -04:00
Al
d97b00b4c1 [addresses] Removing temporary file list and allowing any file ending in .yaml in resources/addresses to be parsed/imported 2016-07-21 17:04:57 -04:00
Al
1e79f31649 [fix] components 2016-07-21 17:04:57 -04:00
Al
2d35b89345 [addresses] Using Digits.rewrite in unit generation as well as adding a new config option for generating positive numbers only 2016-07-21 17:04:57 -04:00
Al
bbeb9a14ca [addresses] Using Digits.rewrite for entrance, staircase, floor numbers, and PO boxes 2016-07-21 17:04:57 -04:00
Al
4d0506a295 [addresses] Adding Digits, which allows for replacing numbers with their unicode full-width equivalents or doing number spellout 2016-07-21 17:04:57 -04:00
Al
ed77ceead3 [addresses] Adding some of the new configs and returning None if no phrase alternatives exist 2016-07-21 17:04:57 -04:00
Al
2d2e2489ff [addresses] Fixes for standalone components, conditional adds, and allowing generated unit numbers to use known floor number 2016-07-21 17:04:57 -04:00
Al
9efc2d4d79 [addresses] Adding ability to determine unit numbers using a known floor number 2016-07-21 17:04:57 -04:00
Al
6fc18b9adb [addresses] Roman numerals can be returned by Floor.random, relaxing the Zipfian distribution on floors so we get higher floors 2016-07-21 17:04:57 -04:00
Al
d3a6a032ab [fix] a few errors with non-numbers in numeric_phrase 2016-07-21 17:04:57 -04:00
Al
2505afa2b9 [addresses] Adding new configs 2016-07-21 17:04:57 -04:00
Al
dfd29911fd [addresses] Implementing Roman numerals and cardinal/ordinal number spellout in numbering base class 2016-07-21 17:04:57 -04:00
Al
11c6564783 [addresses] Russian address config 2016-07-21 17:04:57 -04:00
Al
ee27dc5ea1 [addresses/dictionaries] Updates to Portuguese configs, variations for Brasil 2016-07-21 17:04:57 -04:00
Al
53ea1c139a [osm/addresses] using new is_numeric in AddressComponents expansion and removing venue names that are identical to the house number 2016-07-21 17:04:57 -04:00
Al
b8aba86471 [addresses] Implementing unit types which use concatenated floors with offsets for basement (e.g. Norway) 2016-07-21 17:04:57 -04:00
Al
c29d1ad947 [addresses] Implementing number_min_abs_value, number_max_abs_value outside of number_abs_value constraint 2016-07-21 17:04:57 -04:00
Al
589497cb16 [addresses] Adding Portuguese sub-building config 2016-07-21 17:04:57 -04:00
Al
03b9825390 [addresses/units] Adding special handling for floor phrase + unit concatenation in the unit field (handles bruksenhetsnummer/bolignummer-style addresses in Norway) 2016-07-21 17:04:57 -04:00
Al
9d7239d0ad [addresses] Adding null-phrase/null-phrase-alpha-only handling and zero padding to numbered components in sub-building configs 2016-07-21 17:04:57 -04:00
Al
420b169d48 [addresses] adding nb.yaml to valid configs 2016-07-21 17:04:57 -04:00
Al
b22d30cb52 [addresses] Adding Danish config to parsed configs 2016-07-21 17:04:57 -04:00
Al
b8a9d15d41 [addresses] Dutch address config 2016-07-21 17:04:57 -04:00
Al
22be892635 [dictionaries] Updates to German dictionaries 2016-07-21 17:04:57 -04:00
Al
e91b318121 [addresses] French address levels alphanumeric 2016-07-21 17:04:57 -04:00
Al
9c090302f7 [addresses] Topological sort of address component dependencies so they get checked/removed in order 2016-07-21 17:04:57 -04:00
Al
a32f6b5017 [addresses] Making address_language a classmethod 2016-07-21 17:04:57 -04:00
Al
6980565698 [addresses] Allowing null_phrase_probability for alpha, and alpha+digits instead of just for ordinals (mostly for Spain) 2016-07-21 17:04:57 -04:00
Al
69e1c846ba [parser] Fixing config keys so OSM streets/venues get abbreviated. Selecting namespaced address fields in cases like Brussels or Hong Kong where everything is bilingual. Adding the ability to pass a known language into address component expansion 2016-07-21 17:04:57 -04:00
Al
75aa713792 [fix] moving language code replacements out of address components 2016-07-21 17:04:57 -04:00
Al
046f445a56 [fix] component bitsets 2016-07-21 17:04:57 -04:00
Al
0dbfd79b72 [fix] language format changes only apply to local languages 2016-07-21 17:04:57 -04:00
Al
890268aa87 [languages] Use English formats for Romanized CJK 2016-07-21 17:04:57 -04:00
Al
ad4b197ead [fix] floor samples 2016-07-21 17:04:57 -04:00
Al
110be7a245 [fix] args 2016-07-21 17:04:57 -04:00
Al
52aa95c213 [subdivisions] Adding zone types 2016-07-21 17:04:57 -04:00
Al
694020ddf3 [fix] all_names returns a list not a set 2016-07-21 17:04:57 -04:00
Al
97d2bfb508 [osm] venue names 2016-07-21 17:04:57 -04:00
Al
66e35d517d [fix] language disambiguation 2016-07-21 17:04:57 -04:00
Al
4c71cab6a0 [languages] Adding script-only disambiguation 2016-07-21 17:04:57 -04:00
Al
e5a0dc5a4f [fix] dependencies 2016-07-21 17:04:57 -04:00
Al
35b4f598f8 [fix] drop invalid components 2016-07-21 17:04:57 -04:00
Al
a0b4095479 [fix] args 2016-07-21 17:04:57 -04:00
Al
b884719101 [fix] var name 2016-07-21 17:04:57 -04:00
Al
99ebc3228a [addresses] Address component expansion makes a copy while building PO box addresses, now has a drop names method to drop just venue name, etc. 2016-07-21 17:04:57 -04:00
Al
f4a217045f [fix] name error in neighborhoods 2016-07-21 17:04:57 -04:00