Commit Graph

901 Commits

Author SHA1 Message Date
Al
dd20db85e1 [fix] include all relation members from OSM in dependencies 2016-07-21 17:04:57 -04:00
Al
cabe2c2930 [fix] define var 2016-07-21 17:04:57 -04:00
Al
12250533e3 [osm] fixing Santa Monica and other polygons by treating member ways with a blank role type (the old way of specifying outer) as outer boundaries so the polygons are valid, and adding a properties-only option for admin polygons 2016-07-21 17:04:57 -04:00
Al
0f56eb5aca [fix] Only add nodes to dictionary if they have place= as there are named nodes that are not admin_centres 2016-07-21 17:04:57 -04:00
Al
0afbc84a2b [phrases] pulling phrase extraction out of command-line script so it's more reusable for other tasks 2016-07-21 17:04:57 -04:00
Al
99fa6eee61 [osm] Neighborhood index now uses the OSM admin R-tree to check for nodes that are classified as non-suburbs (e.g. Santa Monica is a city) but may still match one of the neighborhood data sets, so that they are excluded from the final neighborhood index 2016-07-21 17:04:57 -04:00
Al
3eae77c148 [fix] Reverting to dropping ways/relations in planet-neighborhoods.osm since the polygon index itself can now be checked 2016-07-21 17:04:57 -04:00
Al
f5ecba155f [osm] get_first_component for passing in a properties dictionary and returning an address formatting component 2016-07-21 17:04:57 -04:00
Al
ad5f34664a [osm] Adding admin_center properties to admin boundary polygons 2016-07-21 17:04:57 -04:00
Al
904cb7f4c1 [osm] Extract all members with role= as dependencies 2016-07-21 17:04:57 -04:00
Al
ff66151109 [components] Check global keys first (e.g. place= before admin_level=) in boundary component mapping 2016-07-21 17:04:57 -04:00
Al
46f83ce3ef [addresses] Implementing phrases for numbered blocks 2016-07-21 17:04:57 -04:00
Al
e2097f79fd [osm] For the OSM neighborhoods data set, including neighborhoods, cities, etc. which are ways/relations to more effectively dedupe with ClickThatHood and Quattroshapes 2016-07-21 17:04:57 -04:00
Al
9c931a329e [neighborhoods] /zetashapes/clickthathood/ 2016-07-21 17:04:57 -04:00
Al
cc280c7001 [addresses] Implementing alphabet_probability, so that the Latin alphabet may still be used in some cases 2016-07-21 17:04:57 -04:00
Al
99c1b633ac [test] Printing invalid phrases in address config tests 2016-07-21 17:04:57 -04:00
Al
c91950ea6c [osm] Adding OSM file for places stored as nodes. Adding a general venue definition accessible from the geodata Python package. OSM definitions expand simple variables so that definitions can be reused/combined in the bash script 2016-07-21 17:04:57 -04:00
Al
6d0e5359e7 [addresses] Implementing list-based field combinations 2016-07-21 17:04:57 -04:00
Al
eca6fc7de3 [addresses] Implementing whitespace_probability and ordinal_suffix probability for Roman numerals 2016-07-21 17:04:57 -04:00
Al
e4d8faab73 [osm] Japanese addresses only use named valid venues, not just anything with a name 2016-07-21 17:04:57 -04:00
Al
068e24a206 [fix] ordinal spellout for numbers which map directly to a simple rule 2016-07-21 17:04:57 -04:00
Al
d6c44a0c09 [fix] alternatives lists in config utils 2016-07-21 17:04:57 -04:00
Al
793671d0b9 [addresses] Sample from higher floors in buildings higher than 10 stories since those are relatively rare and we get enough lower numbered floors from random sampling 2016-07-21 17:04:57 -04:00
Al
47f926c4b6 [addresses] Handling digit rewrites (spellout, Roman numerals, etc.) in the base class 2016-07-21 17:04:57 -04:00
Al
d97b00b4c1 [addresses] Removing temporary file list and allowing any file ending in .yaml in resources/addresses to be parsed/imported 2016-07-21 17:04:57 -04:00
Al
1e79f31649 [fix] components 2016-07-21 17:04:57 -04:00
Al
2d35b89345 [addresses] Using Digits.rewrite in unit generation as well as adding a new config option for generating positive numbers only 2016-07-21 17:04:57 -04:00
Al
bbeb9a14ca [addresses] Using Digits.rewrite for entrance, staircase, floor numbers, and PO boxes 2016-07-21 17:04:57 -04:00
Al
4d0506a295 [addresses] Adding Digits, which allows for replacing numbers with their unicode full-width equivalents or doing number spellout 2016-07-21 17:04:57 -04:00
Al
ed77ceead3 [addresses] Adding some of the new configs and returning None if no phrase alternatives exist 2016-07-21 17:04:57 -04:00
Al
2d2e2489ff [addresses] Fixes for standalone components, conditional adds, and allowing generated unit numbers to use known floor number 2016-07-21 17:04:57 -04:00
Al
fe2bb06ac2 [osm] Since most streets in Japan do not have names, define a separate set of valid address constraints and merge the files into planet-addresses.osm 2016-07-21 17:04:57 -04:00
Al
9efc2d4d79 [addresses] Adding ability to determine unit numbers using a known floor number 2016-07-21 17:04:57 -04:00
Al
6fc18b9adb [addresses] Roman numerals can be returned by Floor.random, relaxing the Zipfian distribution on floors so we get higher floors 2016-07-21 17:04:57 -04:00
Al
4412ba1177 [test] Adding tests for address configs 2016-07-21 17:04:57 -04:00
Al
d3a6a032ab [fix] a few errors with non-numbers in numeric_phrase 2016-07-21 17:04:57 -04:00
Al
2505afa2b9 [addresses] Adding new configs 2016-07-21 17:04:57 -04:00
Al
dfd29911fd [addresses] Implementing Roman numerals and cardinal/ordinal number spellout in numbering base class 2016-07-21 17:04:57 -04:00
Al
11c6564783 [addresses] Russian address config 2016-07-21 17:04:57 -04:00
Al
ee27dc5ea1 [addresses/dictionaries] Updates to Portuguese configs, variations for Brasil 2016-07-21 17:04:57 -04:00
Al
8a5dd26dbf [numex] Adding method to do cardinal number spellout by hundreds, e.g. twenty-three seventeen instead of two thousand three hundred seventeen 2016-07-21 17:04:57 -04:00
Al
eee68d1ca5 [numex] Ordinal spellout using the numex configs 2016-07-21 17:04:57 -04:00
Al
8383d5bb12 [numex] Adding numeric expression spellout in the Python geodata module for generating training data 2016-07-21 17:04:57 -04:00
Al
53ea1c139a [osm/addresses] using new is_numeric in AddressComponents expansion and removing venue names that are identical to the house number 2016-07-21 17:04:57 -04:00
Al
b8aba86471 [addresses] Implementing unit types which use concatenated floors with offsets for basement (e.g. Norway) 2016-07-21 17:04:57 -04:00
Al
c29d1ad947 [addresses] Implementing number_min_abs_value, number_max_abs_value outside of number_abs_value constraint 2016-07-21 17:04:57 -04:00
Al
589497cb16 [addresses] Adding Portuguese sub-building config 2016-07-21 17:04:57 -04:00
Al
03b9825390 [addresses/units] Adding special handling for floor phrase + unit concatenation in the unit field (handles bruksenhetsnummer/bolignummer-style addresses in Norway) 2016-07-21 17:04:57 -04:00
Al
9d7239d0ad [addresses] Adding null-phrase/null-phrase-alpha-only handling and zero padding to numbered components in sub-building configs 2016-07-21 17:04:57 -04:00
Al
420b169d48 [addresses] adding nb.yaml to valid configs 2016-07-21 17:04:57 -04:00