Commit Graph

55 Commits

Author SHA1 Message Date
Al
422045b699 [addresses] Deriving top floor from num_floors (passed in) 2016-07-21 17:04:57 -04:00
Al
e9e9dac8e1 [addresses] Add generated units, floors, etc. in expanded address components 2016-07-21 17:04:57 -04:00
Al
286791ac8d [addresses] generate entrance phrases 2016-07-21 17:04:57 -04:00
Al
b666232833 [addresses] generate staircase phrases 2016-07-21 17:04:57 -04:00
Al
e02206f2f6 [po boxes] random PO box generation 2016-07-21 17:04:57 -04:00
Al
6f2c81c767 [addresses] Removing subdivisions/buildings rtree, moving probabilities to config 2016-07-21 17:04:57 -04:00
Al
c33f404e1a [addresses] Using the new parser config for address component expansions 2016-07-21 17:04:57 -04:00
Al
ad81095879 [osm] moving osm_address_components to its own module 2016-07-21 17:04:57 -04:00
Al
4271f7f84b [units] Refactoring unit generator to use base alphanumeric generator 2016-07-21 17:04:57 -04:00
Al
c18f53027f [floors] Refactoring floor/level generator to use base alphanumeric generator 2016-07-21 17:04:57 -04:00
Al
d38918ca7f [directions] wrapper for adding cardinal/relative directions probabilistically 2016-07-21 17:04:57 -04:00
Al
ed12d7bbe4 [names] component expansion uses the new configurable affix replacements 2016-07-21 17:04:57 -04:00
Al
6f8e94b851 [config] Adding default/alternative probability distribution to config.utils 2016-07-21 17:04:57 -04:00
Al
f96bcd1727 [numbering] choose_alphanumeric_type for base NumericPhrase 2016-07-21 17:04:57 -04:00
Al
2f784ddec6 [fix] state abbreviation instead of full name 2016-07-21 17:04:57 -04:00
Al
62c0801683 [fix] state name 2016-07-21 17:04:57 -04:00
Al
670da6c4bc [fix] method name 2016-07-21 17:04:57 -04:00
Al
477c96ea00 [fix] var name 2016-07-21 17:04:57 -04:00
Al
00be2e4d0e [fix] all_languages var 2016-07-21 17:04:57 -04:00
Al
1e2894a665 [fix] normalize place names before adding OSM components, modify components in place, delete keys and use the boundary components if the component is ambiguous 2016-07-21 17:04:57 -04:00
Al
cec0d6f6df [fix] tuple 2016-07-21 17:04:57 -04:00
Al
e198bbf23e [fix] whitespace 2016-07-21 17:04:57 -04:00
Al
b190c88cc1 [fix] state 2016-07-21 17:04:57 -04:00
Al
3c6691d295 [fix] kwargs 2016-07-21 17:04:57 -04:00
Al
4909fa7ee1 [fix] deriving whitespace and state in normalized_place_name, adding all candidate languages to arguments 2016-07-21 17:04:57 -04:00
Al
330394ff51 [fix] raw OSM reverse geocoded components vs. normalized version 2016-07-21 17:04:57 -04:00
Al
f7697cf20d [fix] import 2016-07-21 17:04:57 -04:00
Al
e5fdd915d0 [fix] check the first phrase for components and bail if it matches something other than the specified tag 2016-07-21 17:04:57 -04:00
Al
8370a41ec0 [fix] import 2016-07-21 17:04:57 -04:00
Al
651bc32650 [addresses] more thoroughly solving the addr:city='Harlem' issue 2016-07-21 17:04:57 -04:00
Al
5a31b60cbe [addresses] Adding normalized_place_name, a method for separating compound fields like addr:city='New York NY' into simply 'New York', solving the compound phrase problem. Also solves the mislabeled place name problem, causing the system to ignore the user tag and fall back on reverse geocoded components in cases e.g. where addr:city='Harlem', which is a known neighborhood but not a city when reverse geocoded. A few other refactors for expanded address components 2016-07-21 17:04:57 -04:00
Al
fa99b4ce77 [addresses] wrapping up some of the functionality from OSM formatter to be used on an arbitrary address component dictionary 2016-07-21 17:04:57 -04:00
Al
7e5ecb30cf [addresses] sample_alphabet (Zipfian) in PO box rather than a uniform choice 2016-07-21 17:04:57 -04:00
Al
3d765e9eca [addresses] Fixing direction_probability, adding ability to have phrases which only apply to numbers, adding the possibility of null phrases to non-numeric "numbers" e.g. A-Z, etc. 2016-07-21 17:04:57 -04:00
Al
04a5a9e611 [fix] Removing YAML inheritance as it doesn't merge nested dictionaries 2016-07-21 17:04:57 -04:00
Al
37747709ee [addresses] Using YAML inheritance instead of baking it into the config parser 2016-07-21 17:04:57 -04:00
Al
8aac200d74 [addresses] config for phrases around postcodes like CP in Spanish 2016-07-21 17:04:57 -04:00
Al
a7fe6408c0 [addresses] /po_box/po_boxes/ 2016-07-21 17:04:57 -04:00
Al
1e107f09ab [addresses] Generate house number related phrases 2016-07-21 17:04:57 -04:00
Al
90c88a3a24 [fix] None handling and number dictionaries 2016-07-21 17:04:57 -04:00
Al
e13c536b03 [addresses] different dictionaries for sampling cardinal/unit directions, not converting None to a string 2016-07-21 17:04:57 -04:00
Al
7f3667caf8 [dictionaries] Removing ambiguous abbreviations for flat 2016-07-21 17:04:57 -04:00
Al
c47762b91c [addresses] Unit/apartment number generation 2016-07-21 17:04:57 -04:00
Al
ca68391ea6 [addresses] sample positive floors 2016-07-21 17:04:57 -04:00
Al
9f652591ad [mv] Moving sampling to math.sampling 2016-07-21 17:04:57 -04:00
Al
32b6217aa8 [addresses] Conjunction can be subclassed 2016-07-21 17:04:57 -04:00
Al
535453f77d [addresses] Adding ability to randomly append relative/cardinal directions 2016-07-21 17:04:57 -04:00
Al
f026e8a764 [addresses] Adding base class for numeric phrases (appended to a number using numeric/numeric_affix), using probability 1.0 if only one of numeric/numeric_affix/ordinal is specified 2016-07-21 17:04:57 -04:00
Al
f7764b70cd [addresses] implementing null_probability (raw number, no phrase), ordinal genders, and direction_probability 2016-07-21 17:04:57 -04:00
Al
b5386eb601 [addresses] generator for floor numbers as well as special aliases like basement, mezzanine, etc. using the address configs 2016-07-21 17:04:57 -04:00