Commit Graph

632 Commits

Author SHA1 Message Date
Al
cdf8829942 [fix] no longer requiring argv for unicode_properties script 2016-07-21 17:04:57 -04:00
Al
a0e6a828c9 [languages] Adding country_and_languages to the language rtree itself 2016-07-21 17:04:57 -04:00
Al
6703da8fc3 [fix] languages and disambiguation do initialization by default 2016-07-21 17:04:57 -04:00
Al
ee1aa564c4 [normalization] normalize tokens should not replace digits by default 2016-07-21 17:04:57 -04:00
Al
3a9ac9d96f [fix] six.u 2016-07-21 17:04:57 -04:00
Al
49ac3dc553 [disambiguation] Adding best_country_and_language 2016-07-21 17:04:57 -04:00
Al
7b42e52c6a [fix] token_types.PHRASE 2016-07-21 17:04:57 -04:00
Al
e21b793b03 [polygons] Adding ISO3166 alpha 2/3 codes to OSM polygons index 2016-07-21 17:04:57 -04:00
Al
7e5ecb30cf [addresses] sample_alphabet (Zipfian) in PO box rather than a uniform choice 2016-07-21 17:04:57 -04:00
Al
3845c58ca3 [points] Adding load method for point reverse geocoding 2016-07-21 17:04:57 -04:00
Al
c506649252 [fix] languages_intialized 2016-07-21 17:04:57 -04:00
Al
1fd4fbb7a2 [normalization] Adding default token options for numbers so we split alpha from numeric tokens and don't normalize digits 2016-07-21 17:04:57 -04:00
Al
3d765e9eca [addresses] Fixing direction_probability, adding ability to have phrases which only apply to numbers, adding the possibility of null phrases to non-numeric "numbers" e.g. A-Z, etc. 2016-07-21 17:04:57 -04:00
Al
03704fff6a [intersections] Lower memory version of intersection reader 2016-07-21 17:04:57 -04:00
Al
620f0594aa [points] haversine distance in a different method 2016-07-21 17:04:57 -04:00
Al
d5dc34ec1d [gazetteers] moving PHRASE to a token type 2016-07-21 17:04:57 -04:00
Al
04a5a9e611 [fix] Removing YAML inheritance as it doesn't merge nested dictionaries 2016-07-21 17:04:57 -04:00
Al
f3bbe2ee74 [fix] file rename 2016-07-21 17:04:57 -04:00
Al
9f37a26a6d [points] Adding point reverse geocoding index 2016-07-21 17:04:57 -04:00
Al
9977a7a254 [mv] Moving osm_admin_boundaries to just admin_boundaries 2016-07-21 17:04:57 -04:00
Al
37747709ee [addresses] Using YAML inheritance instead of baking it into the config parser 2016-07-21 17:04:57 -04:00
Al
cd10951afb [addresses] Generalizing the functions used for address configs so they can be reused for per-country OSM configs, etc. 2016-07-21 17:04:57 -04:00
Al
79368f3f02 [intersections] Intersections generator for OSM 2016-07-21 17:04:57 -04:00
Al
799bbe4912 [neighborhoods] Moving neighborhoods index to its own package 2016-07-21 17:04:57 -04:00
Al
8aac200d74 [addresses] config for phrases around postcodes like CP in Spanish 2016-07-21 17:04:57 -04:00
Al
5bbb60e241 [fix] instance var 2016-07-21 17:04:57 -04:00
Al
3fd73c0bc8 [fix] import 2016-07-21 17:04:57 -04:00
Al
5713a07106 [fix] set 2016-07-21 17:04:57 -04:00
Al
79b5516e75 [fix] chmod +x 2016-07-21 17:04:57 -04:00
Al
fe2921a20a [chains] Adding code to generate chain_stores repo TSV files from OSM 2016-07-21 17:04:57 -04:00
Al
a7fe6408c0 [addresses] /po_box/po_boxes/ 2016-07-21 17:04:57 -04:00
Al
1e107f09ab [addresses] Generate house number related phrases 2016-07-21 17:04:57 -04:00
Al
62748b4644 [dictionaries] /house_number/house_numbers/ 2016-07-21 17:04:57 -04:00
Al
90c88a3a24 [fix] None handling and number dictionaries 2016-07-21 17:04:57 -04:00
Al
e13c536b03 [addresses] different dictionaries for sampling cardinal/unit directions, not converting None to a string 2016-07-21 17:04:57 -04:00
Al
7f3667caf8 [dictionaries] Removing ambiguous abbreviations for flat 2016-07-21 17:04:57 -04:00
Al
c47762b91c [addresses] Unit/apartment number generation 2016-07-21 17:04:57 -04:00
Al
ca68391ea6 [addresses] sample positive floors 2016-07-21 17:04:57 -04:00
Al
9f652591ad [mv] Moving sampling to math.sampling 2016-07-21 17:04:57 -04:00
Al
32b6217aa8 [addresses] Conjunction can be subclassed 2016-07-21 17:04:57 -04:00
Al
535453f77d [addresses] Adding ability to randomly append relative/cardinal directions 2016-07-21 17:04:57 -04:00
Al
f026e8a764 [addresses] Adding base class for numeric phrases (appended to a number using numeric/numeric_affix), using probability 1.0 if only one of numeric/numeric_affix/ordinal is specified 2016-07-21 17:04:57 -04:00
Al
efc40c5698 [fix] polygons 2016-07-21 17:04:57 -04:00
Al
c7ea5d9637 [fix] typo 2016-07-21 17:04:57 -04:00
Al
0a80ec7129 [polygons] Adding __iter__ and __len__ to polygon index and keeping track of the number of polygons for iteration 2016-07-21 17:04:57 -04:00
Al
f7764b70cd [addresses] implementing null_probability (raw number, no phrase), ordinal genders, and direction_probability 2016-07-21 17:04:57 -04:00
Al
22687323c2 [numbers] suffixed_number 2016-07-21 17:04:57 -04:00
Al
6d4e54cd7a [dictionaries] making entrances/postcodes plural for consistency 2016-07-21 17:04:57 -04:00
Al
410eb0006a [dictionaries] Moving intersections to cross streets 2016-07-21 17:04:57 -04:00
Al
2f9a58f37b [expansion] Add postcode dictionary to gazetteer types 2016-07-21 17:04:57 -04:00