Commit Graph

1894 Commits

Author SHA1 Message Date
Al
39121db707 [parser] Parser default config 2016-05-10 00:59:03 -04:00
Al
7ef207f54d [boundaries] removing omissions from boundary names config 2016-05-10 00:52:14 -04:00
Al
44f0054170 [boundaries] Adding component-specific admin name probabilities to config (e.g. choose the ISO alpha-2 code 20% of the time, etc.) 2016-05-08 17:56:26 -04:00
Al
dcae484851 [boundaries] Config for boundary name changes (Kings County is a state_district but Brooklyn should not be used for that context) and omissions (usually we add islands as address components, but not e.g. Manhattan Island) 2016-05-08 11:47:51 -04:00
Al
29cd94f87a [osm] moving osm_address_components to its own module 2016-05-05 19:00:57 -04:00
Al
2db255c55f [units] Refactoring unit generator to use base alphanumeric generator 2016-05-05 18:47:24 -04:00
Al
ec0779aaea [floors] Refactoring floor/level generator to use base alphanumeric generator 2016-05-05 18:46:42 -04:00
Al
7c505985c5 [directions] wrapper for adding cardinal/relative directions probabilistically 2016-05-05 18:44:58 -04:00
Al
d20da7fbe7 [names] component expansion uses the new configurable affix replacements 2016-05-05 18:32:23 -04:00
Al
3ca6338427 [config] Adding default/alternative probability distribution to config.utils 2016-05-05 18:28:26 -04:00
Al
84d2bb1fa8 [names] Adding name affix normalizations to a YAML config 2016-05-05 18:26:45 -04:00
Al
bb4796900b [dictionaries] More Spanish abbreviations 2016-05-05 15:46:29 -04:00
Al
d64af86fa7 [numbering] choose_alphanumeric_type for base NumericPhrase 2016-05-05 15:45:51 -04:00
Al
87d5eac6ea [fix] import 2016-05-05 15:45:17 -04:00
Al
51418e5f38 [fix] name 2016-05-05 15:44:56 -04:00
Al
2450b93674 [fix] uppercased state abbreviations 2016-05-05 14:34:55 -04:00
Al
440db23bad [fix] state abbreviation instead of full name 2016-05-05 14:27:56 -04:00
Al
480a8b68b4 [fix] defaults 2016-05-05 14:17:49 -04:00
Al
1dcb980a99 [fix] state name 2016-05-05 14:12:34 -04:00
Al
d2a2e7ffde [fix] method name 2016-05-05 14:10:06 -04:00
Al
3645da508c [fix] var name 2016-05-05 14:08:08 -04:00
Al
2e571c19e9 [fix] all_languages var 2016-05-05 13:49:40 -04:00
Al
0d6622a390 [fix] normalize place names before adding OSM components, modify components in place, delete keys and use the boundary components if the component is ambiguous 2016-05-05 13:47:44 -04:00
Al
9bc14c3556 [fix] tuple 2016-05-05 13:31:46 -04:00
Al
817044a26d [fix] whitespace 2016-05-05 13:31:03 -04:00
Al
3863104384 [fix] state 2016-05-05 13:29:15 -04:00
Al
ae63d6b4c9 [fix] kwargs 2016-05-05 13:24:22 -04:00
Al
58b84d6244 [fix] deriving whitespace and state in normalized_place_name, adding all candidate languages to arguments 2016-05-05 13:20:53 -04:00
Al
03c837526c [fix] raw OSM reverse geocoded components vs. versus normalized version 2016-05-05 13:04:57 -04:00
Al
c4c1f9e17b [fix] import 2016-05-05 12:54:15 -04:00
Al
f59ee4a136 [fix] check the first phrase for components and bail if it matches something other than the specified tag 2016-05-05 12:46:01 -04:00
Al
cf0dcd849c [fix] import 2016-05-05 12:30:42 -04:00
Al
bc6a34a3b6 [addresses] more thoroughly solving the addr:city='Harlem' issue 2016-05-05 12:30:04 -04:00
Al
4f0a142153 [addresses] Adding normalized_place_name, a method for separating compound fields like addr:city='New York NY' into simply 'New York', solving the compound phrase problem. Also solves the mislabeled place name problem, causing the system to ignore the user tag and fall back on reverse geocoded components in cases e.g. where addr:city='Harlem', which is a known neighborhood but not a city when reverse geocoded. A few other refactors for expanded address components 2016-05-05 12:18:33 -04:00
Al
7a51d1fbc7 [formatting] Defining some of the new tag names in AddressFormatter as well as insert_component which reparses the address formatter template and inserts a given components, removing it from an existing block if necessary 2016-05-05 11:25:35 -04:00
Al
0739fe1b72 [aliases] Adding get method for aliases 2016-05-05 11:23:10 -04:00
Al
822c93c692 [aliases] packaging up field aliasing 2016-05-05 02:21:40 -04:00
Al
dcd26d095c [phrases] Using simple string encoding/decoding for default serialize/deserialize in PhraseFilter base class 2016-05-05 02:20:12 -04:00
Al
0a67df8a38 [osm] Adding parse_osm_number_range for addr:flats and addr:unit 2016-05-04 02:44:59 -04:00
Al
239180f09d [fix] typo 2016-05-02 21:35:12 -04:00
Al
05fb41ccb1 [dictionaries] no postcodes French dictionary 2016-05-02 21:33:49 -04:00
Al
d410c9ae80 [states] Moving state abbreviations config to YAML 2016-05-02 19:00:35 -04:00
Al
3c750a868e [phrases] Using safe_encode/safe_decode as default trie serializer/deserializer 2016-05-02 15:45:39 -04:00
Al
39700cbb11 [fix] import 2016-05-02 14:41:55 -04:00
Al
7171e3b021 [fix] file encoding 2016-05-02 14:41:06 -04:00
Al
5f146ae4f7 [fix] /postal.text.normalize/geodata.text.normalize/ 2016-05-02 14:05:56 -04:00
Al
379719cbd9 [polygons] Moving neighborhoods reverse geocoder to match the naming convention, adding coding: utf-8 2016-05-02 14:05:15 -04:00
Al
3e31715e48 [phrases] adding __init__ to base PhraseFilter 2016-05-02 11:59:10 -04:00
Al
84f7d01972 [osm] Adding place=city/town/village/hamlet/municipality to admin borders data set 2016-05-02 11:52:09 -04:00
Al
2d42ae02e4 [dictionaries] Removing department from numbered unit types in English, is more for named subdivisions e.g. Accounting Department, which we don't yet support 2016-05-01 15:27:45 -04:00