180 Commits

Author SHA1 Message Date
Al
b2f8180d19 [openaddresses] Ignore any fields in OpenAddresses which have N/A as a value 2016-08-25 23:58:38 -04:00
Al
c23a7a4030 [openaddresses] Ditto for numeric boundary names 2016-08-25 22:58:52 -04:00
Al
34b01e203d [openaddresses] Don't allow single-letter boundary names as they're probably just typos 2016-08-25 22:58:26 -04:00
Al
859868aea2 [openaddresses] Adding option to strip non-digits from postcode, addresses with a postcode and no house_number+street may still be useful, keeping them around as place queries to help with postcode contexts 2016-08-25 16:36:18 -04:00
Al
da619e3cf4 [osm] Adding border_type=city to override tags 2016-08-25 15:21:33 -04:00
Al
a6dad74a2b [openaddresses] cleaning comma-delimited boundary components in OpenAddresses data sets 2016-08-24 15:06:04 -04:00
Al
d250f58293 [openaddresses] Also skipping addresses where street == unit 2016-08-24 14:10:41 -04:00
Al
7c3ad708d8 [openaddresses] Ensuring integer house numbers are > 0, street is not simply a numeric token (usually a copy of the house number) and that street != house number generally 2016-08-24 13:46:56 -04:00
Al
b7c600e496 [openaddresses] adding numeric_postcodes_only and add_osm_neighborhoods options 2016-08-23 02:11:21 -04:00
Al
ed0b49884e [openaddresses] Changes to OA config utilizing some of the new cleanup options. Adding language to brussels-fr and brussels-nl, adding New York and New Jersey statewide with the understanding that OSM components will be added in NJ and postcodes will be stripped of letters in NY 2016-08-23 00:38:43 -04:00
Al
8ec288d8f8 [openaddresses] Adding ability to specify language of a particular OpenAddresses CSV a priori. Unless otherwise specified, non-numeric unit fields will be discarded and phrases will be added randomly for numeric unit fields. 2016-08-23 00:29:09 -04:00
Al
99f71b718f [openaddresses] New command-line arguments to OpenAddresses training data script 2016-08-22 22:12:47 -04:00
Al
23be122d2e [openaddresses] Adding ability to use OSM boundaries for OpenAddresses (not turned on by default), cleaning up street names, requiring at least house number and street, validating house number to provide some assurance that it's not a badly-formatted NULL value, adding ability to strip letters from postcode for data sets like New York's statewide where there are some codes attached. 2016-08-22 22:09:00 -04:00
Al
cec4914233 [openaddresses] In some OpenAddresses data sets, the house number is just a copy of the street name, so eliminate non-numeric house numbers to be safe 2016-07-31 01:12:04 -04:00
Al
0bbced4966 [fix] subdir config in OpenAddresses formatter 2016-07-21 17:04:57 -04:00
Al
77a4476b8e [openaddresses] CLDR country names for OpenAddresses training set 2016-07-21 17:04:57 -04:00
Al
a57ace0be0 [openaddresses] OpenAddresses training script 2016-07-21 17:04:57 -04:00
Al
584a4e0ee8 [openaddresses] Added components via OA config 2016-07-21 17:04:57 -04:00
Al
55d66af422 [openaddresses] Adding abbreviated unit 2016-07-21 17:04:57 -04:00
Al
d910c6ca94 [fix] OpenAddresses formatting 2016-07-21 17:04:57 -04:00
Al
802a5ee534 [fix] condition 2016-07-21 17:04:57 -04:00
Al
e6a1d11324 [fix] validators 2016-07-21 17:04:57 -04:00
Al
caa155c9c4 [fix] method name 2016-07-21 17:04:57 -04:00
Al
4d0caec3d3 [fix] return value 2016-07-21 17:04:57 -04:00
Al
0e09e1222f [fix] import again 2016-07-21 17:04:57 -04:00
Al
e5267996ea [fix] import 2016-07-21 17:04:57 -04:00
Al
10662e79d5 [fix] directory structure 2016-07-21 17:04:57 -04:00
Al
0c9f1aa30d [fix] import 2016-07-21 17:04:57 -04:00
Al
1d80d8b6b8 [openaddresses] OpenAddresses address formatter, using the config 2016-07-21 17:04:57 -04:00
Al
91b06439e2 [openaddresses] Fetch script for OpenAddresses 2016-07-21 17:04:57 -04:00