Al
|
769a65b808
|
[openaddresses] adding place-only and place+postcode probability to OpenAddresses to capture more place names not in OSM as standalone queries
|
2016-09-08 03:17:21 -04:00 |
|
Al
|
317e4caca9
|
[fix] only percent quote the filename
|
2016-09-07 13:55:34 -04:00 |
|
Al
|
f061d4239b
|
[fix] quote, not urlencode
|
2016-09-07 13:51:08 -04:00 |
|
Al
|
5c915855c9
|
[fix] urlencode
|
2016-09-07 13:48:39 -04:00 |
|
Al
|
62c8fa9048
|
[fix] encoding III
|
2016-09-07 13:39:38 -04:00 |
|
Al
|
6e5908385a
|
[fix] encoding again
|
2016-09-07 13:34:04 -04:00 |
|
Al
|
3bc122f7a0
|
[fix] encoding
|
2016-09-07 13:28:18 -04:00 |
|
Al
|
7e7ee7462a
|
[fix] dutch house number formatting, strip spaces
|
2016-09-02 14:47:52 -04:00 |
|
Al
|
95384e5a2c
|
[openaddresses] adding hack for Honolulu until join function can handle null in OpenAddresses
|
2016-09-02 14:29:40 -04:00 |
|
Al
|
4e9f88594b
|
[fix] /safe_encode/safe_decode/
|
2016-09-02 13:50:48 -04:00 |
|
Al
|
8fd69b5e4a
|
[fix] args
|
2016-09-02 12:03:24 -04:00 |
|
Al
|
df8e781e02
|
[openaddresses] adding hack for Italy until machine's join function handles null fields
|
2016-09-02 12:01:04 -04:00 |
|
Al
|
5957f45f40
|
[fix] strings
|
2016-09-02 05:00:39 -04:00 |
|
Al
|
4ab749d962
|
[fix] format_address with minimal_only=False
|
2016-09-02 04:59:03 -04:00 |
|
Al
|
d70662e6d7
|
[fix] postcodes
|
2016-09-02 04:42:34 -04:00 |
|
Al
|
bb1c071623
|
[fix] config move
|
2016-09-02 04:16:59 -04:00 |
|
Al
|
552ebf2bcf
|
[fix] var name
|
2016-09-02 04:03:47 -04:00 |
|
Al
|
a4a09fcb3e
|
[openaddresses] don't allow postcodes that are all zeroes with a dash (Poland, US ZIP+4)
|
2016-09-02 03:39:28 -04:00 |
|
Al
|
3f7bfca1ad
|
[openaddresses] allowing house numbers with slashes as well as number + specific fractions separated by space
|
2016-09-02 02:53:26 -04:00 |
|
Al
|
cdfa9e11bf
|
[openaddresses] excluding all streets with "unknown" in the name. This may exclude one or two valid addresses, but the gains far outweigh the costs
|
2016-09-01 17:45:12 -04:00 |
|
Al
|
3aef7e5b8b
|
[openaddresses] making a few methods classmethods so they're easier to test
|
2016-09-01 17:42:07 -04:00 |
|
Al
|
c3c949a147
|
[openaddresses] adding the Netherlands with some hacks for house number until the new format function is deployed in OpenAddresses
|
2016-09-01 17:41:27 -04:00 |
|
Al
|
d7dab92f7b
|
[fix] var name
|
2016-08-31 17:45:50 -04:00 |
|
Al
|
be6c01f5fd
|
[fix] csv
|
2016-08-31 17:45:04 -04:00 |
|
Al
|
d3da513375
|
[fix] import
|
2016-08-31 17:44:16 -04:00 |
|
Al
|
4ed362d5f8
|
[openaddresses] adding script option to download all completed OA files instead of just what's in the config
|
2016-08-31 17:43:07 -04:00 |
|
Al
|
e98cf67f0e
|
[openaddresses] also allowing house numbers like "37/A"
|
2016-08-29 22:56:36 -04:00 |
|
Al
|
78a210c409
|
[openaddresses] replacing backticks with apostrophe, comes up in several countries
|
2016-08-29 21:42:10 -04:00 |
|
Al
|
3f5b3dcb1d
|
[openaddresses] Allowing slashes in house numbers in OpenAddresses
|
2016-08-29 21:26:33 -04:00 |
|
Al
|
ebb34bcc2f
|
[openaddresses] config option to skip rows missing specific fields
|
2016-08-29 19:19:32 -04:00 |
|
Al
|
9b9036243c
|
[fix] overwrite on unzip, logging
|
2016-08-29 00:40:11 -04:00 |
|
Al
|
5b5af04a44
|
[fix] redundant line
|
2016-08-29 00:37:17 -04:00 |
|
Al
|
6284ec39db
|
[fix] name
|
2016-08-29 00:36:45 -04:00 |
|
Al
|
75ece5f5e9
|
[fix] import
|
2016-08-29 00:36:22 -04:00 |
|
Al
|
f5b2b6327e
|
[openaddresses] Using a download script to download the individual OA files of interest rather than the collected file with expansions applied
|
2016-08-29 00:34:39 -04:00 |
|
Al
|
a0cf6ff225
|
[openaddresses] Allowing house numbers like "11 C"
|
2016-08-28 19:11:41 -04:00 |
|
Al
|
ac403bbe49
|
[openaddresses] Adding sin numero validator (sem numero in this case) for Portuguese
|
2016-08-28 18:39:19 -04:00 |
|
Al
|
27c5c8536a
|
[openaddresses] adding debug argument to OpenAddresses training data
|
2016-08-28 17:58:41 -04:00 |
|
Al
|
6740e5a1c6
|
[fix] var name
|
2016-08-28 17:55:10 -04:00 |
|
Al
|
7ea47126ba
|
[fix] logging
|
2016-08-28 15:54:55 -04:00 |
|
Al
|
a58194ca2e
|
[fix] add_admin_boundaries and adding cleaned up house number
|
2016-08-28 15:15:57 -04:00 |
|
Al
|
51590825ee
|
[fix] do component dropout anyway
|
2016-08-28 14:07:49 -04:00 |
|
Al
|
f69e63e311
|
[openaddresses] Place component dropout. Obtain population from OSM components when we have them, but otherwise assume it's actually 0 (not unknown). That way the more conservative probabilities will be used, i.e. state names will be included more often rather than leaving cities unqualified
|
2016-08-28 13:59:28 -04:00 |
|
Al
|
dea5fbbf2e
|
[logging] printing off filenames in constructing OpenAddresses training data
|
2016-08-28 12:11:53 -04:00 |
|
Al
|
3da80b0706
|
[fix] typo
|
2016-08-28 11:55:40 -04:00 |
|
Al
|
aa62b8e8b4
|
[fix] indentation
|
2016-08-28 11:48:27 -04:00 |
|
Al
|
b8b1ac1261
|
[openaddresses] Handling validation after cleanup, adding per-field regex replacements
|
2016-08-28 11:47:30 -04:00 |
|
Al
|
3ae7a15960
|
[openaddresses] Adding a few special cases for Spanish. Rewrite simple numeric street names to include the oft-omitted Calle (e.g. 27 => Calle 27), which is uniformly omitted in the Spanish-language data in OpenAddresses while still being valid for grid-based cities like Mérida. Humans and signs usually add Calle for numeric streets while it may be omitted for named streets
|
2016-08-27 15:03:23 -04:00 |
|
Al
|
15f9817933
|
[openaddresses] Replacing number sign in house number
|
2016-08-27 02:42:06 -04:00 |
|
Al
|
01ac1371b5
|
[openaddresses] Cleaning up house numbers as well, which can sometimes be stored as floats
|
2016-08-27 01:50:05 -04:00 |
|
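
The last few entries above (3ae7a15960, 15f9817933, 01ac1371b5) describe small per-field cleanups: prefixing bare numeric Spanish street names with "Calle", handling number signs in house numbers, and cleaning house numbers stored as floats. A minimal sketch of what such rules might look like follows. The function names and regexes are hypothetical and not taken from the repository, and removing the number sign outright is an assumption, since the commit message only says it is "replaced".

```python
import re

# Hypothetical patterns for illustration only; the repository's own rules may differ.
NUMERIC_STREET = re.compile(r"^\d+$")
FLOAT_HOUSE_NUMBER = re.compile(r"^(\d+)\.0+$")


def spanish_street_name(street):
    """Prefix a bare numeric street name with "Calle" (e.g. "27" -> "Calle 27")."""
    street = street.strip()
    if NUMERIC_STREET.match(street):
        return "Calle {}".format(street)
    # Named streets and names that already carry a street type are left alone.
    return street


def clean_house_number(house_number):
    """Remove number signs (an assumption) and a trailing ".0" left by float storage."""
    house_number = house_number.replace("#", "").strip()
    match = FLOAT_HOUSE_NUMBER.match(house_number)
    if match:
        return match.group(1)
    # Values such as "37/A" or "11 C" pass through unchanged.
    return house_number
```

For example, `spanish_street_name("27")` yields `"Calle 27"` while `"Calle 27"` is returned as-is, and `clean_house_number("12.0")` yields `"12"`.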