Commit Graph

28 Commits

Author SHA1 Message Date
Al
dc2766ae5d [fix] __init__ 2015-08-14 00:49:06 -04:00
Al
62c67aa970 [osm] Using pipe splitter for address components 2015-08-14 00:45:49 -04:00
Al
2bd763be03 [osm] Prefer amenity tag, skip if the building tag is simply building=yes 2015-08-13 21:16:34 -04:00
Al
c844d0484a [fix] carriage returns 2015-08-13 21:07:12 -04:00
Al
ef14aa2b7e [osm] Replacing escape chars at write time as there's no quoting, adding building key to venue training data 2015-08-13 19:30:44 -04:00
Al
46f2c68a69 [osm] Using tsv_no_quote writers in all OSM training data files 2015-08-13 18:40:41 -04:00
Al
cdb9afddd3 [fix] address training data carriage returns 2015-07-25 00:35:27 -04:00
Al
5cba747a93 [fix] variable name 2015-07-17 03:06:09 -04:00
Al
5e7bb54a5c [polygons] only add language polygons if there's one default language 2015-07-17 02:19:55 -04:00
Al
d5ac816066 [fix] import 2015-07-16 13:33:50 -04:00
Al
8899be6eef [osm] choosing the first default language for OSM training data, fixing way/relation offsets 2015-07-16 13:32:16 -04:00
Al
d57f9df7ed [fix] regexes 2015-07-14 14:04:32 -04:00
Al
d494963dcd [fix] lat/lon conversion in address formatting 2015-07-14 13:34:22 -04:00
Al
a0f2ff1e2a [fix] adding encoding declaration 2015-07-13 21:09:18 -04:00
Al
d15737b319 [osm] Validating lat/lon in OSM training data 2015-07-13 21:08:08 -04:00
Al
0c18a57c4e [fix] planet url no longer needed 2015-07-13 14:27:26 -04:00
Al
e8348dde0e [osm] removing all the fetch/convert arguments from training data generator 2015-07-13 14:24:54 -04:00
Al
5e9e08f6b1 [fix] making fetch script executable 2015-07-13 14:19:24 -04:00
Al
465bcd46aa [fix] input file in OSM training data generator 2015-07-13 14:18:24 -04:00
Al
961606ac12 [fix] removing intermediate file in OSM fetch 2015-07-13 14:17:57 -04:00
Al
59bf23ae67 [osm] Planet admin bounds filter 2015-07-13 04:08:55 -04:00
Al
ec1e820268 [parsing] Changing to OpenCageData repo 2015-07-09 13:44:14 -04:00
Al
cb2035867b [fix] osm geodata imports 2015-06-15 18:36:01 -04:00
Al
22fa81b33f [fix] __init__.py 2015-06-15 17:54:27 -04:00
Al
6c8e5b45a4 [fix] removing building alias (for OSm it means building category), fix to fetch script 2015-03-18 08:40:07 -04:00
Al
aeac0fe8c0 [geodata] Script to construct OSM training examples for building language dictionaries, disambiguating between abbreviations, classifying venues by type and formatting addresses for use in a sequence model with Lokku's address-formatting repo. 2015-03-17 18:11:07 -04:00
Al
0437271c92 [geodata] OSM planet fetch needs to convert ways/relations to nodes for all data sets 2015-03-17 16:51:17 -04:00
Al
621b25c964 [geodata] script to fetch/transform OSM planet (needs about 100GB of disk free) training language models 2015-03-16 00:45:14 -04:00