Al
|
6ef40c1769
|
[fix] dupe checking
|
2015-11-30 18:43:11 -05:00 |
|
Al
|
af170de019
|
[fix] Smaller probabilities on adding neighborhoods and admin polygons, eliminating duplicates on the row level
|
2015-11-30 18:35:31 -05:00 |
|
Al
|
621fd79002
|
[fix] var
|
2015-11-30 18:20:26 -05:00 |
|
Al
|
b430fb7657
|
[osm/formatting] Adding pick random name logic to neighborhoods as well, getting rid of drop probabilities as they're covered elsewhere, adding several forms of venue names to the training data
|
2015-11-30 18:10:18 -05:00 |
|
Al
|
d4b6450f19
|
[formatting] Not applying template replacements from address formatting by default
|
2015-11-30 16:11:13 -05:00 |
|
Al
|
839a12b212
|
[osm/formatting] Changing drop probabilities and doing it in random order
|
2015-11-30 15:27:35 -05:00 |
|
Al
|
89677d94a3
|
[parsing] Initial commit of the address parser, training/testing, feature function, I/O
|
2015-11-30 14:48:13 -05:00 |
|
Al
|
9a8ba14887
|
[osm/formatting] Adding per-field drop probabilities to OSM training data to make some fields more likely to be dropped, although it might create more training data
|
2015-11-30 11:10:12 -05:00 |
|
Al
|
c8e4602d4c
|
[fix] Neighborhoods reverse geocoder discriminates between OSM matched with Zetashapes and OSM matched with Quattroshapes
|
2015-11-30 10:59:50 -05:00 |
|
Al
|
15d9e00121
|
[osm/formatting] Adding in more ISO alpha-3 codes for countries in the training data
|
2015-11-28 14:08:07 -05:00 |
|
Al
|
66778737ff
|
[fix] non-local language states
|
2015-11-28 13:48:59 -05:00 |
|
Al
|
69ba631dc9
|
[docs] updating params in OSM training data docs
|
2015-11-28 01:09:14 -05:00 |
|
Al
|
3cd1fee89d
|
[fix] KeyError
|
2015-11-27 14:40:11 -05:00 |
|
Al
|
a77bc03977
|
[fix] language
|
2015-11-27 14:24:32 -05:00 |
|
Al
|
38d4e2d67a
|
[fix] cities
|
2015-11-27 14:05:53 -05:00 |
|
Al
|
3cf98770e3
|
[fix] var name
|
2015-11-27 13:54:38 -05:00 |
|
Al
|
2e0f35b13a
|
[fix] key checks for Quattroshapes cities, removing city in non-local language case
|
2015-11-27 13:45:51 -05:00 |
|
Al
|
105ba313c5
|
[fix] var name
|
2015-11-27 12:00:11 -05:00 |
|
Al
|
3eea355352
|
[fix] argument order
|
2015-11-27 11:47:39 -05:00 |
|
Al
|
51f6a82727
|
[fix] import again
|
2015-11-27 11:38:40 -05:00 |
|
Al
|
644eeb74c6
|
[fix] import
|
2015-11-27 11:17:53 -05:00 |
|
Al
|
2830986073
|
[osm/formatting] Adding in cities from Quattroshapes/GeoNames in the case of non-local languages or in general with a small random probability
|
2015-11-27 11:09:12 -05:00 |
|
Al
|
b0667d0032
|
[fix] only care about levels in Quattroshapes index, not Zetashapes
|
2015-11-26 23:45:50 -05:00 |
|
Al
|
0eb0042826
|
[fix] Same in neighborhoods reverse geocoder lookups
|
2015-11-26 14:17:17 -05:00 |
|
Al
|
4170f6e9e3
|
[fix] same options for geohash-based index
|
2015-11-26 14:14:53 -05:00 |
|
Al
|
4cff1f8a9d
|
[fix] Quattroshapes neighborhoods index uses geohashes for slightly better coverage
|
2015-11-26 12:45:54 -05:00 |
|
Al
|
98d8054a2b
|
[polygons/quattroshapes] Converting Quattroshapes lookups to an R-tree index
|
2015-11-25 19:37:57 -05:00 |
|
Al
|
8a8e45f2a6
|
[fix] filenames
|
2015-11-25 18:08:04 -05:00 |
|
Al
|
bd88628a98
|
[polygons/quattroshapes] Removing local admin and neighborhoods from the Quattroshapes reverse geocoder since they're covered in neighborhoods
|
2015-11-25 18:06:14 -05:00 |
|
Al
|
40d18aa7f6
|
[polygons/osm] Switching back to buffer(0). Still destroys many polygons, may need to look into another solution
|
2015-11-25 17:10:50 -05:00 |
|
Al
|
a50c971732
|
[polygons/osm] Ommitting last node in every way of a connected component since that node is equal to the start node of its neighbor
|
2015-11-25 17:09:19 -05:00 |
|
Al
|
d6d5eab989
|
[geonames] Adding ability to lookup GeoNames alternate names (may obtain IDs from Quattroshapes). Not great for local-language primary names (OSM remains the best) but decent for extracting foreign toponyms
|
2015-11-25 17:07:14 -05:00 |
|
Al
|
3217fa39cd
|
[fix] add country randomly in the formatted language training data in cases where country is not present
|
2015-11-25 14:54:41 -05:00 |
|
Al
|
1a6618957b
|
[fix] Python float precision doesn't appear to be the problem
|
2015-11-25 11:29:08 -05:00 |
|
Al
|
5781813cbd
|
[fix] For countries like Denmark, removing country with a smaller probability
|
2015-11-25 00:39:52 -05:00 |
|
Al
|
e4b8349d98
|
[fix] sparsity of country tags should be enough for language address training data
|
2015-11-25 00:32:01 -05:00 |
|
Al
|
824c779107
|
[fix] Cutting down training repeatedly on country names
|
2015-11-24 23:22:57 -05:00 |
|
Al
|
88529d28e2
|
[fix] country formatting in language address training data
|
2015-11-24 23:20:31 -05:00 |
|
Al
|
cd74fcda3c
|
[fix] not requiring minimal keys in format language data
|
2015-11-24 23:13:28 -05:00 |
|
Al
|
e560e53308
|
[fix] formatter
|
2015-11-24 22:27:57 -05:00 |
|
Al
|
8c422a6e61
|
[osm] Adding new localized country names in anguage training data for formatted addresses
|
2015-11-24 21:49:10 -05:00 |
|
Al
|
e40ca0bb89
|
[fix] Removing house numbers from formatted address language training data, using a simple whitespace splitter
|
2015-11-24 21:15:22 -05:00 |
|
Al
|
a92cbb8003
|
[osm] Trying fixed-point precision in converting OSM coordinates to avoid issues with polygon self-intersection when the lines are very close together (e.g. parts of Berlin, UK country polygon)
|
2015-11-24 15:13:16 -05:00 |
|
Al
|
ef9c5c2ca1
|
[fix] args
|
2015-11-24 11:02:35 -05:00 |
|
Al
|
e75c1ce860
|
[fix] limited addresses
|
2015-11-24 11:01:22 -05:00 |
|
Al
|
94039f98ad
|
[fix] argument validation in OSM training data script
|
2015-11-24 10:59:16 -05:00 |
|
Al
|
de9f3120c8
|
[polygons] Trying a slightly higher value for buffer() as suggested by this issue https://github.com/Toblerity/Shapely/issues/277
|
2015-11-23 15:43:23 -05:00 |
|
Al
|
6d20d7348f
|
[osm] Using OSM namespaced tags from polygons in the case of non-local languages
|
2015-11-23 14:42:30 -05:00 |
|
Al
|
e46e1a93a0
|
[fix] ISO code and simple/international name checks should be on the polygons
|
2015-11-23 14:30:38 -05:00 |
|
Al
|
eb7488ab55
|
[fix] Making country replacement probability independent of the probability used for local vs non-local languages
|
2015-11-23 13:46:14 -05:00 |
|