Commit Graph

1719 Commits

Author SHA1 Message Date
Al
2d0a0f1c83 [dictionaries] Adding a few English abbreviations/expansions 2016-04-09 14:53:33 -04:00
Al
26581aeb4d [numex] string keys 2016-04-08 18:13:08 -04:00
Al
d38de71854 [dictionaries] encapsulating reading address dictionaries so it's easy to implement sampling for the address training data 2016-04-08 18:12:30 -04:00
Al
02e82e5342 [numex] Nicer API for ordinal suffixes 2016-04-08 17:10:18 -04:00
Al
737b5d06ed [osm/polygons] Adding properties in building polygons 2016-04-08 12:33:40 -04:00
Al
3bc85db41e [numex] Moving numex files to YAML as well 2016-04-07 13:26:00 -04:00
Al
5fce5e8000 [osm/polygons] add building:part to building polygons 2016-04-07 13:15:42 -04:00
Al
778fba2451 [osm] Moving OSM boundaries to YAML files instead of JSON for consistency 2016-04-06 22:59:46 -04:00
Al
f2f131661a [osm/polygons] Using greater simplify tolerance 2016-04-06 20:24:37 -04:00
Al
69ef201cf1 [fix] simplify_polygons in building geocoder, and adding caching back to OSM admin polygons as it's faster when taking into account startup time. Also adding a few properties to buildings and landuse polygons 2016-04-06 13:53:47 -04:00
Al
502c61d9db [osm/polygons] Same check for closed ways as for relations in OSM polygon readers 2016-04-06 01:35:36 -04:00
Al
984cdc0650 [osm/polygons] From benchmarking it seems to make sense to keep OSM polygons in memory after all 2016-04-05 23:25:45 -04:00
Al
fbebcc11d0 [fix] properties/polygon key split 2016-04-05 22:47:48 -04:00
Al
ee160c715b [osm/polygons] Trying persistent polygons again on OSM/Quattroshapes to test the new settings 2016-04-05 19:46:45 -04:00
Al
b8ccb8bfa1 [osm/polygons] Storing polygon JSON under a different key so it doesn't have to be read from disk after a successful cache matched point-in-polygon test just to retrieve the properties 2016-04-05 19:45:44 -04:00
Al
a8ea5f47c3 [fix] var name 2016-04-05 19:23:08 -04:00
Al
65e0067ed0 [fix] classmethod for loading polygons 2016-04-05 19:20:12 -04:00
Al
a8b0114871 [osm/polygons] Keep OSM/Quattroshapes admin polygons in memory as there are fewer of them and they are large 2016-04-05 19:14:17 -04:00
Al
b693fe11dd [fix] double prep 2016-04-05 18:49:52 -04:00
Al
136700fa7f [fix] return_all in polygon index 2016-04-05 18:42:20 -04:00
Al
e242868fd9 [osm/polygons] Keep stats on cache hits/misses for testing cache sizes 2016-04-05 16:46:19 -04:00
Al
ec29c36cbc [build] Adding lru-dict, a fast C LRU cache, to requirements.txt for geodata package 2016-04-05 14:55:35 -04:00
Al
004165d184 [osm/polygons] Using an LRU cache for prepped polygons in the various PolygonIndex subclasses. That way can store less simplified polygons but keep frequently accessed ones (like countries) in memory 2016-04-05 14:53:07 -04:00
Al
01567d2672 [osm/boundaries] admin_level 10 in Spain = suburb 2016-04-05 01:26:29 -04:00
Al
1af5b88922 [fix] name 2016-04-05 00:51:01 -04:00
Al
49498ccf81 [fix] import 2016-04-04 23:38:30 -04:00
Al
6bb6ddb06a [fix] arg name 2016-04-04 22:41:20 -04:00
Al
0107473c6d [osm] Adding place=plot to subdivisions data set 2016-04-04 22:15:07 -04:00
Al
1ded6567f0 [fix] command for subdivision polys 2016-04-04 21:55:58 -04:00
Al
d570ca406b [fix] import 2016-04-04 21:54:45 -04:00
Al
4aacad3676 [fix] var scope 2016-04-04 21:43:05 -04:00
Al
1844f99baf [fix] Simplify OSM polygons but using the new threshold 2016-04-04 21:39:26 -04:00
Al
860498d509 [osm] Storing polygon properties in a LevelDB, polygons themselves stay in memory 2016-04-04 21:30:16 -04:00
Al
145ede2bcf [osm] Adding building polygon reader, including closed ways for admin polys 2016-04-04 19:59:10 -04:00
Al
adb9c31321 [requirements] Adding LevelDB to requirements 2016-04-04 19:55:37 -04:00
Al
a35e037941 [dictionaries] More Spanish abbreviations 2016-04-03 15:59:01 -04:00
Al
61cff4dc87 [osm] Splitting out generic amenities like ATM, fuel, restrooms, etc. so they can be used in category queries. Adding subdivision polygons, postcode polygons, building polygons, adding a few types of place keys to venues data set 2016-04-03 15:36:36 -04:00
Al
e642e91e0e [addresses] Adding stairway/entrance strings to the English address configs 2016-03-30 10:25:25 -04:00
Al
af323295c1 [dictionaries] Spanish dictionary for postcode 2016-03-29 20:40:01 -04:00
Al
c1822917eb [categories] Adding categories section to the English parser config 2016-03-29 20:38:55 -04:00
Al
5327f4fe25 [osm] adding civil boundaries (e.g. postal areas in Dublin), fixing output files 2016-03-29 18:54:26 -04:00
Al
5b6874ff1d [categories] Using TSV files instead of YAML for category queries, easier to edit 2016-03-29 18:37:37 -04:00
Al
971ab42750 Merge branch 'master' into parser-data 2016-03-29 17:13:56 -04:00
Al
14e8f50cf1 [fix] Expansions when passing in the address_components= option. Was only limiting results at the phrase level, should work at the individual expansion level 2016-03-29 16:46:29 -04:00
Travis
2795d258d1 [auto][ci skip] Adding data files from Travis build #108 2016-03-29 19:11:57 +00:00
Al
a077f72fd7 Merge branch 'blackat87-patch-1' 2016-03-29 14:07:56 -04:00
Al
65335e6e49 [dictionaries] Adding contrà, rio terà as canonical forms, adding c.le for cortile 2016-03-29 14:07:17 -04:00
Al
7e583d3f7a [fix] separating slashes by whitespace as they're separate tokens 2016-03-29 13:44:35 -04:00
Al
6dad58c696 [fix][ci skip] last remaining instance of vignt in libpostal 2016-03-29 12:51:19 -04:00
Al
f819a8ef96 [fix] spacing in numex 2016-03-29 12:42:16 -04:00