Commit Graph

4022 Commits

Author SHA1 Message Date
Al
e6fe576ec7 [fix] var 2016-11-19 03:15:23 -05:00
Al
1f50481cad [fix] args 2016-11-19 03:14:06 -05:00
Al
4d14f80f0c [osm] using the new gazetteer methods to do more thorough checks on single house names (if there are no other components than the standalone venue name, make sure it contains venue words like {library, bar}, etc. and not street type words like {road, street}, etc. so we don't get training examples that are simply "Abbey/house Road/house" with no house number or street name). If the venue name equals the street name or house number, drop it. Same if the venue name equals one of the admin components and no house number or street is present. If the venue name is numeric, require both a house number and a street name. 2016-11-19 03:12:24 -05:00
Al
5140db536a [phrases] additions to venue names dictionaries and a more restrictive version of street types dictionaries 2016-11-19 02:58:27 -05:00
Al
71be0fdfbc [fix] sets 2016-11-19 02:30:40 -05:00
Al
b6f7b5b577 [fix] name 2016-11-19 01:38:15 -05:00
Al
de9bf29af0 [addresses] allowing osm_components argument to AddressComponents.expanded 2016-11-19 01:38:02 -05:00
Al
1df1b60a9f [phrases] adding extract_phrases method to gazetteers, which returns a set of gazetteer phrases found in a given string 2016-11-18 23:35:44 -05:00
Al
8ef8d88186 [fix] don't short-circuit OSM address formatting unless there are no components and no venue names 2016-11-18 23:31:24 -05:00
Al
25ceeed6ef [fix] check before pop 2016-11-18 18:36:35 -05:00
Al
7a89c6e9ce [osm] removing dependencies for house/venue name (purely numeric names taken care of in osm formatter) 2016-11-18 18:32:44 -05:00
Al
ca89a6ca2e [fix] args 2016-11-18 18:09:48 -05:00
Al
72305975eb [openaddresses] adding Nelson Mandela Bay as a pre-release download 2016-11-18 18:00:42 -05:00
Al
6e73d46097 [fix] typo 2016-11-18 00:50:18 -05:00
Al
4e30a23313 [addresses] Adding toponym abbreviation to the input admin components as well as those obtained through reverse geocoding. Also was doing two random tests before abbreviating toponyms, reducing their frequency in the training data, now correctly using a single test. 2016-11-17 19:53:09 -05:00
Al
a9fdfee2ac [polygons] adding optional test_point for complex polygons with an admin_center, and including admin_center lat/lon as part of the properties 2016-11-17 19:36:32 -05:00
Al
c2ccec70ad [polygons] adding lat/lon props to admin centers 2016-11-17 19:21:31 -05:00
Al
71d535e845 [polygons] using try/except in polygons 2016-11-17 17:38:54 -05:00
Al
d701bb1320 [polygons] only applying the new fix-on-read solution in the OSM admin/subdivision indices 2016-11-17 00:33:06 -05:00
Al
c1d4b03bb4 [polygons] moving polygon fixes to the to_polygon method so they get applied both at ingestion and on cache load 2016-11-16 23:25:48 -05:00
Al
a25ae7f9ef [osm/polygons] adding fixed version of a polygon if polygon is invalid and doesn't contain its centroid 2016-11-16 17:38:01 -05:00
Al
0421b8b17c [boundaries] Reading, UK 2016-11-16 03:48:21 -05:00
Al
9c5321d240 [boundaries] Bedford, UK 2016-11-16 03:45:50 -05:00
Al
749e495482 [boundaries] Nottingham, UK 2016-11-16 03:37:21 -05:00
Al
b5464f842b [boundaries] converting admin_level=10 to city in the UK and Ireland 2016-11-16 03:21:15 -05:00
Al
4a0ed7c703 [boundaries] adding a few more city boundary exceptions to England and Scotland 2016-11-16 02:55:30 -05:00
Al
e85a1b906a [fix] East Asian probabilities 2016-11-16 02:54:56 -05:00
Al
3617b3a10c [fix] recursive merge for entries that are empty dictionaries 2016-11-16 02:19:07 -05:00
Al
b03494a736 [boundaries] adding admin_level=6 as cities in West Midlands (county), UK 2016-11-16 01:53:07 -05:00
Al
07f41a7565 [boundaries] adding York as city in UK (listed as admin_level=6) 2016-11-16 01:35:59 -05:00
Al
c5a48b4cd3 [fix] East Asian system po_box probabilities 2016-11-16 01:26:31 -05:00
Al
15b66f541c [fix] refactor to use ComponentDependencies class 2016-11-15 17:07:10 -05:00
Al
68ab69cdc3 [fix] alias in formatter config 2016-11-15 17:04:15 -05:00
Al
dc65f518a5 [openaddresses] adding new US counties from OpenAddresses 2016-11-15 02:32:00 -05:00
Al
67f409cdf6 [places] adding dependencies to admin components e.g. so in some countries city_district must be accompanied by a city, etc. 2016-11-15 02:31:15 -05:00
Al
96fb725e54 [formatting] adding po_box insertions for East Asian addresses 2016-11-14 18:29:54 -05:00
Al
653b2d09c0 [addresses] moving component dependency graphs to a new module 2016-11-14 16:45:15 -05:00
Al
495b27470e [addresses] refactoring address component dependency graphs 2016-11-12 18:09:36 -05:00
Al
b42159205d [openaddresses] adding some of the new US counties from OA 2016-11-12 17:56:29 -05:00
Al
7c9c600e07 [openaddresses] add new counties from upstream 2016-11-06 00:06:59 -04:00
Al
a6c88f54ab [openaddresses] add Forsyth county 2016-11-02 23:59:50 -04:00
Al
7cdccbe31f [openaddresses] adding fixed sources in ID 2016-10-31 11:22:48 -04:00
Al
353c6c7b7a [openaddresses] adding Jefferson County, AL 2016-10-28 10:58:04 -04:00
Al
e9106698d2 [fix] convert newlines 2016-10-27 12:01:48 -04:00
Al
e48f207d10 [openaddresses] updating with new OpenAddresses sources 2016-10-27 11:19:30 -04:00
Al
5cabd9b4f7 [fix] country languages in OpenAddresses 2016-10-24 17:35:39 -04:00
Al
ac0eb1776e [openaddresses] adding Brazoria County, TX 2016-10-24 09:27:11 -04:00
Al
35d3d8cc73 [openaddresses] countries are known a priori, so if the boundaries don't quite line up with OSM, use the country from the path 2016-10-23 19:50:54 -04:00
Al
f429bea15b [fix] subtract abs value 2016-10-23 01:11:09 -04:00
Al
1658c425c5 [fix] clear country cache only at each new country, not each file 2016-10-23 00:57:52 -04:00