Al
|
71be0fdfbc
|
[fix] sets
|
2016-11-19 02:30:40 -05:00 |
|
Al
|
b6f7b5b577
|
[fix] name
|
2016-11-19 01:38:15 -05:00 |
|
Al
|
de9bf29af0
|
[addresses] allowing osm_components argument to AddressComponents.expanded
|
2016-11-19 01:38:02 -05:00 |
|
Al
|
1df1b60a9f
|
[phrases] adding extract_phrases method to gazetteers, which returns a set of gazetteer phrases found in a given string
|
2016-11-18 23:35:44 -05:00 |
|
Al
|
8ef8d88186
|
[fix] don't short-circuit OSM address formatting unless there are no components and no venue names
|
2016-11-18 23:31:24 -05:00 |
|
Al
|
25ceeed6ef
|
[fix] check before pop
|
2016-11-18 18:36:35 -05:00 |
|
Al
|
7a89c6e9ce
|
[osm] removing dependencies for house/venue name (purely numeric names taken care of in osm formatter)
|
2016-11-18 18:32:44 -05:00 |
|
Al
|
ca89a6ca2e
|
[fix] args
|
2016-11-18 18:09:48 -05:00 |
|
Al
|
6e73d46097
|
[fix] typo
|
2016-11-18 00:50:18 -05:00 |
|
Al
|
4e30a23313
|
[addresses] Adding toponym abbreviation to the input admin components as well as those obtained through reverse geocoding. Also was doing two random tests before abbreviating toponyms, reducing their frequency in the training data, now correctly using a single test.
|
2016-11-17 19:53:09 -05:00 |
|
Al
|
a9fdfee2ac
|
[polygons] adding optional test_point for complex polygons with an admin_center, and including admin_center lat/lon as part of the properties
|
2016-11-17 19:36:32 -05:00 |
|
Al
|
c2ccec70ad
|
[polygons] adding lat/lon props to admin centers
|
2016-11-17 19:21:31 -05:00 |
|
Al
|
71d535e845
|
[polygons] using try/except in polygons
|
2016-11-17 17:38:54 -05:00 |
|
Al
|
d701bb1320
|
[polygons] only applying the new fix-on-read solution in the OSM admin/subdivision indices
|
2016-11-17 00:33:06 -05:00 |
|
Al
|
c1d4b03bb4
|
[polygons] moving polygon fixes to the to_polygon method so they get applied both at ingestion and on cache load
|
2016-11-16 23:25:48 -05:00 |
|
Al
|
a25ae7f9ef
|
[osm/polygons] adding fixed version of a polygon if polygon is invalid and doesn't contain its centroid
|
2016-11-16 17:38:01 -05:00 |
|
Al
|
3617b3a10c
|
[fix] recursive merge for entries that are empty dictionaries
|
2016-11-16 02:19:07 -05:00 |
|
Al
|
15b66f541c
|
[fix] refactor to use ComponentDependencies class
|
2016-11-15 17:07:10 -05:00 |
|
Al
|
67f409cdf6
|
[places] adding dependencies to admin components e.g. so in some countries city_district must be accompanied by a city, etc.
|
2016-11-15 02:31:15 -05:00 |
|
Al
|
653b2d09c0
|
[addresses] moving component dependency graphs to a new module
|
2016-11-14 16:45:15 -05:00 |
|
Al
|
495b27470e
|
[addresses] refactoring address component dependency graphs
|
2016-11-12 18:09:36 -05:00 |
|
Al
|
e9106698d2
|
[fix] convert newlines
|
2016-10-27 12:01:48 -04:00 |
|
Al
|
5cabd9b4f7
|
[fix] country languages in OpenAddresses
|
2016-10-24 17:35:39 -04:00 |
|
Al
|
35d3d8cc73
|
[openaddresses] countries are known a priori, so if the boundaries don't quite line up with OSM, use the country from the path
|
2016-10-23 19:50:54 -04:00 |
|
Al
|
f429bea15b
|
[fix] subtract abs value
|
2016-10-23 01:11:09 -04:00 |
|
Al
|
1658c425c5
|
[fix] clear country cache only at each new country, not each file
|
2016-10-23 00:57:52 -04:00 |
|
Al
|
7199ff17e0
|
[fix] truncate postcodes that are longer than specified length
|
2016-10-23 00:52:24 -04:00 |
|
Al
|
889e914dfc
|
[openaddresses] clear all polygon caches
|
2016-10-23 00:11:54 -04:00 |
|
Al
|
0fd431a9d2
|
[fix] abs
|
2016-10-22 23:55:30 -04:00 |
|
Al
|
ec54d3de35
|
[fix] don't convert number to int/float in numeric_phrase (chops leading zeros)
|
2016-10-22 23:49:58 -04:00 |
|
Al
|
63edd53fb3
|
[openaddresses] adding clear_cache method to clear the LRU cache for point-in-polygon indices and using it in OpenAddresses import since it heavily reuses polygons and only for the current file
|
2016-10-22 20:28:59 -04:00 |
|
Al
|
d51a1d6196
|
[addresses] doing hyphenation for existing components in component expansion (i.e. OSM training data)
|
2016-10-21 22:02:19 -04:00 |
|
Al
|
2a355b2cf8
|
[openaddresses] adding address only 10% of the time in OpenAddresses
|
2016-10-20 23:57:30 -04:00 |
|
Al
|
d965ea9371
|
[openaddresses] adding hyphenation/dehyphenation to the OpenAddresses formatter
|
2016-10-20 20:55:17 -04:00 |
|
Al
|
00ebdfed7f
|
[osm] adding alt_place_names to the shared formatting class AddressComponents and making them classmethods
|
2016-10-20 20:41:22 -04:00 |
|
Al
|
d9bc465c82
|
[osm] parsing out semicolon-delimited postal codes from OSM in countries like Poland that use hyphen delimited postcodes without treating them as number ranges
|
2016-10-19 17:46:42 -04:00 |
|
Al
|
ec77a247fa
|
[fix] just ignore records without the "name" tag
|
2016-10-19 13:36:15 -04:00 |
|
Al
|
61078eded9
|
[fix] checking for dictionary key
|
2016-10-19 13:34:13 -04:00 |
|
Al
|
c2b73307de
|
[fix] parens
|
2016-10-19 13:29:56 -04:00 |
|
Al
|
f639151698
|
[osm] checking for non-admin_center nodes which are part of a lower admin level polygon with the same name
|
2016-10-19 13:27:38 -04:00 |
|
Al
|
e380567ac4
|
[osm] adding alt_place_names method which does hyphenation, de-hyphenation and abbreviated toponyms with/without hyphens
|
2016-10-19 02:19:09 -04:00 |
|
Al
|
51afc2619b
|
[fix] only replace whitespace between words, not for instance whitespace around an existing hyphen, and reducing to one space for spaced hyphens
|
2016-10-19 01:24:54 -04:00 |
|
Al
|
e8899eafd6
|
[osm] adding hyphenation/de-hyphenation to OSM admin components
|
2016-10-19 01:00:29 -04:00 |
|
Al
|
98ac232eea
|
[osm] hyphenating and de-hyphenating place names in places training data
|
2016-10-19 00:33:10 -04:00 |
|
Al
|
72e7d3ff5b
|
[addresses/hyphens] adding some methods to hyphenate/dehyphenate place names at random
|
2016-10-18 19:10:31 -04:00 |
|
Al
|
7e007a49ab
|
[osm] removing place=district mapping globally (means city_district in Hungary) and mapping it specifically to state_district/city_district in the places where it's needed
|
2016-10-18 19:02:36 -04:00 |
|
Al
|
d34faf42b8
|
[osm] fix names with pipes in them
|
2016-10-17 02:32:25 -04:00 |
|
Al
|
a796b41d90
|
[geonames] admin codes on geonames/postal_codes tables
|
2016-10-17 00:21:33 -04:00 |
|
Al
|
ff27ee14bb
|
[osm] only add label props if the name property is identical (counterexample, Nottinghamshire's label is listed as West Bridgford, which is really its admin_center)
|
2016-10-16 22:18:52 -04:00 |
|
Al
|
9fb936019a
|
[geoplanet] script to create GeoPlanet postal codes training data
|
2016-10-12 15:05:45 -04:00 |
|