Al
|
7298c895c8
|
[utils] adding a chunked shuffle as the concatenated file sizes may get larger than memory
|
2016-11-21 14:04:34 -05:00 |
|
Al
|
eff0443fcf
|
[openaddresses city_of_flint, not flint
|
2016-11-20 17:24:23 -05:00 |
|
Al
|
1d05c98cc4
|
[openaddresses] add Bucks County, PA
|
2016-11-20 13:02:10 -05:00 |
|
Al
|
a596d03309
|
[fix] return values
|
2016-11-19 12:45:39 -05:00 |
|
Al
|
1ef3d073db
|
[dictionaries] adding green to place names
|
2016-11-19 04:24:25 -05:00 |
|
Al
|
e15036fcce
|
[fix] if there are street types that are not venue words and not vice versa, then call the venue invalid as a standalone term
|
2016-11-19 04:11:33 -05:00 |
|
Al
|
8e905fd17d
|
[fix] if no venue names are passed in to formatted_addresses_with_venue_names, remove any existing venue name from the components as well
|
2016-11-19 03:46:16 -05:00 |
|
Al
|
e6fe576ec7
|
[fix] var
|
2016-11-19 03:15:23 -05:00 |
|
Al
|
1f50481cad
|
[fix] args
|
2016-11-19 03:14:06 -05:00 |
|
Al
|
4d14f80f0c
|
[osm] using the new gazetteer methods to do more thorough checks on single house names (if there are no other components than the standalone venue name, make sure it contains venue words like {library, bar}, etc. and not street type words like {road, street}, etc. so we don't get training examples that are simply "Abbey/house Road/house" with no house number or street name). If the venue name equals the street name or house number, drop it. Same if the venue name equals one of the admin components and no house number or street is present. If the venue name is numeric, require both a house number and a street name.
|
2016-11-19 03:12:24 -05:00 |
|
Al
|
5140db536a
|
[phrases] additions to venue names dictionaries and a more restrictive version of street types dictionaries
|
2016-11-19 02:58:27 -05:00 |
|
Al
|
71be0fdfbc
|
[fix] sets
|
2016-11-19 02:30:40 -05:00 |
|
Al
|
b6f7b5b577
|
[fix] name
|
2016-11-19 01:38:15 -05:00 |
|
Al
|
de9bf29af0
|
[addresses] allowing osm_components argument to AddressComponents.expanded
|
2016-11-19 01:38:02 -05:00 |
|
Al
|
1df1b60a9f
|
[phrases] adding extract_phrases method to gazetteers, which returns a set of gazetteer phrases found in a given string
|
2016-11-18 23:35:44 -05:00 |
|
Al
|
8ef8d88186
|
[fix] don't short-circuit OSM address formatting unless there are no components and no venue names
|
2016-11-18 23:31:24 -05:00 |
|
Al
|
25ceeed6ef
|
[fix] check before pop
|
2016-11-18 18:36:35 -05:00 |
|
Al
|
7a89c6e9ce
|
[osm] removing dependencies for house/venue name (purely numeric names taken care of in osm formatter)
|
2016-11-18 18:32:44 -05:00 |
|
Al
|
ca89a6ca2e
|
[fix] args
|
2016-11-18 18:09:48 -05:00 |
|
Al
|
72305975eb
|
[openaddresses] adding Nelson Mandela Bay as a pre-release download
|
2016-11-18 18:00:42 -05:00 |
|
Al
|
6e73d46097
|
[fix] typo
|
2016-11-18 00:50:18 -05:00 |
|
Al
|
4e30a23313
|
[addresses] Adding toponym abbreviation to the input admin components as well as those obtained through reverse geocoding. Also was doing two random tests before abbreviating toponyms, reducing their frequency in the training data, now correctly using a single test.
|
2016-11-17 19:53:09 -05:00 |
|
Al
|
a9fdfee2ac
|
[polygons] adding optional test_point for complex polygons with an admin_center, and including admin_center lat/lon as part of the properties
|
2016-11-17 19:36:32 -05:00 |
|
Al
|
c2ccec70ad
|
[polygons] adding lat/lon props to admin centers
|
2016-11-17 19:21:31 -05:00 |
|
Al
|
71d535e845
|
[polygons] using try/except in polygons
|
2016-11-17 17:38:54 -05:00 |
|
Al
|
d701bb1320
|
[polygons] only applying the new fix-on-read solution in the OSM admin/subdivision indices
|
2016-11-17 00:33:06 -05:00 |
|
Al
|
c1d4b03bb4
|
[polygons] moving polygon fixes to the to_polygon method so they get applied both at ingestion and on cache load
|
2016-11-16 23:25:48 -05:00 |
|
Al
|
a25ae7f9ef
|
[osm/polygons] adding fixed version of a polygon if polygon is invalid and doesn't contain its centroid
|
2016-11-16 17:38:01 -05:00 |
|
Al
|
0421b8b17c
|
[boundaries] Reading, UK
|
2016-11-16 03:48:21 -05:00 |
|
Al
|
9c5321d240
|
[boundaries] Bedford, UK
|
2016-11-16 03:45:50 -05:00 |
|
Al
|
749e495482
|
[boundaries] Nottingham, UK
|
2016-11-16 03:37:21 -05:00 |
|
Al
|
b5464f842b
|
[boundaries] converting admin_level=10 to city in the UK and Ireland
|
2016-11-16 03:21:15 -05:00 |
|
Al
|
4a0ed7c703
|
[boundaries] adding a few more city boundary exceptions to England and Scotland
|
2016-11-16 02:55:30 -05:00 |
|
Al
|
e85a1b906a
|
[fix] East Asian probabilities
|
2016-11-16 02:54:56 -05:00 |
|
Al
|
3617b3a10c
|
[fix] recursive merge for entries that are empty dictionaries
|
2016-11-16 02:19:07 -05:00 |
|
Al
|
b03494a736
|
[boundaries] adding admin_level=6 as cities in West Midlands (county), UK
|
2016-11-16 01:53:07 -05:00 |
|
Al
|
07f41a7565
|
[boundaries] adding York as city in UK (listed as admin_level=6)
|
2016-11-16 01:35:59 -05:00 |
|
Al
|
c5a48b4cd3
|
[fix] East Asian system po_box probabilities
|
2016-11-16 01:26:31 -05:00 |
|
Al
|
15b66f541c
|
[fix] refactor to use ComponentDependencies class
|
2016-11-15 17:07:10 -05:00 |
|
Al
|
68ab69cdc3
|
[fix] alias in formatter config
|
2016-11-15 17:04:15 -05:00 |
|
Al
|
dc65f518a5
|
[openaddresses] adding new US counties from OpenAddresses
|
2016-11-15 02:32:00 -05:00 |
|
Al
|
67f409cdf6
|
[places] adding dependencies to admin components e.g. so in some countries city_district must be accompanied by a city, etc.
|
2016-11-15 02:31:15 -05:00 |
|
Al
|
96fb725e54
|
[formatting] adding po_box insertions for East Asian addresses
|
2016-11-14 18:29:54 -05:00 |
|
Al
|
653b2d09c0
|
[addresses] moving component dependency graphs to a new module
|
2016-11-14 16:45:15 -05:00 |
|
Al
|
495b27470e
|
[addresses] refactoring address component dependency graphs
|
2016-11-12 18:09:36 -05:00 |
|
Al
|
b42159205d
|
[openaddresses] adding some of the new US counties from OA
|
2016-11-12 17:56:29 -05:00 |
|
Al
|
7c9c600e07
|
[openaddresses] add new counties from upstream
|
2016-11-06 00:06:59 -04:00 |
|
Al
|
a6c88f54ab
|
[openaddresses] add Forsyth county
|
2016-11-02 23:59:50 -04:00 |
|
Al
|
7cdccbe31f
|
[openaddresses] adding fixed sources in ID
|
2016-10-31 11:22:48 -04:00 |
|
Al
|
353c6c7b7a
|
[openaddresses] adding Jefferson County, AL
|
2016-10-28 10:58:04 -04:00 |
|