Al
|
35dbce59d2
|
[osm] base case for default_language, applying the ways/relations requirement again as the nodes are mostly motorway_junction and can often be just a city name, etc.
|
2017-01-16 19:10:27 -05:00 |
|
Al
|
96a98fc63c
|
[fix] var name II
|
2017-01-16 18:57:29 -05:00 |
|
Al
|
582d042e95
|
[fix] var name
|
2017-01-16 18:56:20 -05:00 |
|
Al
|
b28728b017
|
[fix] tuple
|
2017-01-16 18:53:40 -05:00 |
|
Al
|
42b0a4cf68
|
[fix] var name
|
2017-01-16 18:46:08 -05:00 |
|
Al
|
4902e88b81
|
[fix] formatted OSM ways training data should use nodes as well as ways/relations
|
2017-01-16 18:39:53 -05:00 |
|
Al
|
449154d624
|
[fix] arg
|
2017-01-16 15:34:38 -05:00 |
|
Al
|
be763539d3
|
[fix] remove var
|
2017-01-16 15:31:26 -05:00 |
|
Al
|
8c92013c43
|
[fix] args to way_names
|
2017-01-16 15:29:16 -05:00 |
|
Al
|
934f6247c6
|
[osm] options to build the streets-only training data
|
2017-01-16 15:26:04 -05:00 |
|
Al
|
5c53b84044
|
[fix] genitives in OpenAddresses where needed
|
2017-01-16 00:53:02 -05:00 |
|
Al
|
3565834d4e
|
[openaddresses] script path alterations
|
2017-01-16 00:46:27 -05:00 |
|
Al
|
a0150f37d0
|
[osm] better lat/lon conversion for admin_center point
|
2017-01-14 17:48:37 -05:00 |
|
Al
|
c7e644ca51
|
[fix] validating number ranges in extract_valid_postcodes as well
|
2017-01-12 14:09:33 -05:00 |
|
Al
|
59ed268558
|
[osm] require name tag for formatted places
|
2017-01-12 13:00:07 -05:00 |
|
Al
|
d3c4f6fff5
|
[fix] valid names
|
2017-01-12 12:16:41 -05:00 |
|
Al
|
b90d88db3e
|
[fix] import
|
2017-01-12 12:08:40 -05:00 |
|
Al
|
ba0f097d78
|
[boundaries] adding check for valid name key in formatted places, and removing short_name from the Sao Paulo relation as well
|
2017-01-12 12:05:42 -05:00 |
|
Al
|
122d7b2b79
|
[fix] only using the revised address components for CLDR country name
|
2017-01-12 02:33:16 -05:00 |
|
Al
|
88a80f4e30
|
[fix] using normalized tags throughout in OSM formatted place data
|
2017-01-12 02:25:17 -05:00 |
|
Al
|
09b3aeb7d9
|
[fix] component
|
2017-01-11 16:50:54 -05:00 |
|
Al
|
ed5dd28023
|
[addresses] adding some more synonyms to Brasilia street regex
|
2017-01-11 16:31:30 -05:00 |
|
Al
|
bec569adaa
|
[osm] adding new validity check to venue names so if the Jaccard(name tokens, street & house number tokens) == 1 and the address does not have a known venue type e.g. a restaurant, the "venue name" is actually just the street address and can be discarded
|
2017-01-11 16:23:42 -05:00 |
|
Al
|
7f851810d2
|
[addresses] formatting addresses in Brasilia, so e.g. "Bloco B" is never part of the street name or building name, it's the house number. place=neighbourhood maps to nothing in Brasilia as these are basically subdivisions whose streets are identically named
|
2017-01-11 16:18:04 -05:00 |
|
Al
|
0d030a98c5
|
[osm] adding airport polygon index
|
2017-01-11 04:25:54 -05:00 |
|
Al
|
d528095984
|
[addresses] adding random unit numbers with more digits
|
2017-01-11 04:24:35 -05:00 |
|
Al
|
979fd16215
|
[osm] adding airports and terminals data sets with points and polygons, more file cleanup in OSM fetch script
|
2017-01-10 16:20:32 -05:00 |
|
Al
|
86c7b7f3fe
|
[addresses] no longer normalizing slashes in boundary names for places that have multilingual names, etc.
|
2017-01-08 12:41:51 -05:00 |
|
Al
|
a6d94f998b
|
[addresses] stripping parentheticals in admin boundary names as sometimes cities in e.g. Switzerland are like Oberwil (ZG) in OSM
|
2017-01-08 03:43:22 -05:00 |
|
Al
|
828b67d4f7
|
[osm] adding some new training data for simple road names and their surrounding admin boundaries
|
2017-01-07 15:34:43 -05:00 |
|
Al
|
d51f9dbb0e
|
[addresses] stripping unit phrases from streets in OpenAddresses as well, return value wasn't getting used before
|
2017-01-06 10:19:08 -05:00 |
|
Al
|
cfdef1788c
|
[addresses] stripping unit from street using the libpostal dictionaries in all the address data sets. Happens surprisingly often in OpenStreetMap as well as OpenAddresses
|
2017-01-06 10:06:23 -05:00 |
|
Al
|
321f2034d2
|
[fix] unidata file
|
2017-01-05 04:24:33 -05:00 |
|
Al
|
25723fcea2
|
[transliteration] making the custom rules in transliteration less repetitious and accessible from elsewhere, removing string names for common transliterators and using constants
|
2017-01-05 04:06:51 -05:00 |
|
Al
|
de2dffa315
|
[addresses] adding Calle to purely numeric Spanish street names in OSM as well
|
2017-01-02 23:41:01 -05:00 |
|
Al
|
600b40d2f6
|
[transliteration] adding german-ascii transliteration to Estonian to handle umlauts (ä => ae, etc.)
|
2017-01-02 13:51:56 -05:00 |
|
Al
|
b2b7f6f155
|
[osm] add wikipedia:* to rail station exception
|
2017-01-02 13:13:42 -05:00 |
|
Al
|
400ea589ef
|
[normalize] add NORMALIZE_STRING_SIMPLE_LATIN_ASCII option to pynormalize
|
2017-01-02 02:08:54 -05:00 |
|
Al
|
2d077699e6
|
[places] adding is_in property to the set of tags for the places index. This may allow us to make more granular exceptions for node-based places that are actually suburbs but classified as {hamlet, village, locality, town}, etc. if the is_in contains a city that's also a boundary or nearby point
|
2016-12-29 14:04:13 -05:00 |
|
Al
|
21a2a7419a
|
[addresses] only add village as city component if no city can be found in the area
|
2016-12-29 13:41:05 -05:00 |
|
Al
|
f58ebbdf7f
|
[fix] var name
|
2016-12-28 14:37:00 -05:00 |
|
Al
|
7ee44a584b
|
[fix] genitive case for Russian/Ukrainian toponyms, not locative (#125)
|
2016-12-28 14:34:28 -05:00 |
|
Al
|
e6e4b28e43
|
[addresses] making the город/г. prefix apply to the Russian language rather than the country
|
2016-12-28 13:26:19 -05:00 |
|
Al
|
f995fdf9d2
|
[fix] default None
|
2016-12-28 05:09:15 -05:00 |
|
Al
|
3dc6a69bf5
|
[openaddresses] adding locative names in OpenAddresses as well, which contains some Ukraine data sets
|
2016-12-28 04:59:55 -05:00 |
|
Al
|
91013fe296
|
[fix] moving checks inside the add_locatives function, fixing float cast
|
2016-12-28 04:59:27 -05:00 |
|
Al
|
6f009fb8a6
|
[addresses] adding pymorphy2 for converting Russian and Ukrainian place names (sticking with state and state_district for the moment) to the locative case as mentioned in #125
|
2016-12-28 04:48:32 -05:00 |
|
Al
|
4344c5fdf3
|
[formatting] adding non-zero invert probabilities to all the former Soviet states. Other template insertions can still apply afterward for #125
|
2016-12-27 23:25:49 -05:00 |
|
Al
|
25e966411d
|
[formatting] adding the ability to invert the address template (line by line, preserving order within each line) with certain probabilities
|
2016-12-27 23:25:49 -05:00 |
|
Al
|
165056ccd8
|
[names] adding configurable prefix/suffix additions for boundary names
|
2016-12-27 20:32:23 -05:00 |
|