Commit Graph

1938 Commits

Author SHA1 Message Date
Al
f0dfd7850c [fix] ignore punctuation in strip_components 2017-02-11 01:00:37 -05:00
Al
f07d93df2c [fix] omitted line 2017-02-11 00:57:47 -05:00
Al
ffc12ec5ab [osm] add new method in OSM formatting to extract one or more expanded postal codes from an addr:postcode tag, using the new country-specific rules 2017-02-11 00:53:52 -05:00
Al
bbcb6444c8 [addresses] add strip_components method which simply removes the names of OSM components from a string (for e.g. postal codes) 2017-02-11 00:07:55 -05:00
Al
4e1d7d9373 [osm] use new postal codes module in OSM formatting 2017-02-10 23:56:23 -05:00
Al
9022fb9149 [places] use country.lower() 2017-02-10 23:54:43 -05:00
Al
a0d674274a [neighborhoods] immutable data structures when loading from JSON 2017-02-10 23:54:24 -05:00
Al
293587bae9 [addresses] adding new config for postal codes around the world. Allows appending the ISO alpha-2 country code to the beginning of the postcode as in e.g. SI-1000 (only used if the postcode begins with a digit). This system was used for postal codes in continental Europe as a recommendation from the CEPT. Now 7 member states still use it, so in those countries add the country-code with higher probability. The config also contains the license plate codes for countries where e.g. L-1234 might be used instead of LU-1234. Allows configuring in which countries postcodes should be validated using Google's per-country validation regexes (and the ability to override with a custom regex), and in which countries other admin component names should be stripped. 2017-02-10 23:53:50 -05:00
Al
7a360f4211 [osm] addr:postcode can be all over the place in OSM. Start with postcodes containing commas or semicolons. If addr:postcode (on address of building) contains either, iterate over the values and pick the first one that matches a postcode validation regex for that country 2017-02-08 16:13:29 -05:00
Al
dbf7242ea0 [fix] /cls/self/ 2017-02-04 19:12:49 -05:00
Al
c600f05f06 [openaddresses] adding Czech Republic to the street not required set 2017-02-04 15:30:46 -05:00
Al
0169448a4d [addresses] adding Central European city district regexes (e.g. Praha 1, Budapest IV, etc.) to country-specific cleanup 2017-02-03 20:54:23 -05:00
Al
85f03184d5 [openaddresses] moving postcode fixes before validation. Adding regex for validating Russian house numbers in the Ukraine 2017-02-02 11:21:00 -05:00
Al
12bc18f74b [openaddresses] fix Chinese house number validation 2017-01-28 02:03:19 -05:00
Al
2b349ef8a8 [fix] nevermind, needed to do the Spanish-language street names before validation (simple numeric names like \"8\" needs to be prefixed with \"Calle\" or they'll fail validation) 2017-01-28 01:08:10 -05:00
Al
2953759321 [openaddresses] formatting Chinese house number (with annex adding a second number potentially) and adding Spanish street names after the language is known by reverse geocoding 2017-01-28 01:01:26 -05:00
Al
c9417436f7 [openaddresses] allowing a single character boundary name in ideographic languages 2017-01-27 23:38:03 -05:00
Al
72881ad315 [fix] conditional + var name 2017-01-27 19:20:41 -05:00
Al
987609ee8e [fix] var name 2017-01-27 18:46:58 -05:00
Al
cd1875d077 [fix] import 2017-01-27 18:35:43 -05:00
Al
01d6d47b08 [osm] removing addr:place mapping to road as it's usually a village in post-Soviet states, etc. Can handle it down the road 2017-01-27 13:54:08 -05:00
Al
11345bf2bf [osm] using new constants in OSM formatting as well 2017-01-27 13:53:00 -05:00
Al
b25f5f26ae [openaddresses] not requiring street name in former Soviet countries (may be village + house_number). Only allowing address-only if street is present 2017-01-27 13:17:07 -05:00
Al
82fb5c1dca [countries] moving country constants to a separate module 2017-01-27 13:15:36 -05:00
Al
a760f96015 [fix] allow only house_number with no street in OpenAddresses Japan 2017-01-27 03:04:38 -05:00
Al
52a53cda1f [fix] postcode formatting in OpenAddresses 2017-01-25 01:41:27 -05:00
Al
287d2f4048 [fix] leading zeros on numeric phrases 2017-01-25 01:40:20 -05:00
Al
bc748b6d62 [addresses] supplying country arg when stripping name affixes both for OSM place-based data sets (ways, localities) and OpenAddresses (shouldn't affect any of the countries currently in OA though) 2017-01-23 23:30:33 -05:00
Al
c36611c060 [addresses] let containing components include all boundaries, not just those that are larger than the current boundary (affects cases like Buenos Aires where the city has a lower admin level than its districts, so would be subject to the boundary config's contained_by override) 2017-01-23 10:48:08 -05:00
Al
7c64a25389 [openaddresses] adding validator for Russian that allows the Moscow house number style 2017-01-20 02:54:07 -05:00
Al
b6aa05ee0d [formatting] fixed a template insertion bug 2017-01-19 03:26:10 -05:00
Al
110665651c [fix] existing cleanup_street_name method 2017-01-19 02:40:18 -05:00
Al
a931c5ddc9 [osm] checking for valid street names in OSM street-only training data so e.g. the street name is not just a simple number like "831" 2017-01-19 02:34:29 -05:00
Al
54d4518960 [fix] sorted subdir configs 2017-01-19 02:29:20 -05:00
Al
a3ce019c32 [openaddresses] adding validator for Russian б/н house numbers 2017-01-18 20:08:25 -05:00
Al
49ffd4ea62 [openaddresses] doing config in sorted order, puts the US last, sorts the states, etc. so there's a consistent sense of progress 2017-01-18 19:32:02 -05:00
Al
072d7ed540 [openaddresses] reset language to config_language every time so language disambiguation gets used as needed 2017-01-18 18:38:53 -05:00
Al
05568194aa [fix] var initialization II 2017-01-18 01:54:18 -05:00
Al
b19ab0ae48 [fix] var initialization 2017-01-18 01:48:02 -05:00
Al
d94fda4d94 [fix] using tail -n+2 in geoplanet script for Linux 2017-01-17 17:32:39 -05:00
Al
d498fa893c [fix] name 2017-01-16 22:15:25 -05:00
Al
8566cb4054 [addresses] refactoring place component cleanup into a method that can be reused with the place and ways training data 2017-01-16 20:43:55 -05:00
Al
024a6a40b1 [addresses] refactoring place dropout into its own method 2017-01-16 19:35:16 -05:00
Al
35dbce59d2 [osm] base case for default_language, applying the ways/relations requirement again as the nodes are mostly motorway_junction and can often be just a city name, etc. 2017-01-16 19:10:27 -05:00
Al
96a98fc63c [fix] var name II 2017-01-16 18:57:29 -05:00
Al
582d042e95 [fix] var name 2017-01-16 18:56:20 -05:00
Al
b28728b017 [fix] tuple 2017-01-16 18:53:40 -05:00
Al
42b0a4cf68 [fix] var name 2017-01-16 18:46:08 -05:00
Al
4902e88b81 [fix] formatted OSM ways training data should use nodes as well as ways/relations 2017-01-16 18:39:53 -05:00
Al
449154d624 [fix] arg 2017-01-16 15:34:38 -05:00