Commit Graph

157 Commits

Author SHA1 Message Date
Al
81c59e116a [countries] use ISO 3166 country name 5% of the time for general addresses, 10% of the time for OpenAddresses. Gives the parser examples of names like "Korea, Republic of" in #168 2017-03-25 19:41:59 -04:00
Al
b4437848c4 [fix] override_country_dir 2017-03-02 14:31:53 -05:00
Al
a5d8700df3 [openaddresses] use override_country_dir config option in OA address formatter 2017-03-01 13:52:07 -05:00
Al
f507f2bb3e [addresses] fix for Colombian house number formatting if the second regex group is not found 2017-02-25 23:24:06 -05:00
Al
64d0783e73 [addresses] Chinese and Colombian house number regex changes 2017-02-25 23:19:12 -05:00
Al
d0679294bf [openaddresses] adding positional args so OpenAddresses ingestion can be run only for specific countries, subdirs, or individual files. 2017-02-24 03:40:09 -05:00
Al
f76faafd8c [openaddresses] adding a few house number phrases as well in Colombia 2017-02-18 12:03:02 -08:00
Al
7cab675809 [openaddresses] adding random formatting to Colombian house numbers that match the {calle}-{building number} format 2017-02-18 11:28:47 -08:00
Al
146412f4f8 [openaddresses] adding country-specific validators and doing no validation on house numbers in Colombia 2017-02-18 11:04:02 -08:00
Al
9af4b1bd42 [openaddresses] fixing street requirement 2017-02-11 03:29:09 -05:00
Al
081f023d60 [fix] name 2017-02-11 02:10:59 -05:00
Al
6705ebaffd [fix] import 2017-02-11 02:09:31 -05:00
Al
c9ade4a7da [openaddresses] adding postal code country phrases in OpenAddresses as well 2017-02-11 01:48:54 -05:00
Al
c600f05f06 [openaddresses] adding Czech Republic to the street not required set 2017-02-04 15:30:46 -05:00
Al
85f03184d5 [openaddresses] moving postcode fixes before validation. Adding regex for validating Russian house numbers in the Ukraine 2017-02-02 11:21:00 -05:00
Al
12bc18f74b [openaddresses] fix Chinese house number validation 2017-01-28 02:03:19 -05:00
Al
2b349ef8a8 [fix] nevermind, needed to do the Spanish-language street names before validation (simple numeric names like \"8\" needs to be prefixed with \"Calle\" or they'll fail validation) 2017-01-28 01:08:10 -05:00
Al
2953759321 [openaddresses] formatting Chinese house number (with annex adding a second number potentially) and adding Spanish street names after the language is known by reverse geocoding 2017-01-28 01:01:26 -05:00
Al
c9417436f7 [openaddresses] allowing a single character boundary name in ideographic languages 2017-01-27 23:38:03 -05:00
Al
72881ad315 [fix] conditional + var name 2017-01-27 19:20:41 -05:00
Al
987609ee8e [fix] var name 2017-01-27 18:46:58 -05:00
Al
b25f5f26ae [openaddresses] not requiring street name in former Soviet countries (may be village + house_number). Only allowing address-only if street is present 2017-01-27 13:17:07 -05:00
Al
a760f96015 [fix] allow only house_number with no street in OpenAddresses Japan 2017-01-27 03:04:38 -05:00
Al
52a53cda1f [fix] postcode formatting in OpenAddresses 2017-01-25 01:41:27 -05:00
Al
bc748b6d62 [addresses] supplying country arg when stripping name affixes both for OSM place-based data sets (ways, localities) and OpenAddresses (shouldn't affect any of the countries currently in OA though) 2017-01-23 23:30:33 -05:00
Al
7c64a25389 [openaddresses] adding validator for Russian that allows the Moscow house number style 2017-01-20 02:54:07 -05:00
Al
54d4518960 [fix] sorted subdir configs 2017-01-19 02:29:20 -05:00
Al
a3ce019c32 [openaddresses] adding validator for Russian б/н house numbers 2017-01-18 20:08:25 -05:00
Al
49ffd4ea62 [openaddresses] doing config in sorted order, puts the US last, sorts the states, etc. so there's a consistent sense of progress 2017-01-18 19:32:02 -05:00
Al
072d7ed540 [openaddresses] reset language to config_language every time so language disambiguation gets used as needed 2017-01-18 18:38:53 -05:00
Al
5c53b84044 [fix] genitives in OpenAddresses where needed 2017-01-16 00:53:02 -05:00
Al
d51f9dbb0e [addresses] stripping unit phrases from streets in OpenAddresses as well, return value wasn't getting used before 2017-01-06 10:19:08 -05:00
Al
de2dffa315 [addresses] adding Calle to purely numeric Spanish street names in OSM as well 2017-01-02 23:41:01 -05:00
Al
3dc6a69bf5 [openaddresses] adding locative names in OpenAddresses as well, which contains some Ukraine data sets 2016-12-28 04:59:55 -05:00
Al
8abbb273b2 [osm] adding the excellent ftfy (https://github.com/LuminosoInsight/python-ftfy) to fix Mojibake, etc. in address components 2016-12-26 21:18:14 -05:00
Al
151287856d [openaddresses] fixing regexes for house number validation 2016-12-23 01:18:46 -05:00
Al
043dafc12a [openaddresses] add osm_neighborhood_overrides_city option for some countries that list what-we-otherwise-think-are-suburbs as the city 2016-12-22 17:50:21 -05:00
Al
7d195ca331 [fix] not allowing postal codes to pass validation if they are simply float zero 2016-12-22 02:59:54 -05:00
Al
cc4098fb05 [openaddresses] abbreviate states as well in OpenAddresses when full version is specified 2016-12-20 17:24:12 -05:00
Al
9e44fcb2bb [addresses] abbreviating neighborhoods/city_districts 2016-12-20 03:01:34 -05:00
Al
56ca37d1f3 [fix] openaddresses config reading 2016-12-19 02:18:24 -05:00
Al
86a8315b9d [openaddresses] adding new config option to OA config for aliasing fields based on a regex 2016-12-18 01:50:58 -05:00
Al
3c6ed7489c [openaddresses] adding regex replacement to remove "*" from any field 2016-12-16 17:09:41 -05:00
Al
ba96f68b62 [fix] openaddresses formatter 2016-12-16 14:22:15 -05:00
Al
da3240d5f6 [openaddresses] making field maps in OpenAddresses config a dictionary rather than a list to make inheritance easier 2016-12-16 06:54:36 -05:00
Al
83aab5a46a [openaddresses] adding option to map values for a particular field 2016-12-16 06:44:19 -05:00
Al
e92963de50 [openaddresses] adding new counties from OpenAddresses, strip commas option for thousands separators 2016-12-09 01:57:21 -05:00
Al
3ff472c8cf [openaddresses] fixing house numbers with multiple consecutive hyphens 2016-12-06 22:50:14 -05:00
Al
da36b71829 [addresses] adding new places index in OSM and OpenAddresses training data 2016-12-05 18:36:17 -05:00
Al
cdbc102821 [boundaries] in addition to population, check if a city has an unambiguous Wikipedia 2016-11-25 13:36:49 -08:00