Commit Graph

157 Commits

Author SHA1 Message Date
Al
ec77a247fa [fix] just ignore records without the "name" tag 2016-10-19 13:36:15 -04:00
Al
61078eded9 [fix] checking for dictionary key 2016-10-19 13:34:13 -04:00
Al
c2b73307de [fix] parens 2016-10-19 13:29:56 -04:00
Al
f639151698 [osm] checking for non-admin_center nodes which are part of a lower admin level polygon with the same name 2016-10-19 13:27:38 -04:00
Al
e380567ac4 [osm] adding alt_place_names method which does hyphenation, de-hyphenation and abbreviated toponyms with/without hyphens 2016-10-19 02:19:09 -04:00
Al
98ac232eea [osm] hyphenating and de-hyphenating place names in places training data 2016-10-19 00:33:10 -04:00
Al
d34faf42b8 [osm] fix names with pipes in them 2016-10-17 02:32:25 -04:00
Al
6ff1024c02 [fix] null candidate languages 2016-10-07 19:49:32 -04:00
Al
169a3c3d70 [osm] drop postcode as well for address-only format 2016-10-07 01:10:16 -04:00
Al
0401a04adb [osm] add address-only formats (sans place tags) for every address as well to better deal handle incomplete queries where location is expected to be inferred by the geocoder, etc. 2016-10-07 00:59:52 -04:00
Al
a67efcffe4 [addresses] add new option to use city population to determine whether components should be dropped out 2016-10-05 18:16:25 -04:00
Al
66af532850 [osm] adding country-specific cleanups to OSM place training data 2016-10-05 17:13:13 -04:00
Al
faf418decb [languages] using country_and_languages method in OSM, neighborhoods and OpenAddresses 2016-10-05 02:49:55 -04:00
Al
85ae5d4a05 [fix] name 2016-08-19 23:38:33 -04:00
Al
7951044d74 [intersections] Abbreviating street names that are not base names with random probabilities 2016-08-19 23:27:29 -04:00
Al
42808c62e3 [fix] dictionary access 2016-08-19 16:02:36 -04:00
Al
41f715d6ee [intersections] Better handling of default languages in intersection queries 2016-08-19 15:59:58 -04:00
Al
a7118b40a7 [intersections] Allowing tags like name_1, etc. to make it into road name permutations for intersections 2016-08-19 13:12:02 -04:00
Al
0b2d3d965f [fix] using lat/lon from the node properties in intersections data 2016-08-19 12:23:08 -04:00
Al
688f103e80 [fix] languages 2016-08-18 02:24:34 -04:00
Al
e3ac3200b3 [fix] disambiguating languages using one of the default street names in intersections data 2016-08-18 02:05:13 -04:00
Al
328398813a [fix] itertools.combinations 2016-08-18 01:26:48 -04:00
Al
737cbf4457 [fix] reference before assignment 2016-08-18 01:24:30 -04:00
Al
b41ba7374b [intersections] intersections training data, using a Cartesian product of all names in the same language, including something like tiger:name_base 2016-08-18 01:19:14 -04:00
Al
701bcb1d79 [intersections] Using name cleanup on intersections, including tiger:name_base which sometimes has semicolon delimiters as well 2016-08-17 18:47:07 -04:00
Al
145af9331e [osm] build OSM training data for intersections using the JSON output from intersections.py rather having to compute each time 2016-08-17 18:11:55 -04:00
Al
7ff0cb2704 [fix] name and a few things for intersections data 2016-08-15 21:26:54 -04:00
Al
7ab6af4335 [fix] bounds 2016-08-15 12:01:22 -04:00
Al
060d3a1f86 [fix] var name 2016-08-15 11:18:00 -04:00
Al
29fc198aba [osm] giving parse_osm_number_range a parameter for max range and setting it to 1000 for postal codes e.g. for major cities that may have several hundred postal codes 2016-08-15 10:34:24 -04:00
Al
637baad629 [osm] Adding at least min_references entries for every selected postcode 2016-08-15 10:30:28 -04:00
Al
aa6b9cd858 [fix] var name for place tags coming from the admin rtree 2016-08-15 10:25:19 -04:00
Al
bc8acb196c [osm] Pulling valid postal codes out into a method 2016-08-13 01:49:26 -04:00
Al
b993e9a163 [fix] add Japanese-language variant if metro station is added 2016-08-06 21:17:14 -04:00
Al
39bd562d04 [addresses] only set language if we needed it for Japanese house_numbers 2016-08-06 21:06:01 -04:00
Al
e68fee7c68 [fix] null check 2016-08-06 20:39:28 -04:00
Al
374c46ada5 [fix] metro station properties 2016-08-06 19:56:13 -04:00
Al
195278cfea [osm] Reverse geocoding to metro station only for addresess in Japan 2016-08-06 19:50:18 -04:00
Al
964728a02d [fix] block phrases for Japanese and namespaced language handling in case Romaji is chosen before normalization 2016-08-06 14:50:39 -04:00
Al
445e8082c8 [addresses] Adding per-country overrides for address component dependencies 2016-08-06 02:36:47 -04:00
Al
813f29f299 [osm] Removing the call to normalize_place_names in place data formatting as we should be able to trust the places more than the addresses 2016-08-02 16:29:34 -04:00
Al
c40ad99ec7 [osm] removing postcode phrase from place training data and adding CLDR countries only after all the other normalizations 2016-08-02 14:52:12 -04:00
Al
5117fb21d3 [fix] access 2016-08-02 03:20:42 -04:00
Al
bd780d3424 [fix] typo 2016-08-02 03:19:22 -04:00
Al
c74d883344 [fix] unindent 2016-08-02 03:17:42 -04:00
Al
f29d043544 [places] Using all of the ideas that apply to places from address formatting for the places-only data set 2016-08-02 03:16:08 -04:00
Al
a1f0c1a3c9 [fix] import 2016-08-02 01:50:17 -04:00
Al
4c8b662648 [fix] block numbers 2016-08-01 14:36:28 -04:00
Al
2faffc81e7 [fix] import 2016-08-01 00:06:47 -04:00
Al
afbb79b81d [osm/parser] Making a much lower probability of generating sub-building components for named venues (usually on the ground floor, etc.) 2016-07-31 20:40:44 -04:00