Commit Graph

1756 Commits

Author SHA1 Message Date
Al
cd91068f0f [neighborhoods] fix neighborhoods index checks to include the borough points while still not making letting something like Santa Monica pass as a neighborhoods when it's a proper city 2016-12-13 02:30:24 -05:00
Al
d158751d92 [addresses] same rules for state_district apply to state, no alt_names etc. unless a city is present 2016-12-12 05:31:32 -05:00
Al
bf3e9749ca [osm] during place formatting, add point-based cities for any places/polygons that are smaller than cities e.g. suburb or city_district, use admin_center as the point for reverse geocoding if available (instead of representative_point() which can be expensive or centroid which can be inaccurate) 2016-12-12 05:29:39 -05:00
Al
da4fe37fb4 [addresses] option to add city points, no random keys for state_district if city or replacement is not present 2016-12-11 16:24:16 -05:00
Al
dfc88a47b2 [fix] typo 2016-12-11 02:46:03 -05:00
Al
e8abf44c16 [neighborhoods] check if there's no defined place-type before classifying a polygon as city_district 2016-12-11 02:44:02 -05:00
Al
fffc81a17a [fix] default value 2016-12-10 18:14:25 -05:00
Al
371198da3c [fix] typo 2016-12-10 18:14:11 -05:00
Al
91982528c6 [fix] normalize place names after adding admin boundaries as well 2016-12-10 18:07:41 -05:00
Al
34d3ae7e9e [addresses] fixing normalized_place_name so it deals with things like Washington DC where Washington DC may actually be one of the OSM names 2016-12-10 17:52:38 -05:00
Al
80ee34cc3a [text] adding normalization with whitespace 2016-12-10 17:50:53 -05:00
Al
4550f00f03 [fix] var name 2016-12-10 15:18:09 -05:00
Al
72771741c3 [fix] order 2016-12-10 15:16:35 -05:00
Al
8595d8da05 [addresses] don't add components to the trie that have the same normalized name as the given component 2016-12-10 15:12:40 -05:00
Al
bb12d0940e [fix] options/docs in osm address training 2016-12-10 13:45:37 -05:00
Al
ffc584f679 [states] adding all forms of the state abbreviation to the trie when doing place name normalization to handle the D.C./DC case 2016-12-10 13:45:22 -05:00
Al
5098599ed6 [addresses] remove Quattroshapes/GeoNames cities as they may have problematic names, and in any case we have point-based cities from OSM now 2016-12-10 02:08:40 -05:00
Al
18c5fd0855 [fix] check for non-None city 2016-12-10 01:23:06 -05:00
Al
dc022f8652 [osm] adding normalized_place_name to Quattroshapes city 2016-12-10 01:20:40 -05:00
Al
c7b1818695 [fix] imports 2016-12-09 19:53:17 -05:00
Al
973466bb13 [states] adding multiple state abbreviations for states that can have periods in the naem like D.C., D.F. in Mexico and Brasil, etc. 2016-12-09 19:48:59 -05:00
Al
675552d254 [addresses] using normalized tokens when stripping off compound place names for things like D.C. 2016-12-09 17:52:57 -05:00
Al
c0a468d7e8 [normalization] adding a normalize_token function and some token options for deleting periods 2016-12-09 17:46:26 -05:00
Al
8f30987bdf [fix] checking if building is a rail station 2016-12-09 02:57:47 -05:00
Al
e92963de50 [openaddresses] adding new counties from OpenAddresses, strip commas option for thousands separators 2016-12-09 01:57:21 -05:00
Al
b60b7c9009 [geoplanet] adding an index of state_districts, states, etc. that contain a city with an identical name. Alias to the city if it's the only contained place, otherwise don't allow the admin name without the city. 2016-12-08 17:00:29 -05:00
Al
640f70c05d [geoplanet] all_places table, specified dirs 2016-12-08 02:50:08 -05:00
Al
f9945103ba [addresses] if suburb/city_district is already listed, and we're finding the closest city by point rather than by boundary, use the closest actual city, not something smaller like a village/hamlet 2016-12-08 02:39:27 -05:00
Al
28d9ef12c0 [geoplanet] fixing geoplanet aliases insert warning 2016-12-08 02:31:10 -05:00
Al
763c86dcd4 [geoplanet] add County to the names of US counties outside of Louisiana and Alaska, add Parish in Lousiana 2016-12-08 02:30:37 -05:00
Al
7436d9693a [names] adding new name_affixes call to replace both prefixes/suffixes in one call, using in GeoPlanet training and the generic AddressComponents normalizations 2016-12-07 05:49:16 -05:00
Al
9386a999f6 [names] adding country-specific affixes and only normalizing the word City as a suffix in UK/Ireland 2016-12-07 05:37:25 -05:00
Al
3ff472c8cf [openaddresses] fixing house numbers with multiple consecutive hyphens 2016-12-06 22:50:14 -05:00
Al
e13787a6f6 [fix] var name again 2016-12-05 18:49:23 -05:00
Al
e1c6eff5e2 [fix] var 2016-12-05 18:46:49 -05:00
Al
da36b71829 [addresses] adding new places index in OSM and OpenAddresses training data 2016-12-05 18:36:17 -05:00
Al
628fecea59 [addresses] adding point-based city/equivalent reverse geocoding for places that don't have as many defined polygons in OSM 2016-12-05 18:30:46 -05:00
Al
f87f0df717 [places] adding generic place index for reverse geocoding to points 2016-12-05 02:05:54 -05:00
Al
e32c232c67 [localities] /planet-neighborhoods/planet-localities/ 2016-12-04 23:05:11 -05:00
Al
cca80b046c [abbreviation] fixing abbreviations within hyphenated phrases, particularly for prefix/suffix matches 2016-12-03 17:55:11 -05:00
Al
adab232674 [osm] don't include rail stations with no venue phrases (if there's a railway station at Foo, only include it if it's named "Foo Station", not just plain "Foo") 2016-12-01 02:03:38 -05:00
Al
ef243fbb18 [fix] var name 2016-11-25 13:41:07 -08:00
Al
cdbc102821 [boundaries] in addition to population, check if a city has an unambiguous Wikipedia 2016-11-25 13:36:49 -08:00
Al
87634a36e1 [openaddresses] for cases where city populations are not known (i.e. not getting boundaries from OSM, most of the sources in OpenAddresses), place-only records should have at least two identifying components. Helps when city names, etc. are highly ambiguous and need to be qualified 2016-11-25 00:56:38 -08:00
Al
5c3ccc3bc6 [places] better handling of population exceptions in places config 2016-11-25 00:38:49 -08:00
Al
e07c74f077 [fix] config 2016-11-24 03:57:52 -05:00
Al
46b7043dc7 [fix] typo 2016-11-24 03:50:11 -05:00
Al
fcf4717335 [openaddresses] adding city_replacements handling to OA formatter 2016-11-23 20:16:48 -05:00
Al
3dc2a922fb [addresses/languages] if there's only one default language and we don't have a road name or a unicode script to disambiguate, assume the default (e.g. English in the US unless there's a Spanish/French road name). Can affect things like state abbreviations 2016-11-22 18:27:54 -05:00
Al
ee6edbbd91 [countries] take first encountered country code instead of reversing the components (for cases like Puerto Rico, Hong Kong, etc.) 2016-11-22 11:55:41 -05:00