Commit Graph

4306 Commits

Author SHA1 Message Date
Al
f8664b0deb [formatting] making regex-based tests during insert_component optional. If exact_order=True, insert the given component directly before/after the reference component, otherwise for components that already exist in the template only need to care about relative position. Adding a method to determine if template language is important for a particular country/language pair. 2016-10-12 14:42:34 -04:00
Al
3db6b7fbf1 [dictionaries] adding new abbreviations for Sankt in German and Scandinavian languages 2016-10-11 18:05:11 -04:00
Al
2663b81670 [address_formatting] caching parsed templates from pystache yields about a 2.5x speedup per call, should shave off several hours of CPU time for large training sets 2016-10-11 15:36:49 -04:00
Al
2314acef1b [geoplanet] bypassing Québec as a county (just city and state) 2016-10-11 02:33:27 -04:00
Al
02fc172b5c [geoplanet] abbreviations for UK and NYC, fixing country codes for IM, GG and JE 2016-10-11 02:11:26 -04:00
Al
6ff1024c02 [fix] null candidate languages 2016-10-07 19:49:32 -04:00
Al
30074524d8 [fix] return empty list for languages in country_and_languages 2016-10-07 18:57:22 -04:00
Al
29698781cb [boundaries] making Kingston parish a city and only using the name Kingston, just so the parser doesn't have to disambiguate between references to the parish vs. the city, both referred to as Kingston 2016-10-07 18:52:46 -04:00
Al
ff7fec6ed1 [osm/polygons] need to include id/type in polygon properties now that they're getting added earlier in the pipeline 2016-10-07 01:21:02 -04:00
Al
169a3c3d70 [osm] drop postcode as well for address-only format 2016-10-07 01:10:16 -04:00
Al
4ff3f50e01 [fix] Dublin postcode formatting 2016-10-07 01:06:37 -04:00
Al
2e8b6e6a29 [fix] args 2016-10-07 01:03:22 -04:00
Al
0401a04adb [osm] add address-only formats (sans place tags) for every address as well to better handle incomplete queries where location is expected to be inferred by the geocoder, etc. 2016-10-07 00:59:52 -04:00
Al
14fa8a08c0 [fix][ci skip] attempting something less cluttered for the readme 2016-10-07 00:50:36 -04:00
Al
ed26d8e398 [geoplanet] a few more GeoPlanet fixes for LocalAdmins in LU and CH 2016-10-07 00:34:57 -04:00
Al
6ce05812fe [docs][ci skip] edit to intro/project description 2016-10-06 23:59:26 -04:00
Al
5f7bf6008a [fix][ci skip] cliffhanger, paragraph order 2016-10-06 23:49:42 -04:00
Al
5a571b1d7a [docs][ci skip] moving flags below intro paragraph in readme 2016-10-06 23:45:08 -04:00
Al
de99120c66 [fix][ci skip] alignment of flags on readme 2016-10-06 23:23:23 -04:00
Al
425bca6149 [docs][ci skip] two rows of flags on the readme 2016-10-06 23:12:10 -04:00
Al
906bd524c3 [fix][ci skip] removing comments 2016-10-06 23:00:49 -04:00
Al
a588230d13 Merge branch 'master' of https://github.com/openvenues/libpostal 2016-10-06 22:55:57 -04:00
Al
527b78ddf7 [docs][ci skip] adding more flags to the repo via span tags 2016-10-06 22:55:36 -04:00
Travis
04f8130c46 [auto][ci skip] Adding data files from Travis build #168 2016-10-07 00:46:48 +00:00
Al
8a8b4b6ee9 Merge branch 'Jeffrey04-ms-dictionary-expansion' 2016-10-06 20:31:03 -04:00
Al
03d0afb820 [fix] removing level types and given names from synonyms since they're already covered 2016-10-06 20:30:48 -04:00
Al
5f42e66f31 [fix] removing road/rd from the synonyms list for jalan as they're covered by the English dictionaries 2016-10-06 20:29:35 -04:00
Al
c4e147ed20 [fix] separating words that have different roots 2016-10-06 20:29:09 -04:00
Al
2c48acd680 [dictionaries] removing flat/rumah pangsa/pangsapuri from place_names, aliasing gim to gimnasium rather than the other way around, removing duplicate/mixed English + Malay line 2016-10-06 20:28:44 -04:00
Al
244dbbdd4a [fix] separating synonyms that are for different words 2016-10-06 20:27:15 -04:00
Al
ecd71ee10d [fix] var name 2016-10-06 15:36:51 -04:00
Al
c44e6280b4 [geoplanet] Setting postal codes connected to non-admin features to parent/grandparent features. Setting postal codes connected to unitary authorities in the UK to their respective towns 2016-10-06 14:07:01 -04:00
Al
aff12106c4 [geoplanet] adding Island place_type 2016-10-06 14:04:28 -04:00
Al
3d021c0a2c [boundaries] place=district for Ireland postal districts 2016-10-06 12:18:39 -04:00
Al
b1f386cb11 [fix] typo 2016-10-06 01:37:42 -04:00
Al
7d5ef87348 [fix] geoplanet zip file 2016-10-06 01:37:30 -04:00
Al
a67efcffe4 [addresses] add new option to use city population to determine whether components should be dropped out 2016-10-05 18:16:25 -04:00
Al
66af532850 [osm] adding country-specific cleanups to OSM place training data 2016-10-05 17:13:13 -04:00
Al
6b0186782d [openaddresses] doing country-specific cleanups in OpenAddresses 2016-10-05 17:07:29 -04:00
Al
182c0b3d26 [addresses] adding country-specific cleanups for Kingston (city=Kingston 12 split into city=Kingston, postcode=12) and Dublin (e.g. Dublin 3 specified various ways will be treated as a city_district, whereas Eirecodes are treated as postal codes) 2016-10-05 17:05:24 -04:00
Al
2798420fdc [osm] add boundary=postal_district to admin borders for Ireland 2016-10-05 15:26:16 -04:00
Al
4cea9ff54e [boundaries] map postal_district (Dublin 3, etc.) to city_district. Eire codes will be postal code 2016-10-05 15:25:13 -04:00
Al
918e1f62ba [names] remove "County" as an ignorable prefix 2016-10-05 15:03:18 -04:00
Al
7b3a59878c [fix] bracket 2016-10-05 14:27:24 -04:00
Al
fb6909970e [openaddresses] adding Colusa and Inyo counties in California 2016-10-05 13:43:43 -04:00
Al
5744fc5a3c [fix] import 2016-10-05 03:23:34 -04:00
Al
70a5ded45c [fix] encode element id 2016-10-05 03:14:19 -04:00
Al
432f9dd42e [fix] format of candidate_languages in the new OSM rtree 2016-10-05 03:12:07 -04:00
Al
bb32253689 [fix] args 2016-10-05 02:54:52 -04:00
Al
faf418decb [languages] using country_and_languages method in OSM, neighborhoods and OpenAddresses 2016-10-05 02:49:55 -04:00