Commit Graph

447 Commits

Author SHA1 Message Date
Al
4a2cfe8e28 [fix] multiple languages comma-separated, not listed separately 2015-07-17 02:02:22 -04:00
Al
559d4ebc85 [fix] admin1 polygon exceptions were using the wrong field names 2015-07-17 01:51:09 -04:00
Al
0387f741b9 [polygons] Virgin Islands road signs are in Danish mostly 2015-07-17 01:26:23 -04:00
Al
9f451f9054 [fix] newline 2015-07-17 01:25:41 -04:00
Al
3613dd9683 [dictionaries] Dutch stopwords updates 2015-07-17 01:25:22 -04:00
Al
1594b32736 [dictionaries] Moving around some synonyms in English dictionaries and adding a few entries 2015-07-17 00:55:12 -04:00
Al
1d7247d7e1 [polygons] Adding Belgium regional languages 2015-07-17 00:53:25 -04:00
Al
d5ac816066 [fix] import 2015-07-16 13:33:50 -04:00
Al
8899be6eef [osm] choosing the first default language for OSM training data, fixing way/relation offsets 2015-07-16 13:32:16 -04:00
Al
eb97c99e24 [numex] Welsh numex 2015-07-16 04:44:59 -04:00
Al
ed03f1e9dc [dictionaries] Welsh dictionaries 2015-07-16 04:44:48 -04:00
Al
20de039449 [fix] admin1 language exceptions 2015-07-16 04:44:27 -04:00
Al
06612b4685 [dictionaries] Removing saints for now 2015-07-16 03:51:32 -04:00
Al
b9103a39fa [expansion] Moving filename=>dictionary type mapping to the Python generation script and validating there 2015-07-16 03:51:11 -04:00
Al
5f2be3022b [expansion] dictionary_type_t enum instead of uint64_t 2015-07-16 03:49:37 -04:00
Al
f713c53993 [utils] Adding an option to char_array_add_joined to strip separators for path manipulation 2015-07-16 03:49:00 -04:00
Al
f181c04e7a [expansion] expansion rule structs and Python script to generate rules from dictionaries tree. Note that a canonical_index of -1 indicates that a given phrase is the canonical (saves space) 2015-07-16 02:49:53 -04:00
Al
3d1d4d3673 [dictionaries] Edits to dictionaries after validation, addition to Spanish/Catalan 2015-07-16 02:15:38 -04:00
Al
5caa09e2d2 [fix] Occitan regions were using the qs_a1r_alt value instead of qs_a1r 2015-07-16 01:06:55 -04:00
Al
076c07e21f [fix] Add minor languages to the language set 2015-07-16 00:58:58 -04:00
Al
84d8860d98 [dictionaries] Occitan dictionaries (used for street names in parts of Southern France, Catalonia and Italy) 2015-07-16 00:41:41 -04:00
Al
fbddaf5f15 [polygons] Adding Occitan speaking regions to regional exceptions 2015-07-16 00:34:23 -04:00
Al
b0d272ed67 [numex] masculine/feminine ordinal indicators in Spanish, Portuguese and Italian numex ordinal abbreviations 2015-07-15 19:13:26 -04:00
Al
e2c61cd2c0 [dictionaries] Adding accented characters to the older dictionaries as canonical forms 2015-07-15 19:12:00 -04:00
Al
1fe3c9b79b [polygons] Adding a return_all version of point_in_poly e.g. for regions like Navarra where we want to add a non-default Basque dictionary but still retain Spanish as the default from the national polygon 2015-07-15 14:34:20 -04:00
Al
a8b2fb5b90 [tokenization] Regenerating scanner file 2015-07-14 18:16:24 -04:00
Al
43293d0ae3 [tokenization] Fixing a tokenization where mid-number characters appear in the middle of a word+numeric sequence e.g. Zigor,2 should be 3 separate tokens. Sequences like 35,37,39 are still treated as a single token for the moment. 2015-07-14 18:15:58 -04:00
Al
d57f9df7ed [fix] regexes 2015-07-14 14:04:32 -04:00
Al
d494963dcd [fix] lat/lon conversion in address formatting 2015-07-14 13:34:22 -04:00
Al
a0f2ff1e2a [fix] adding encoding declaration 2015-07-13 21:09:18 -04:00
Al
d15737b319 [osm] Validating lat/lon in OSM training data 2015-07-13 21:08:08 -04:00
Al
0c18a57c4e [fix] planet url no longer needed 2015-07-13 14:27:26 -04:00
Al
e8348dde0e [osm] removing all the fetch/convert arguments from training data generator 2015-07-13 14:24:54 -04:00
Al
5e9e08f6b1 [fix] making fetch script executable 2015-07-13 14:19:24 -04:00
Al
465bcd46aa [fix] input file in OSM training data generator 2015-07-13 14:18:24 -04:00
Al
961606ac12 [fix] removing intermediate file in OSM fetch 2015-07-13 14:17:57 -04:00
Al
00b538c7d1 [fix] newline 2015-07-13 14:17:30 -04:00
Al
59bf23ae67 [osm] Planet admin bounds filter 2015-07-13 04:08:55 -04:00
Al
7c988fa717 [fix] imports 2015-07-13 01:50:42 -04:00
Al
e603bad9f3 [fix] adding admin_level to the allowed properties list for language polygons 2015-07-13 01:49:54 -04:00
Al
a9967ec9bd [numex] Regenerating numex file 2015-07-13 01:16:39 -04:00
Al
7cd740e2f5 [numex] A couple of fixes to various numex rules after testing 2015-07-13 01:16:20 -04:00
Al
fcff210d77 [rtree] Language polygon index returns polygons from most specific admin level to least specific 2015-07-13 00:58:47 -04:00
Al
86fe289320 [numex] Re-generated numex data file 2015-07-13 00:56:48 -04:00
Al
95509cbe65 [numex] Fixing a few numex typos 2015-07-13 00:55:43 -04:00
Al
6b8b06cfdd [numex] Irish numex 2015-07-13 00:52:27 -04:00
Al
f227deddde [dictionaries] Irish dictionaries, now support all official EU languages 2015-07-13 00:52:16 -04:00
Al
f2d8e043eb [dictionaries] Additions to Maltese dictionariesg 2015-07-13 00:51:55 -04:00
Al
2163c6a51f dictionaries] Adding accents back to Catalan dictionaries 2015-07-13 00:25:28 -04:00
Al
8e4426c588 [numex] Maltese numex 2015-07-12 21:27:29 -04:00