437 Commits

Author SHA1 Message Date
Al
ed03f1e9dc [dictionaries] Welsh dictionaries 2015-07-16 04:44:48 -04:00
Al
20de039449 [fix] admin1 language exceptions 2015-07-16 04:44:27 -04:00
Al
06612b4685 [dictionaries] Removing saints for now 2015-07-16 03:51:32 -04:00
Al
b9103a39fa [expansion] Moving filename=>dictionary type mapping to the Python generation script and validating there 2015-07-16 03:51:11 -04:00
Al
5f2be3022b [expansion] dictionary_type_t enum instead of uint64_t 2015-07-16 03:49:37 -04:00
Al
f713c53993 [utils] Adding an option to char_array_add_joined to strip separators for path manipulation 2015-07-16 03:49:00 -04:00
Al
f181c04e7a [expansion] expansion rule structs and Python script to generate rules from dictionaries tree. Note that a canonical_index of -1 indicates that a given phrase is the canonical form (saves space) 2015-07-16 02:49:53 -04:00
Al
3d1d4d3673 [dictionaries] Edits to dictionaries after validation, addition to Spanish/Catalan 2015-07-16 02:15:38 -04:00
Al
5caa09e2d2 [fix] Occitan regions were using the qs_a1r_alt value instead of qs_a1r 2015-07-16 01:06:55 -04:00
Al
076c07e21f [fix] Add minor languages to the language set 2015-07-16 00:58:58 -04:00
Al
84d8860d98 [dictionaries] Occitan dictionaries (used for street names in parts of Southern France, Catalonia and Italy) 2015-07-16 00:41:41 -04:00
Al
fbddaf5f15 [polygons] Adding Occitan speaking regions to regional exceptions 2015-07-16 00:34:23 -04:00
Al
b0d272ed67 [numex] masculine/feminine ordinal indicators in Spanish, Portuguese and Italian numex ordinal abbreviations 2015-07-15 19:13:26 -04:00
Al
e2c61cd2c0 [dictionaries] Adding accented characters to the older dictionaries as canonical forms 2015-07-15 19:12:00 -04:00
Al
1fe3c9b79b [polygons] Adding a return_all version of point_in_poly e.g. for regions like Navarra where we want to add a non-default Basque dictionary but still retain Spanish as the default from the national polygon 2015-07-15 14:34:20 -04:00
Al
a8b2fb5b90 [tokenization] Regenerating scanner file 2015-07-14 18:16:24 -04:00
Al
43293d0ae3 [tokenization] Fixing a tokenization where mid-number characters appear in the middle of a word+numeric sequence e.g. Zigor,2 should be 3 separate tokens. Sequences like 35,37,39 are still treated as a single token for the moment. 2015-07-14 18:15:58 -04:00
Al
d57f9df7ed [fix] regexes 2015-07-14 14:04:32 -04:00
Al
d494963dcd [fix] lat/lon conversion in address formatting 2015-07-14 13:34:22 -04:00
Al
a0f2ff1e2a [fix] adding encoding declaration 2015-07-13 21:09:18 -04:00
Al
d15737b319 [osm] Validating lat/lon in OSM training data 2015-07-13 21:08:08 -04:00
Al
0c18a57c4e [fix] planet url no longer needed 2015-07-13 14:27:26 -04:00
Al
e8348dde0e [osm] removing all the fetch/convert arguments from training data generator 2015-07-13 14:24:54 -04:00
Al
5e9e08f6b1 [fix] making fetch script executable 2015-07-13 14:19:24 -04:00
Al
465bcd46aa [fix] input file in OSM training data generator 2015-07-13 14:18:24 -04:00
Al
961606ac12 [fix] removing intermediate file in OSM fetch 2015-07-13 14:17:57 -04:00
Al
00b538c7d1 [fix] newline 2015-07-13 14:17:30 -04:00
Al
59bf23ae67 [osm] Planet admin bounds filter 2015-07-13 04:08:55 -04:00
Al
7c988fa717 [fix] imports 2015-07-13 01:50:42 -04:00
Al
e603bad9f3 [fix] adding admin_level to the allowed properties list for language polygons 2015-07-13 01:49:54 -04:00
Al
a9967ec9bd [numex] Regenerating numex file 2015-07-13 01:16:39 -04:00
Al
7cd740e2f5 [numex] A couple of fixes to various numex rules after testing 2015-07-13 01:16:20 -04:00
Al
fcff210d77 [rtree] Language polygon index returns polygons from most specific admin level to least specific 2015-07-13 00:58:47 -04:00
Al
86fe289320 [numex] Re-generated numex data file 2015-07-13 00:56:48 -04:00
Al
95509cbe65 [numex] Fixing a few numex typos 2015-07-13 00:55:43 -04:00
Al
6b8b06cfdd [numex] Irish numex 2015-07-13 00:52:27 -04:00
Al
f227deddde [dictionaries] Irish dictionaries, now support all official EU languages 2015-07-13 00:52:16 -04:00
Al
f2d8e043eb [dictionaries] Additions to Maltese dictionaries 2015-07-13 00:51:55 -04:00
Al
2163c6a51f [dictionaries] Adding accents back to Catalan dictionaries 2015-07-13 00:25:28 -04:00
Al
8e4426c588 [numex] Maltese numex 2015-07-12 21:27:29 -04:00
Al
d9c748f9b4 [dictionaries] Maltese dictionaries 2015-07-12 21:27:10 -04:00
Al
1b50bc4986 [numex] Croatian numex 2015-07-12 04:24:18 -04:00
Al
a0b0034491 [dictionaries] Croatian dictionaries 2015-07-12 04:24:04 -04:00
Al
ca6876165a [numex] Bulgarian numex 2015-07-12 03:38:37 -04:00
Al
55f1e6e391 [dictionaries] Bulgarian dictionaries 2015-07-12 03:38:28 -04:00
Al
d302a6ed65 [numex] Lithuanian numex 2015-07-12 03:23:46 -04:00
Al
bfd5155b4b [dictionaries] Lithuanian dictionaries 2015-07-12 03:07:27 -04:00
Al
3615fe3e51 [numex] Latvian numex 2015-07-12 03:07:15 -04:00
Al
fbe3458705 [dictionaries] Latvian dictionaries 2015-07-12 03:07:06 -04:00
Al
a912cd4a87 [numex] Slovenian numex 2015-07-12 02:33:51 -04:00