Commit Graph

  • 2114b21399 [fix] A few anomalies in the Wikipedia/Wiktionary-generated given names Al 2015-07-21 16:07:28 -04:00
  • 3509b203f8 [gazetteers] Moving data out of the header file Al 2015-07-21 16:06:49 -04:00
  • 179918917a [fix] header guard and include Al 2015-07-21 15:38:45 -04:00
  • f99a90d64e [expansion] Generated data file for address expansions Al 2015-07-21 15:38:10 -04:00
  • 68a6d8ee33 [fix] return NULL from transliterator_read on failure Al 2015-07-21 00:58:01 -04:00
  • 9360ff2c4b [geodb] geodb_builder using new trie_get/set_data_at_index methods Al 2015-07-20 16:53:48 -04:00
  • 9374745140 [fix] var name and placement Al 2015-07-20 16:53:19 -04:00
  • 9f697e0256 [transliteration] transliterate now using the new trie_get_data_at_index API Al 2015-07-20 16:47:56 -04:00
  • 7f96726e82 [phrases] Adding trie_get_data/trie_set_data + at_index methods Al 2015-07-20 16:39:58 -04:00
  • b9771921fc [fix] Path joins in geodb_builder use new char_array methods Al 2015-07-20 16:31:43 -04:00
  • d55d505329 [phrases] trie_get_data and trie_set_data interface for simpler dictionary-style trie get/set Al 2015-07-20 16:29:48 -04:00
  • 7f67ed7dc0 [fix] less ambiguous variable name in the generated expansions data file Al 2015-07-20 02:58:26 -04:00
  • 96d20f8693 [dictionaries] Removing the convention of separating ideograms with space, tokenizer can accomplish the same thing Al 2015-07-20 02:50:35 -04:00
  • 3ff6526392 [dictionaries] Azerbaijani dictionaries Al 2015-07-20 02:29:36 -04:00
  • 21b915f090 [dictionaries] Bosnian dictionaries and updates to Croatian Al 2015-07-20 02:29:23 -04:00
  • c9280341b8 [languages] Adding Russian dictionaries to Georgia Al 2015-07-20 01:43:40 -04:00
  • 916465f994 [dictionaries] Georgian dictionaries Al 2015-07-20 01:42:45 -04:00
  • b925e7b9a2 [dictionaries] Sinhala dictionaries Al 2015-07-20 00:51:59 -04:00
  • bee9f7d5ec [languages] Audit of road sign languages Al 2015-07-20 00:29:36 -04:00
  • b415d79b10 [fix] space=>tabs Al 2015-07-19 22:26:28 -04:00
  • c5c9f4db81 [languages] Sinhala is the primary language for Sri Lanka, English dictionaries used Al 2015-07-19 22:24:45 -04:00
  • 0b741a353f [dictionaries] Icelandic dictionaries Al 2015-07-19 22:00:49 -04:00
  • 71b8db9c47 [fix] post office box for English Al 2015-07-19 16:33:04 -04:00
  • 5659a69897 [dictionaries] Adding Farsi word for sheikh Al 2015-07-19 15:55:57 -04:00
  • 3886920bcd [dictionaries] Urdu dictionaries Al 2015-07-19 15:55:37 -04:00
  • fee8503248 [dictionaries] Thai dictionaries Al 2015-07-19 15:46:25 -04:00
  • eaa9a81d76 [dictionaries] Vietnamese dictionaries Al 2015-07-19 04:04:42 -04:00
  • 7b20bd0aeb [dictionaries] Serbian dictionaries Al 2015-07-19 03:43:48 -04:00
  • 09f9796766 [numex] Belarusian numex Al 2015-07-19 03:26:51 -04:00
  • c3da167a53 [dictionaries] Belarusian dictionaries Al 2015-07-19 03:26:16 -04:00
  • b8b6e011d8 [dictionaries] Hebrew dictionaries Al 2015-07-19 03:26:02 -04:00
  • a9cea59aef [dictionaries] Filipino dictionaries Al 2015-07-19 03:25:32 -04:00
  • 101b040272 [dictionaries] Farsi dictionaries Al 2015-07-19 03:25:06 -04:00
  • e7f212d568 [dictionaries] Malay dictionaries Al 2015-07-19 03:24:56 -04:00
  • fa5b7bce33 [dictionaries] Indonesian dictionaries Al 2015-07-19 03:14:08 -04:00
  • 89dfe8b207 [numex] Turkish numex Al 2015-07-19 03:13:38 -04:00
  • f8adee9ed0 [dictionaries] Arabic dictionaries Al 2015-07-19 03:13:06 -04:00
  • ed52462239 [dictionaries] Turkish dictionaries Al 2015-07-19 03:12:49 -04:00
  • c8bfbe58ee [dictionaries] Bulgarian place names Al 2015-07-19 03:11:35 -04:00
  • 98c13d24fb [numex] Neuter ordinal suffixes for Russian Al 2015-07-19 03:10:54 -04:00
  • 43dc8ec010 [dictionaries] Russian dictionary additions Al 2015-07-19 03:10:29 -04:00
  • 0a100831c3 [fix] English street types Al 2015-07-19 03:00:53 -04:00
  • 3d348945e5 [languages] road sign languages for ambiguous countries Al 2015-07-19 02:56:33 -04:00
  • 35cb6542cd [dictionaries] Breton dictionaries Al 2015-07-17 03:16:11 -04:00
  • 0629171347 [dictionaries] Papiamento dictionaries for the ABC islands Al 2015-07-17 03:15:51 -04:00
  • 6a696249c4 [dictionaries] Scottish Gaelic dictionaries Al 2015-07-17 03:14:27 -04:00
  • 955ff13459 [dictionaries] Gaelic place names Al 2015-07-17 03:14:06 -04:00
  • 5cba747a93 [fix] variable name Al 2015-07-17 03:06:06 -04:00
  • 5e7bb54a5c [polygons] only add language polygons if there's one default language Al 2015-07-17 02:19:55 -04:00
  • 4a2cfe8e28 [fix] multiple languages comma-separated, not listed separately Al 2015-07-17 02:02:22 -04:00
  • 559d4ebc85 [fix] admin1 polygon exceptions were using the wrong field names Al 2015-07-17 01:50:43 -04:00
  • 0387f741b9 [polygons] Virgin Islands road signs are in Danish mostly Al 2015-07-17 01:26:23 -04:00
  • 9f451f9054 [fix] newline Al 2015-07-17 01:25:41 -04:00
  • 3613dd9683 [dictionaries] Dutch stopwords updates Al 2015-07-17 01:25:22 -04:00
  • 1594b32736 [dictionaries] Moving around some synonyms in English dictionaries and adding a few entries Al 2015-07-17 00:55:12 -04:00
  • 1d7247d7e1 [polygons] Adding Belgium regional languages Al 2015-07-17 00:53:25 -04:00
  • d5ac816066 [fix] import Al 2015-07-16 13:33:50 -04:00
  • 8899be6eef [osm] choosing the first default language for OSM training data, fixing way/relation offsets Al 2015-07-16 13:32:16 -04:00
  • eb97c99e24 [numex] Welsh numex Al 2015-07-16 04:44:59 -04:00
  • ed03f1e9dc [dictionaries] Welsh dictionaries Al 2015-07-16 04:44:48 -04:00
  • 20de039449 [fix] admin1 language exceptions Al 2015-07-16 04:44:24 -04:00
  • 06612b4685 [dictionaries] Removing saints for now Al 2015-07-16 03:51:32 -04:00
  • b9103a39fa [expansion] Moving filename=>dictionary type mapping to the Python generation script and validating there Al 2015-07-16 03:51:11 -04:00
  • 5f2be3022b [expansion] dictionary_type_t enum instead of uint64_t Al 2015-07-16 03:49:31 -04:00
  • f713c53993 [utils] Adding an option to char_array_add_joined to strip separators for path manipulation Al 2015-07-16 03:49:00 -04:00
  • f181c04e7a [expansion] expansion rule structs and Python script to generate rules from dictionaries tree. Note that a canonical_index of -1 indicates that a given phrase is the canonical (saves space) Al 2015-07-16 02:17:42 -04:00
  • 3d1d4d3673 [dictionaries] Edits to dictionaries after validation, addition to Spanish/Catalan Al 2015-07-16 02:15:38 -04:00
  • 5caa09e2d2 [fix] Occitan regions were using the qs_a1r_alt value instead of qs_a1r Al 2015-07-16 01:06:55 -04:00
  • 076c07e21f [fix] Add minor languages to the language set Al 2015-07-16 00:58:58 -04:00
  • 84d8860d98 [dictionaries] Occitan dictionaries (used for street names in parts of Southern France, Catalonia and Italy) Al 2015-07-16 00:41:41 -04:00
  • fbddaf5f15 [polygons] Adding Occitan speaking regions to regional exceptions Al 2015-07-16 00:34:17 -04:00
  • b0d272ed67 [numex] masculine/feminine ordinal indicators in Spanish, Portuguese and Italian numex ordinal abbreviations Al 2015-07-15 19:13:26 -04:00
  • e2c61cd2c0 [dictionaries] Adding accented characters to the older dictionaries as canonical forms Al 2015-07-15 19:12:00 -04:00
  • 1fe3c9b79b [polygons] Adding a return_all version of point_in_poly e.g. for regions like Navarra where we want to add a non-default Basque dictionary but still retain Spanish as the default from the national polygon Al 2015-07-15 14:33:49 -04:00
  • a8b2fb5b90 [tokenization] Regenerating scanner file Al 2015-07-14 18:16:24 -04:00
  • 43293d0ae3 [tokenization] Fixing a tokenization where mid-number characters appear in the middle of a word+numeric sequence e.g. Zigor,2 should be 3 separate tokens. Sequences like 35,37,39 are still treated as a single token for the moment. Al 2015-07-14 18:15:58 -04:00
  • d57f9df7ed [fix] regexes Al 2015-07-14 14:04:32 -04:00
  • d494963dcd [fix] lat/lon conversion in address formatting Al 2015-07-14 13:34:22 -04:00
  • a0f2ff1e2a [fix] adding encoding declaration Al 2015-07-13 21:09:18 -04:00
  • d15737b319 [osm] Validating lat/lon in OSM training data Al 2015-07-13 21:08:08 -04:00
  • 0c18a57c4e [fix] planet url no longer needed Al 2015-07-13 14:27:26 -04:00
  • e8348dde0e [osm] removing all the fetch/convert arguments from training data generator Al 2015-07-13 14:24:54 -04:00
  • 5e9e08f6b1 [fix] making fetch script executable Al 2015-07-13 14:19:24 -04:00
  • 465bcd46aa [fix] input file in OSM training data generator Al 2015-07-13 14:18:24 -04:00
  • 961606ac12 [fix] removing intermediate file in OSM fetch Al 2015-07-13 14:17:57 -04:00
  • 00b538c7d1 [fix] newline Al 2015-07-13 14:17:30 -04:00
  • 59bf23ae67 [osm] Planet admin bounds filter Al 2015-07-13 04:08:50 -04:00
  • 7c988fa717 [fix] imports Al 2015-07-13 01:50:42 -04:00
  • e603bad9f3 [fix] adding admin_level to the allowed properties list for language polygons Al 2015-07-13 01:49:54 -04:00
  • a9967ec9bd [numex] Regenerating numex file Al 2015-07-13 01:16:39 -04:00
  • 7cd740e2f5 [numex] A couple of fixes to various numex rules after testing Al 2015-07-13 01:16:20 -04:00
  • fcff210d77 [rtree] Language polygon index returns polygons from most specific admin level to least specific Al 2015-07-13 00:58:47 -04:00
  • 86fe289320 [numex] Re-generated numex data file Al 2015-07-13 00:56:48 -04:00
  • 95509cbe65 [numex] Fixing a few numex typos Al 2015-07-13 00:55:43 -04:00
  • 6b8b06cfdd [numex] Irish numex Al 2015-07-13 00:52:27 -04:00
  • f227deddde [dictionaries] Irish dictionaries, now support all official EU languages Al 2015-07-13 00:52:16 -04:00
  • f2d8e043eb [dictionaries] Additions to Maltese dictionaries Al 2015-07-13 00:51:55 -04:00
  • 2163c6a51f [dictionaries] Adding accents back to Catalan dictionaries Al 2015-07-13 00:25:28 -04:00
  • 8e4426c588 [numex] Maltese numex Al 2015-07-12 21:27:29 -04:00
  • d9c748f9b4 [dictionaries] Maltese dictionaries Al 2015-07-12 21:27:10 -04:00