Commit Graph

5475 Commits

Author SHA1 Message Date
Al
fc250724e1 [numex] tercera=>3ra 2015-06-06 20:39:57 -04:00
Al
7c613a068f [dictionaries] English dictionary updates 2015-06-06 20:39:27 -04:00
Al
2856c2b401 [utils] string_utils category functions take a category instead of a codepoint 2015-06-05 16:55:21 -04:00
Al
3030dbe4be [fix] transliteration states 2015-06-05 00:09:29 -04:00
Al
e32916f3df [fix] closing file in numex table builder 2015-06-04 23:59:21 -04:00
Al
b244aa30f2 [numex] Setting numex_table to NULL during teardown, adding some logging 2015-06-04 23:57:52 -04:00
Al
3bd5172afd [numex] Adding NUMEX_NULL_RULE at the first index 2015-06-04 17:21:44 -04:00
Al
3400a59e1c [numex] adding a NUMEX_NULL_RULE 2015-06-04 17:21:16 -04:00
Al
95a4bb8e7c [numex] teardown in numex table builder 2015-06-04 17:20:26 -04:00
Al
114b728f96 [fix] var 2015-06-04 17:18:05 -04:00
Al
528dd05983 [numex] Adding utf8_is_number_or_letter 2015-06-04 14:49:12 -04:00
Al
ca746304e3 [utils] Adding a few methods to string_utils for finding utf8proc category groups 2015-06-04 13:20:14 -04:00
Al
eac7a296ba [numex] New numex data file including top 15 languages in OSM 2015-06-04 11:55:07 -04:00
Al
6470cbe467 [numex] Catalan and Chinese numex rules converted from RBNF, now covering top 15 languages in OSM addresses 2015-06-04 11:53:43 -04:00
Al
e2c8c08772 [numex] 1era for Spanish feminine ordinal indicator 2015-06-04 11:52:50 -04:00
Al
0429db3507 [numex] Adding ordinal indicator type for Japanese 2015-06-04 11:52:25 -04:00
Al
d98c535c52 [numex] Adding ordinal indicator to enum 2015-06-04 11:25:24 -04:00
Al
2d098fdab6 [numex] Adding ordinal_indicator rule type for CJK ordinals 2015-06-04 11:24:13 -04:00
Al
3cb8b2d297 [numex] trie builder adding a separate suffix-based namespace for looking up ordinal indicators 2015-06-04 03:17:03 -04:00
Al
7d3ef39463 [numex] struct/method changes for new ordinal indicators 2015-06-04 03:15:51 -04:00
Al
ab802bc361 [numex] Changes to existing numex rules files. Adding Dutch, Japanese, Polish, Danish, Swedish and Finnish numex rules (priority based on frequency in OpenStreetMap) 2015-06-04 03:13:39 -04:00
Al
65abde908b [numex] New numex data file 2015-06-04 03:10:00 -04:00
Al
4c49f63caf [numex] Adding categories to numex for plurals, etc. Ordinal indicators support multiple variants (primer in Spanish can be written as 1er or 1r for instance) and longer suffixes e.g. for tracking 1=>1st but 11=>11th 2015-06-04 03:09:39 -04:00
Al
3d95875a11 [phrases] trie_add_len 2015-06-04 02:41:48 -04:00
Al
fa784677f2 [phrases] trie_add_suffix_at_index method 2015-06-04 02:30:53 -04:00
Al
9bdf118423 [transliteration] Fix to transliteration in cases where the pre/post context doesn't match and we fall back to the no-context match 2015-06-03 22:58:29 -04:00
Al
48d2ca31c4 [transliteration] New ggenerated data file with the German/Scandinavian additions 2015-06-03 22:56:50 -04:00
Al
b2fe9d4db0 [transliteration] Adding uppercase umlauts and Scandinativan a-ring 2015-06-03 22:55:45 -04:00
Al
760714a234 [fix] warnings in transliterate.c 2015-06-03 19:29:35 -04:00
Al
7dcb4bf6f4 [numex] correct signature 2015-06-02 16:08:25 -04:00
Al
93d65d0186 [numex] numex table builder, fix to constant 2015-06-02 13:57:34 -04:00
Al
a44997c71c [fix] new generated numex data file 2015-06-02 13:45:06 -04:00
Al
2ea21dfffb [fix] constants 2015-06-02 13:44:25 -04:00
Al
2d5d854754 [fix] compilation/warnings 2015-06-02 13:43:55 -04:00
Al
208366af98 [fix] removing stopwords index 2015-06-02 12:43:48 -04:00
Al
49816382c1 [numex] New generated data file 2015-06-02 12:37:39 -04:00
Al
9d0d83bc14 [numex] adding stopword rules with the regular numex rules 2015-06-02 12:37:22 -04:00
Al
816a0408ab [numex] numex_rule.h 2015-06-02 12:30:56 -04:00
Al
8ef3a50b79 [numex] Initial generated numex data file 2015-06-02 12:28:28 -04:00
Al
4ad978f22c [numex] Using the new representation for generated data 2015-06-02 12:28:07 -04:00
Al
958c219b88 [utils] constants.h 2015-06-02 12:26:19 -04:00
Al
2dc870b3da [numex] Python script to generate numex data 2015-06-02 10:15:02 -04:00
Al
6b3d434c31 [fix] removing unnecessary definition 2015-06-01 17:13:57 -04:00
Al
9c935c9cc7 [fix] Base data dir path 2015-06-01 17:13:06 -04:00
Al
505456d9d2 [fix] removing unnecessary header 2015-06-01 17:12:33 -04:00
Al
080f382065 [numex] Removing concatenated property from language struct as all numeric spellouts might be concatenated 2015-06-01 17:12:07 -04:00
Al
a20b768237 [numex] Russian numex rules (a start at least, might need a native speaker to review the RBNF transform in CLDR) 2015-06-01 17:08:57 -04:00
Al
05ffbffb23 [numex] Latin numex rules i.e. Roman numerals, used for most languages 2015-06-01 17:08:04 -04:00
Al
028bb5a1aa [numex] German numex rules 2015-06-01 17:07:35 -04:00
Al
9bd75cee23 [numex] Romance language numex rules (Spanish, French, Italian, Portuguese) 2015-06-01 17:07:23 -04:00