Commit Graph

35 Commits

Author SHA1 Message Date
Al
0738a57caa [fix] restoring ctype.h include 2015-08-07 01:52:08 -04:00
Al
d7ebcd046e [fix] includes 2015-08-07 01:00:26 -04:00
Al
3178eda501 [utils] string_contains_hyphen method 2015-08-02 14:35:18 -06:00
Al
7aee159c0c [utils] string_tree_num_tokens 2015-07-27 12:36:34 -04:00
Al
a67ec44a08 [utils] cstring_array_terminate, moving msgpack_utils to separate file 2015-07-25 18:41:02 -04:00
Al
e549e76806 [utils] string_tree_iterator_foreach_token 2015-07-25 13:49:02 -04:00
Al
2adaf475c2 [utils] cstring_array (contiguous) to array of malloc'd strings 2015-07-25 12:14:01 -04:00
Al
96538469dd [utils] Adding a cstring_array_foreach macro 2015-07-23 15:57:12 -04:00
Al
f713c53993 [utils] Adding an option to char_array_add_joined to strip separators for path manipulation 2015-07-16 03:49:00 -04:00
Al
b76f9e47d1 [utils] max string size for int8_t and int16_t 2015-07-08 16:46:12 -04:00
Al
d7f73e62f1 [utils] Adding cstring_array_clear method 2015-07-06 12:48:26 -04:00
Al
4fd4fa7dca [fix] moving int string size constants to string_utils.h 2015-07-02 17:50:09 -04:00
Al
b58877ec6c [utils] string_is_lower/string_is_upper method 2015-07-01 14:49:22 -04:00
Al
a5dacf3d2b [utils] Adding method to get a particular token alternative from a string tree 2015-06-28 15:15:29 -04:00
Al
bcee9832b3 [utils] cstring_array_get_token=>cstring_array_get_string 2015-06-25 10:05:35 -04:00
Al
8520df96c8 [utils] utf8 comparison can handle a non-valid UTF-8 sequence e.g. for trie suffix comparison where we may be in the middle of a multi-byte character. Adding a standard utf8_common_prefix method 2015-06-12 16:11:40 -04:00
Al
ab5ea6d791 [utils] Common prefix-style return value instead of a utf8 strcmp 2015-06-11 10:59:51 -04:00
Al
aad5f3edd3 [utils] UTF-8 lowercasing and string comparison, including a version which ignores dashes/spaces 2015-06-10 18:27:14 -04:00
Al
81be8e771e [numex] regen data file. utf8_is_hyphen requires a character, all other methods use category 2015-06-08 21:32:38 -04:00
Al
06835d5c37 [utils] string_utils category functions take a category instead of a codepoint 2015-06-06 20:41:07 -04:00
Al
528dd05983 [numex] Adding utf8_is_number_or_letter 2015-06-04 14:49:12 -04:00
Al
ca746304e3 [utils] Adding a few methods to string_utils for finding utf8proc category groups 2015-06-04 13:20:14 -04:00
Al
8ac8f83b7f [utils] changing signature of utf8proc_iterate_reversed so it takes the same arguments as utf8proc_iterate for function pointer purposes 2015-05-25 15:35:28 -04:00
Al
9d309ca9d3 [fix] moving constant 2015-05-18 14:25:21 -04:00
Al
58571f70cc [utils] adding a boolean flag on string tree iterators for single path trees 2015-05-18 13:57:11 -04:00
Al
e8fdd4564d [utils] adding string_tree for listing sets of token alternatives and string_tree_iterator to generate permutations over the strings, needed for transliteration and ambiguous address elements/place names 2015-05-16 23:16:10 -04:00
Al
e21873635c [utils] Using token offsets to calculate lengths for contiguous string arrays, inlining a few functions 2015-04-15 20:17:03 -04:00
Al
4729dfe178 [utils] string_[rl]strip => string_[rl]trim, removing warning about allocation 2015-04-06 02:19:19 -04:00
Al
198e51b8a3 [utils] more/better char_array methods 2015-04-05 22:01:46 -04:00
Al
5f3d74de18 [fix] contiguous string array 2015-04-03 11:22:50 -04:00
Al
c81aa72254 [utils] a few changes to contiguous string arrays 2015-04-01 19:02:11 -04:00
Al
1ac4438e39 [utils] More consistent naming in string_utils 2015-03-27 21:12:08 -04:00
Al
70195fffd5 [utils] new methods on string_utils for better dynamic strings which retains the benefits of sds without having to worry about the pointer changing, renaming contiguous string array methods to something more succinct 2015-03-27 20:55:36 -04:00
Al
3ed5795cff [fix] fixing some formatting 2015-03-03 12:54:27 -05:00
Al
5216aba1b6 [utils] string utils, file utils, contiguous arrays of strings used for storing tokenized strings, klib for generic hashtables and vectors, antirez's sds for certain types of string building, utf8proc for iterating over utf-8 strings and unicode normalization 2015-03-03 12:33:13 -05:00