Al
22c4e99ea0
[parser] As part of reading/tokenizing the address parser data set,
...
several copies of the same training example will be generated.
1. with only lowercasing
2. with simple Latin-ASCII normalization (no umlauts, only things that
are common to all languages)
3. basic UTF-8 normalizations (accent stripping)
4. language-specific Latin-ASCII transliteration (e.g. ü => ue in German)
This will apply both on the initial passes when building the phrase
gazetteers and during each iteration of training. In this way, only the
most basic normalizations like lowercasing need to be done at runtime
and it's possible to use only minimal normalizations like lowercasing.
May have a small effect on randomization as examples are created in a
deterministic order. However, this should not lead to cycles since the
base examples are shuffled, thus still satisfying the random permutation
requirement of an online/stochastic learning algorithm.
2016-12-02 13:09:03 -05:00
..
2015-04-12 15:14:01 -04:00
2015-08-07 17:15:39 -04:00
2015-10-26 18:50:22 -04:00
2015-11-29 11:28:31 -05:00
2015-10-12 13:42:10 -05:00
2015-03-11 17:47:57 -04:00
2015-12-28 17:19:50 -05:00
2015-12-23 19:33:09 -05:00
2016-11-27 00:56:48 -08:00
2016-11-27 00:56:48 -08:00
2016-11-27 00:56:48 -08:00
2016-11-28 10:47:15 -08:00
2015-08-07 02:43:44 -04:00
2016-07-21 17:04:57 -04:00
2016-12-02 13:09:03 -05:00
2016-07-28 17:13:58 -04:00
2016-07-21 17:04:57 -04:00
2016-11-30 15:58:03 -08:00
2016-07-28 15:19:48 -04:00
2016-12-02 13:09:03 -05:00
2015-09-29 13:06:34 -04:00
2015-09-23 19:37:55 -04:00
2016-11-30 15:42:09 -08:00
2016-01-05 16:43:17 -05:00
2015-09-10 10:26:24 -07:00
2015-09-10 10:26:24 -07:00
2016-02-15 17:34:00 -05:00
2015-08-07 17:28:21 -04:00
2015-08-07 02:43:44 -04:00
2016-11-21 14:04:34 -05:00
2016-08-06 00:40:38 -04:00
2015-09-19 01:39:58 -04:00
2016-01-08 00:46:41 -05:00
2016-01-05 16:39:50 -05:00
2016-08-06 00:40:01 -04:00
2016-08-06 00:40:01 -04:00
2016-01-02 15:39:49 -05:00
2016-01-02 15:39:49 -05:00
2016-07-21 17:04:57 -04:00
2016-07-21 17:04:57 -04:00
2015-10-11 00:45:26 -05:00
2015-10-11 00:45:26 -05:00
2016-07-21 17:04:57 -04:00
2016-02-29 22:36:00 -05:00
2015-10-09 15:37:10 -04:00
2015-07-08 17:02:59 -04:00
2016-02-25 14:53:31 -06:00
2016-07-21 17:04:57 -04:00
2015-10-09 15:37:10 -04:00
2015-10-09 15:37:10 -04:00
2015-10-09 15:36:14 -04:00
2015-10-09 15:36:14 -04:00
2015-12-17 12:25:05 -05:00
2015-12-17 12:25:05 -05:00
2016-01-27 03:18:53 -05:00
2016-01-17 21:11:37 -05:00
2016-01-17 21:11:37 -05:00
2016-01-27 03:18:53 -05:00
2016-08-06 00:40:01 -04:00
2016-08-06 00:40:01 -04:00
2016-08-06 00:40:01 -04:00
2016-01-17 21:37:45 -05:00
2016-01-09 03:42:57 -05:00
2016-01-10 03:05:36 -05:00
2016-05-04 23:08:06 -04:00
2016-11-27 00:56:48 -08:00
2016-07-21 17:04:57 -04:00
2016-08-06 00:40:01 -04:00
2016-08-06 00:40:01 -04:00
2016-08-06 00:40:01 -04:00
2016-08-06 00:40:01 -04:00
2016-08-06 00:40:01 -04:00
2016-08-06 00:40:01 -04:00
2016-02-15 17:34:00 -05:00
2016-08-06 00:43:24 -04:00
2016-08-06 00:40:01 -04:00
2016-01-09 01:43:25 -05:00
2016-01-08 00:48:11 -05:00
2015-07-25 18:41:02 -04:00
2015-08-07 02:43:44 -04:00
2016-11-30 10:09:33 -08:00
2016-08-21 19:45:32 -04:00
2016-11-30 15:58:55 -08:00
2015-08-09 01:00:57 -04:00
2016-03-02 16:28:28 -05:00
2016-03-21 18:15:53 -04:00
2015-09-16 21:13:05 -04:00
2015-07-08 17:02:59 -04:00
2016-07-21 17:04:57 -04:00
2015-09-20 20:10:47 -04:00
2016-07-21 17:04:57 -04:00
2015-12-01 11:25:28 -05:00
2015-12-01 11:25:28 -05:00
2016-11-30 10:15:12 -08:00
2016-08-06 00:40:01 -04:00
2016-08-06 00:40:01 -04:00
2016-08-06 00:40:01 -04:00
2016-08-06 00:40:01 -04:00
2016-08-06 00:40:01 -04:00
2016-11-30 10:09:33 -08:00
2016-11-30 10:09:33 -08:00
2016-01-05 16:40:35 -05:00
2016-07-21 17:04:57 -04:00
2015-10-04 18:27:04 -04:00
2016-08-21 19:43:09 -04:00
2016-08-21 19:42:10 -04:00
2016-08-21 19:41:11 -04:00
2015-05-29 16:54:05 -04:00
2016-08-21 19:41:11 -04:00
2016-08-21 19:41:34 -04:00
2016-11-27 01:00:25 -08:00
2016-11-27 01:00:25 -08:00
2016-01-05 16:43:17 -05:00
2016-01-05 16:43:17 -05:00
2016-03-02 16:05:32 -05:00
2016-02-29 22:36:00 -05:00
2016-01-17 20:53:44 -05:00
2016-01-17 20:53:44 -05:00
2016-01-11 16:39:21 -05:00
2015-09-23 04:04:38 -04:00
2016-08-06 00:31:16 -04:00
2016-08-05 23:02:52 -04:00