From 168132145241f763db6dbd64660f5aa2620e1d2b Mon Sep 17 00:00:00 2001
From: Peter Johnson
Date: Sun, 12 Aug 2018 19:22:21 +0200
Subject: [PATCH] minor typo

minor typo
---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 570ee0e1..a7f3a7b8 100644
--- a/README.md
+++ b/README.md
@@ -494,7 +494,7 @@ language (IX => 9) which occur in the names of many monarchs, popes, etc.
 
 - **Fast, accurate tokenization/lexing**: clocked at > 1M tokens / sec,
 implements the TR-29 spec for UTF8 word segmentation, tokenizes East Asian
-languages chracter by character instead of on whitespace.
+languages character by character instead of on whitespace.
 
 - **UTF8 normalization**: optionally decompose UTF8 to NFD normalization form,
 strips accent marks e.g. à => a and/or applies Latin-ASCII transliteration.