[dictionaries] Removing the convention of separating ideograms with space, tokenizer can accomplish the same thing

This commit is contained in:
Al
2015-07-20 02:50:35 -04:00
parent 3ff6526392
commit 96d20f8693
11 changed files with 58 additions and 60 deletions

View File

@@ -6,15 +6,15 @@ dōng|dong
nán|nan
西
xī|xi
西
西北
xīběi|xibei
xī běi|xi bei
西
西南
xīnán|xinan
xī nán|xi nan
东北
dōngběi|dongbei
dōng běi|dong bei
东南
dōngnán|dongnan
dōng nán|dong nan

View File

@@ -17,5 +17,5 @@ lǐ|li
cūn|cun
lín|lin
邮编
yóubiān|youbian

View File

@@ -1,28 +1,28 @@
村道
cūn dào|cun dao|cūndào|cundao
大道
dàdào|dadao
大街
dàjiē|dajie
dào|dao
大院
dà yuàn|dayuan
国道
guódào|guodao
胡同
hútòng|hutong
jiē|jie
lù|lu
省道
shěng dào|sheng dao|shěngdào|shengdao
县道
xiàn dào|xian dao|xiàndào|xiandao
xiàng|xiang
乡道
xiāng dào|xiang dao|xiāngdào|xiangdao
duàn|duan