[dictionaries] adding common Hindi tokens
Reviewed the Hindi n-grams list and identified a number of common items that could be added to the model.
24
resources/dictionaries/hi/personal_titles.txt
Normal file
@@ -0,0 +1,24 @@
baba
babu
bhagat
guru
jagirdar
maharaja|maharaj
mahatma|महात्मा
pandit
raja
rajarshi
rajkumar
rajkumari
rani
rishi
sahib
sant
sardar
senapati
shah
shrimati|smt|srimathi|श्रीमती
shri|shree|sri|श्री
sushri
swami
ustad
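
The entries above appear to use one line per title, with spelling variants separated by `|`. As a minimal sketch of how such a file might be consumed, the Python below maps every variant to the first form on its line. Treating the first form as canonical is an assumption, and the function name `parse_dictionary_lines` is hypothetical; neither comes from this commit.

```python
from typing import Dict, List


def parse_dictionary_lines(lines: List[str]) -> Dict[str, str]:
    """Map each variant to the first form on its line.

    Assumption: the first pipe-separated form is the canonical one.
    """
    variant_to_canonical: Dict[str, str] = {}
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        # Split on the pipe separator and drop empty fragments.
        forms = [form.strip() for form in line.split("|") if form.strip()]
        if not forms:
            continue  # line held only separators
        canonical = forms[0]
        for form in forms:
            # Keep the first mapping if a variant appears on several lines.
            variant_to_canonical.setdefault(form, canonical)
    return variant_to_canonical


# A few lines copied from the new file, used here as sample input.
sample = ["maharaja|maharaj", "shri|shree|sri|श्री", "guru"]
titles = parse_dictionary_lines(sample)
```

With this sketch, `titles["sri"]` and `titles["श्री"]` both resolve to `shri`, while a single-form line such as `guru` maps to itself.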