[dictionaries] Splitting English dictionaries into distinct level types (numbered, basement, mezzanine, etc.) for adding tokens automatically in address parser training data

This commit is contained in:
Al
2016-03-17 13:05:55 -04:00
parent 3879746f5a
commit 604b9b2cdd
4 changed files with 20 additions and 0 deletions

View File

@@ -0,0 +1 @@
basement|bsm|bsmt|bsmnt|basement|bsment

View File

@@ -0,0 +1 @@
mezzanine|mezz

View File

@@ -0,0 +1,6 @@
floor|fl|flr|/ f
level|lev|levl|lvel|lvl|l
platform|pf
podium|pd
rooftop|rt|rf
upper|upr

View File

@@ -0,0 +1,12 @@
ground|g|gd
ground floor|g|gd|gdfl|gd fl|gd/fl|gd / fl|gf|g / f
lobby
lower ground floor|lg|lgf|lgfl|l / g|l / gf|l / g / f|l / g / fl
lower level|lwr level|lower lvl|lwr lvl
lower left|lower l|lwr l
lower right|lower r|lwr r
rooftop|rt|rf|r / t
upper ground floor|ug|ugf|ugfl|ug / f|ug / fl
upper|uppr|upr
upper left|upper l|upr left|uppr left|upr l|uppr l
upper right|upper r|upr right|uppr right|upr r|uppr r