15 Commits

Author  SHA1  Message  Date
Al  e15036fcce  [fix] if there are street types that are not venue words and not vice versa, then call the venue invalid as a standalone term  2016-11-19 04:11:33 -05:00
Al  5140db536a  [phrases] additions to venue names dictionaries and a more restrictive version of street types dictionaries  2016-11-19 02:58:27 -05:00
Al  71be0fdfbc  [fix] sets  2016-11-19 02:30:40 -05:00
Al  b6f7b5b577  [fix] name  2016-11-19 01:38:15 -05:00
Al  1df1b60a9f  [phrases] adding extract_phrases method to gazetteers, which returns a set of gazetteer phrases found in a given string  2016-11-18 23:35:44 -05:00
Al  551cce8cb1  [fix] making a separate gazetteer for toponym abbreviations  2016-09-10 01:08:58 -04:00
Al  2e7f8f1ae7  [abbreviations] Adding toponyms gazetteer for probabilistically abbreviating things like Mount=>Mt, Saint=>St, Fort=>Ft in place names  2016-08-24 18:52:00 -04:00
Al  8b57a7acf2  [osm] abbreviate toponyms (qualifiers) with some probability so we get those versions in the model's phrase dictionaries  2016-08-22 20:55:35 -04:00
Al  4e4686fbfe  [gazetteers] Street and synonym dictionary for catching other abbreviations that occur in street names  2016-07-21 17:04:57 -04:00
Al  b50120f45c  [chains] Adding chains gazetteer  2016-07-21 17:04:57 -04:00
Al  771a360a85  [phrases] Using safe_encode/safe_decode as default trie serializer/deserializer  2016-07-21 17:04:57 -04:00
Al  d5dc34ec1d  [gazetteers] moving PHRASE to a token type  2016-07-21 17:04:57 -04:00
Al  e1f1e34dca  [expansion] Modifying the Python gazetteers to use new dictionaries API  2016-07-21 17:04:57 -04:00
Al  f3a9f4a257  [fix] removing init_gazetteers, doing it at the module level  2016-07-21 17:04:57 -04:00
Al  b22646ee30  [mv] Moving gazetteers into their own module  2016-01-22 03:15:56 -05:00