[polygons/osm] Adding a unified neighborhood reverse geocoder incorporating Zetashapes, OSM and Quattroshapes. Uses the new Soft TFIDF implementation to approximately match OSM names to Quattroshapes/Zetashapes names and geohash indices for more coarse point-in-polygon tests (OSM neighborhoods are stored as points not polygons, so need to match with a geometry from the other sources)
This commit is contained in:
@@ -70,7 +70,7 @@ class NameDeduper(object):
|
||||
tokens2 = cls.content_tokens(s2)
|
||||
|
||||
if not cls.possible_match(tokens1, tokens2):
|
||||
return max(cls.dupe_threshold - 0.1, 0.0)
|
||||
return 0.0
|
||||
|
||||
tokens1_only = [t for t, c in tokens1]
|
||||
tokens2_only = [t for t, c in tokens2]
|
||||
@@ -87,7 +87,7 @@ class NameDeduper(object):
|
||||
tokens2 = cls.content_tokens(s2)
|
||||
|
||||
if not cls.possible_match(tokens1, tokens2):
|
||||
return max(cls.dupe_threshold - 0.1, 0.0)
|
||||
return 0.0
|
||||
|
||||
tokens1_only = [t for t, c in tokens1]
|
||||
tokens2_only = [t for t, c in tokens2]
|
||||
|
||||
Reference in New Issue
Block a user