libpostal/scripts/geodata/addresses at commit 535453f77d7ed462dd89720ba9b8f69355f36d5a
Al
535453f77d
[addresses] Adding ability to randomly append relative/cardinal directions
2016-07-21 17:04:57 -04:00
__init__.py
[addresses] address config class for general sampling of forms specified in the address configs (default/alternatives to choose a phrase, canonical/abbreviated/sample to choose an abbreviation or surface form for that phrase)
2016-07-21 17:04:57 -04:00
config.py
[addresses] address config class for general sampling of forms specified in the address configs (default/alternatives to choose a phrase, canonical/abbreviated/sample to choose an abbreviation or surface form for that phrase)
2016-07-21 17:04:57 -04:00
conjunctions.py
[addresses] conjunction class for building phrases like "5th and 6th" or "Units 1 & 2" across languages using the address configs
2016-07-21 17:04:57 -04:00
directions.py
[addresses] Adding ability to randomly append relative/cardinal directions
2016-07-21 17:04:57 -04:00
floors.py
[addresses] generator for floor numbers as well as special aliases like basement, mezzanine, etc. using the address configs
2016-07-21 17:04:57 -04:00
numbering.py
[addresses] Adding base class for numeric phrases (appended to a number using numeric/numeric_affix), using probability 1.0 if only one of numeric/numeric_affix/ordinal is specified
2016-07-21 17:04:57 -04:00
po_box.py
[addresses] PO Box phrase generator
2016-07-21 17:04:57 -04:00
sampling.py
[addresses] utilities for sampling from an arbitrary discrete distribution, building cumulative distributions, and sampling from a Zipfian distribution, which seems to be a reasonable way of generating plausible apartment/floor numbers when the height/number of units is unknown. Picking a letter uniformly at random means P('Unit A') == P('Unit Z'), when 'A' should be much more likely. Sampling from a Zipfian distribution gets the desired effect in situations where address components are numbered by "counting from 0/1/A", while still allowing for a long tail
2016-07-21 17:04:57 -04:00
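
The sampling.py commit message describes sampling unit letters from a Zipfian distribution rather than uniformly. The following is a minimal sketch of that idea, not the contents of sampling.py: the function names (`zipfian_weights`, `cumulative`, `sample_discrete`) and the exponent parameter `s` are hypothetical and chosen only for illustration. It assumes rank k gets weight proportional to 1/k**s and samples by inverting the cumulative distribution.

```python
import bisect
import itertools
import random
import string


def zipfian_weights(n, s=1.0):
    """Normalized Zipfian probabilities for ranks 1..n, P(k) proportional to 1/k**s.

    Hypothetical helper; `s` is an assumed exponent, not taken from libpostal.
    """
    raw = [1.0 / (k ** s) for k in range(1, n + 1)]
    total = sum(raw)
    return [w / total for w in raw]


def cumulative(probs):
    """Build a cumulative distribution from a list of probabilities."""
    cdf = list(itertools.accumulate(probs))
    cdf[-1] = 1.0  # guard against floating-point drift in the last bucket
    return cdf


def sample_discrete(values, cdf, rng=random):
    """Draw one value by locating a uniform random number in the CDF."""
    i = bisect.bisect_right(cdf, rng.random())
    return values[min(i, len(values) - 1)]


# Usage: 'A' has rank 1 and is therefore far more likely than 'Z' (rank 26),
# while 'Z' remains possible (the long tail).
letters = list(string.ascii_uppercase)
letter_cdf = cumulative(zipfian_weights(len(letters)))
print("Unit " + sample_discrete(letters, letter_cdf, random.Random(0)))
```

With `s=1.0` and 26 letters, 'A' is drawn roughly a quarter of the time and 'Z' about one time in a hundred, which matches the intent stated in the commit message: low letters dominate, but the tail is never cut off.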