tommy/libpostal
ce2b2d95590a4e8ce6b71d0f688cf9b350e98d34
libpostal/scripts/geodata/addresses
Al
ce2b2d9559
[addresses] Conjunction can be subclassed
2016-04-19 11:22:13 -04:00
__init__.py
[addresses] address config class for general sampling of forms specified in the address configs (default/alternatives to choose a phrase, canonical/abbreviated/sample to choose an abbreviation or surface form for that phrase)
2016-04-14 01:06:54 -04:00
config.py
[addresses] address config class for general sampling of forms specified in the address configs (default/alternatives to choose a phrase, canonical/abbreviated/sample to choose an abbreviation or surface form for that phrase)
2016-04-14 01:06:54 -04:00
conjunctions.py
[addresses] Conjunction can be subclassed
2016-04-19 11:22:13 -04:00
directions.py
[addresses] Adding ability to randomly append relative/cardinal directions
2016-04-19 11:21:23 -04:00
floors.py
[addresses] generator for floor numbers as well as special aliases like basement, mezzanine, etc. using the address configs
2016-04-14 14:22:08 -04:00
numbering.py
[addresses] Adding base class for numeric phrases (appended to a number using numeric/numeric_affix), using probability 1.0 if only one of numeric/numeric_affix/ordinal is specified
2016-04-19 11:07:25 -04:00
po_box.py
[addresses] PO Box phrase generator
2016-04-14 02:38:45 -04:00
sampling.py
[addresses] utilities for sampling from an arbitrary discrete distribution, building cumulative distributions, and sampling from a Zipfian distribution, which seems to be a reasonable way of generating plausible apartment/floor numbers when the height/number of units is unknown. Picking a letter uniformly at random means P('Unit A') == P('Unit Z'), when 'A' should be much more likely. Sampling from a Zipfian distribution gets the desired effect in situations where address components are numbered by "counting from 0/1/A", while still allowing for a long tail
2016-04-14 01:13:39 -04:00
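The idea described in the sampling.py commit message can be illustrated with a minimal sketch. This is not the code from sampling.py, whose contents are not shown in this listing; the function names (zipfian_weights, cumulative, sample_index, sample_unit_letter) and the exponent parameter s are hypothetical and chosen only for illustration. The sketch builds Zipfian probabilities over ranks 1..n, turns them into a cumulative distribution, and samples by inverting that distribution with a binary search, so that 'Unit A' comes out far more often than 'Unit Z'.

```python
import bisect
import random
from itertools import accumulate
from string import ascii_uppercase


def zipfian_weights(n, s=1.0):
    """Normalized probabilities for ranks 1..n, with P(k) proportional to 1 / k**s.

    The exponent `s` is an illustrative assumption, not a value from the repo.
    """
    raw = [1.0 / (k ** s) for k in range(1, n + 1)]
    total = sum(raw)
    return [w / total for w in raw]


def cumulative(probs):
    """Running sums of a discrete distribution (its CDF)."""
    cdf = list(accumulate(probs))
    cdf[-1] = 1.0  # guard against floating-point drift in the final sum
    return cdf


def sample_index(cdf, rng=random):
    """Draw an index from any discrete distribution by inverting its CDF."""
    # rng.random() is in [0, 1), so the result is always a valid index.
    return bisect.bisect_right(cdf, rng.random())


def sample_unit_letter(rng=random):
    """Pick a unit letter where 'A' is most likely and 'Z' least likely."""
    cdf = cumulative(zipfian_weights(len(ascii_uppercase)))
    return ascii_uppercase[sample_index(cdf, rng)]


if __name__ == "__main__":
    rng = random.Random(0)
    print([sample_unit_letter(rng) for _ in range(10)])
```

The same sample_index helper works for any list of probabilities, not only Zipfian ones, which mirrors the "arbitrary discrete distribution" case mentioned in the commit message. Because every rank keeps a nonzero probability, late letters or high floor numbers still appear occasionally, which is the long tail the message refers to.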