tommy/libpostal
libpostal/scripts/geodata/i18n at 5bbc0e15d77a8e7827723ffd4ef6a7e9d704a9ac
Latest commit: 293587bae9 by Al, 2017-02-10 23:53:50 -05:00
[addresses] adding new config for postal codes around the world. Allows appending the ISO alpha-2 country code to the beginning of the postcode as in e.g. SI-1000 (only used if the postcode begins with a digit). This system was used for postal codes in continental Europe as a recommendation from the CEPT. Now 7 member states still use it, so in those countries add the country-code with higher probability. The config also contains the license plate codes for countries where e.g. L-1234 might be used instead of LU-1234. Allows configuring in which countries postcodes should be validated using Google's per-country validation regexes (and the ability to override with a custom regex), and in which countries other admin component names should be stripped.
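The prefixing behaviour described in the commit message above can be illustrated with a minimal sketch. This is not the code in this repository: the function name `add_country_prefix`, the `LICENSE_PLATE_CODES` table, and both probability parameters are hypothetical, and the real config format is not shown on this page. Only the examples SI-1000, LU-1234 and L-1234 and the "begins with a digit" rule come from the commit message.

```python
import random

# Hypothetical lookup: ISO alpha-2 code -> vehicle registration (license plate) code.
# Only the Luxembourg pair (LU -> L) is taken from the commit message.
LICENSE_PLATE_CODES = {"LU": "L"}


def add_country_prefix(postcode, country_code, prefix_probability,
                       license_plate_probability=0.0, rng=random):
    """Return the postcode, possibly prefixed with a country code.

    All parameter names are assumptions made for this sketch.
    prefix_probability would be set higher for countries that still
    use the prefixed form.
    """
    # Per the commit message, the prefix is only used when the
    # postcode begins with a digit.
    if not postcode or not postcode[0].isdigit():
        return postcode

    # Randomly decide whether to add a prefix at all.
    if rng.random() >= prefix_probability:
        return postcode

    prefix = country_code.upper()

    # Optionally swap in the shorter license plate code, e.g. L for LU.
    plate_code = LICENSE_PLATE_CODES.get(prefix)
    if plate_code is not None and rng.random() < license_plate_probability:
        prefix = plate_code

    return "{}-{}".format(prefix, postcode)
```

With `prefix_probability=1.0` the call `add_country_prefix("1000", "SI", 1.0)` returns `"SI-1000"`, and a postcode that starts with a letter is returned unchanged.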
| Name | Last commit | Date |
|---|---|---|
| __init__.py | [tokenization] Script to generate TR-29 ranges for re2c scanner | 2015-04-14 15:50:50 -04:00 |
| cldr_languages.py | [fix] cldr languages dir | 2015-08-11 20:04:25 -04:00 |
| download_cldr.py | [fix] ensure CLDR dir | 2015-08-11 20:04:42 -04:00 |
| google.py | [addresses] adding new config for postal codes around the world. Allows appending the ISO alpha-2 country code to the beginning of the postcode as in e.g. SI-1000 (only used if the postcode begins with a digit). This system was used for postal codes in continental Europe as a recommendation from the CEPT. Now 7 member states still use it, so in those countries add the country-code with higher probability. The config also contains the license plate codes for countries where e.g. L-1234 might be used instead of LU-1234. Allows configuring in which countries postcodes should be validated using Google's per-country validation regexes (and the ability to override with a custom regex), and in which countries other admin component names should be stripped. | 2017-02-10 23:53:50 -05:00 |
| languages.py | [osm] adding admin1 ids to the OSM country rtree | 2016-10-04 23:12:15 -04:00 |
| normalize.py | [fix] import | 2015-08-22 23:19:43 -04:00 |
| scanner.py | [cldr] simple Python scanner for creating dynamic scanners for CLDR rule parsing | 2015-04-14 15:49:24 -04:00 |
| transliteration_rules.py | [transliteration] making the custom rules in transliteration less repetitious and accessible from elsewhere, removing string names for common transliterators and using constants | 2017-01-05 04:06:51 -05:00 |
| unicode_data.py | [fix] unidata file | 2017-01-05 04:24:33 -05:00 |
| unicode_paths.py | [mv] Moving all repo data files to a resources dir, data is only for runtime files | 2015-07-21 18:11:36 -04:00 |
| unicode_properties.py | [fix] only accept language suffixes that are valid scripts or transliterations of CJK languages. Set language to language suffix so Romaji forms get used, etc. | 2016-12-24 17:17:09 -05:00 |
| word_breaks.py | [unicode] Wide version of word breaks | 2015-09-22 18:55:33 -04:00 |