Commit Graph

1781 Commits

Author SHA1 Message Date
Al
3bf7edfc5a [fix] file extension part II 2016-04-22 16:48:26 -04:00
Al
62ba9c53ee [fix] file extension 2016-04-22 16:47:54 -04:00
Al
87f9ec4227 [fix] chmod +x 2016-04-22 16:46:23 -04:00
Al
9b54fda8b2 [chains] Adding code to generate chain_stores repo TSV files from OSM 2016-04-22 16:45:28 -04:00
Al
22fabccf31 [fix] double pipes 2016-04-22 16:25:16 -04:00
Al
aba1ebb7de [dictionaries] H&R block was missing 2016-04-22 16:11:45 -04:00
Al
6080441ef8 [rm] Removing first attempt at chain stores in favor of new dictionary type 2016-04-22 13:30:42 -04:00
Al
b56729c5a6 [chains] Adding chain stores derived from frequent OSM venue names at https://github.com/openvenues/chain_stores + research 2016-04-22 13:28:48 -04:00
Al
907c8fe96d [addresses] /po_box/po_boxes/ 2016-04-20 17:07:39 -04:00
Al
6ff0b25f40 [addresses] Generate house number related phrases 2016-04-20 17:06:30 -04:00
Al
1eeda65cfd [dictionaries] /house_number/house_numbers/ 2016-04-20 15:57:12 -04:00
Al
dba8be445d [fix] None handling and number dictionaries 2016-04-20 14:59:54 -04:00
Al
901f720368 [addresses] different dictionaries for sampling cardinal/unit directions, not converting None to a string 2016-04-19 17:05:10 -04:00
Al
d91735c3c2 [addresses] Updating English config to support new options for occasionally adding whitespace between unit numbers 2016-04-19 17:03:46 -04:00
Al
10320723b1 [dictionaries] Removing ambiguous abbreviations for flat 2016-04-19 17:02:58 -04:00
Al
38ec82a42b [addresses] Unit/apartment number generation 2016-04-19 17:01:24 -04:00
Al
1acf0d592b [addresses] sample positive floors 2016-04-19 16:59:16 -04:00
Al
868fcb752b [mv] Moving sampling to math.sampling 2016-04-19 11:57:42 -04:00
Al
c31926f3dd [addresses] Adding more numeric/numeric_affix probabilities to English config 2016-04-19 11:25:12 -04:00
Al
ce2b2d9559 [addresses] Conjunction can be subclassed 2016-04-19 11:22:13 -04:00
Al
c92af0da78 [addresses] Adding ability to randomly append relative/cardinal directions 2016-04-19 11:21:23 -04:00
Al
450aee95c2 [addresses] Adding base class for numeric phrases (appended to a number using numeric/numeric_affix), using probability 1.0 if only one of numeric/numeric_affix/ordinal is specified 2016-04-19 11:07:25 -04:00
Al
1b2e92dc14 [fix] polygons 2016-04-19 10:15:31 -04:00
Al
9abc679f09 [fix] typo 2016-04-19 00:53:39 -04:00
Al
ccbbf84e8d [dictionaries] Updates to Spanish dictionaries, casa can be a numbered unit type 2016-04-19 00:45:32 -04:00
Al
b8125a232d [dictionaries] Updates to English dictionaries 2016-04-19 00:44:33 -04:00
Al
47ffd18c8c [polygons] Adding __iter__ and __len__ to polygon index and keeping track of the number of polygons for iteration 2016-04-19 00:42:57 -04:00
Al
9271fda30e [addresses] Combined unit + house number (32/4, etc.) is more common in Canada, Australia, Singapore, etc. Not as much in the US, UK 2016-04-18 17:05:55 -04:00
Al
d88f130edf [addresses] changing plurals to use the standard probability structure 2016-04-18 15:12:59 -04:00
Al
7272d44575 [dictionaries] Updates to Spanish dictionaries to support the new structure, new abbreviations for Colombia, etc. 2016-04-15 14:21:43 -04:00
Al
2a570481ba [addresses] implementing null_probability (raw number, no phrase), orindal genders, and direction_probability 2016-04-15 03:25:41 -04:00
Al
430ad2e187 [numbers] suffixed_number 2016-04-15 02:04:58 -04:00
Al
028dbacc87 [dictionaries] making entrances/postcodes plural for consistency 2016-04-15 01:10:03 -04:00
Al
883ef2ec56 [dictionaries] Moving intersections to cross streets 2016-04-14 17:53:45 -04:00
Al
5850793768 [expansion] Add postcode dictionary to gazetteer types 2016-04-14 14:33:02 -04:00
Al
6babbfaf02 [addresses] generator for floor numbers as well as special aliases like basement, mezzanine, etc. using the address configs 2016-04-14 14:22:08 -04:00
Al
36b3d515ad [expansion] Modifying the Python gazetteers to use new dictionaries API 2016-04-14 14:17:09 -04:00
Al
2ff4940e36 [expansion] Adding number and intersections to dictionary types 2016-04-14 14:15:33 -04:00
Al
49b02796c0 [addresses] Adding abbreviations as a separate module so it can be used with multiple data sets 2016-04-14 03:10:01 -04:00
Al
a6553b77d3 [addresses] PO Box phrase generator 2016-04-14 02:38:45 -04:00
Al
9eb444b193 [addresses] PO Box fixes in the address config 2016-04-14 02:38:04 -04:00
Al
d29ade7210 [addresses] conjunction class for building phrases like "5th and 6th" or "Units 1 & 2" across languages using the address configs 2016-04-14 01:21:44 -04:00
Al
f0ac3522da [addresses] base class for numbered components (floors, units, house numbers in some languages/countries). Can generate many variants of a number (e.g. Floor 2, 2nd Floor, Floor #2, Floor No. 2, etc.) 2016-04-14 01:17:43 -04:00
Al
fe006e0d62 [addresses] utilities for sampling from an arbitrary discrete distribution, building cumulative distributions, and sampling from a Zipfian distribution which seems to be a reasonable way of generating plausible apartment/floor numbers when the height/number of units is unknown. Picking a letter uniformly at random means P('Unit A') == P('Unit Z') when 'A' should be much more likely. Sampling from a Zipfian gets the desired effect in situations where address components are numbered by "counting from 0/1/A" while still allowing for a long tail 2016-04-14 01:13:39 -04:00
Al
58feeab714 [addresses] address config class for general sampling of forms specified in the address configs (default/alternatives to choose a phrase, canonical/abbreviated/sample to choose an abbreviation or surface form for that phrase) 2016-04-14 01:06:54 -04:00
Al
518140a1b5 [addresses] Adding corner_of key to the English address config 2016-04-14 01:04:01 -04:00
Al
db9d51e655 [dictionaries] Intersections dictionary for English 2016-04-14 01:01:28 -04:00
Al
8fdd3e9314 [addresses] Additions to the English address config 2016-04-14 00:56:39 -04:00
Al
e37431912d [boundaries/fix] admin_level 7 in Australia should map to city, not state_district 2016-04-13 18:27:35 -04:00
Al
7bb5da94bb [dictionaries] Making the word for "number" a separate dictionary as it can apply in several places 2016-04-13 18:27:04 -04:00