Commit Graph

5474 Commits

Author SHA1 Message Date
Al B
4858732e60 Merge pull request #626 from ddelange/patch-2
Unify and clean the unofficial project references
2025-02-08 11:55:16 -05:00
Al B
ee7aa9a3ad Merge pull request #684 from openvenues/fix_compilation_windows_11
conditional include of strndup.h, fixes #682
2025-01-31 21:02:35 -05:00
Al
052d99de74 [fix] ifdef includes around stndup 2025-01-31 16:14:33 -05:00
Al
5d0b6dece5 [readme] Github Actions status instead of Travis build 2025-01-30 16:10:01 -05:00
Al B
a56e80cc9c Merge pull request #683 from openvenues/lang_classifier_gcc14
Using exported libpostal_language_classifier_response_t struct throughout, fixes #677
2025-01-30 14:14:37 -05:00
Al
d491739b21 [fix] sparse_matrix_utils also had an incompatible pointer 2025-01-30 13:13:29 -05:00
Al
0540d7c7e3 [api/compat] PR #465 redefined the language classifier response struct in the API and was casting between incompatible pointer types. Using the exported struct throughout. 2025-01-30 01:45:18 -05:00
Al B
9e5af6b044 Merge pull request #679 from SiarheiFedartsou/sf-apple-silicon2
Do not use SSE flags on arm64 platform
2024-12-23 19:11:48 -05:00
Al
4f51945402 [github] no need to use macos-15-latest which requires upgrading, just doing macos-latest which is arm64 2024-12-23 19:06:08 -05:00
Al B
7245a178cf Merge pull request #680 from rjurney/rjurney/readme-update
Update README.md to include 'build-essential' package in apt list and 3 common configuration options
2024-12-23 18:28:43 -05:00
Russell Jurney
3caca83f04 Update README.md to fix /tmp reference to [...some dir with a few GB of space...] 2024-12-23 17:39:43 -05:00
Russell Jurney
ff72880e62 Update README.md to include build-essential in apt list and 3 options for configuration 2024-12-23 17:35:51 -05:00
Siarhei Fedartsou
ff623ae2dc Do not use SSE flags on arm64 platform 2024-12-03 18:35:47 +01:00
Siarhei Fedartsou
e36f8599ba Do not use SSE flags on arm64 platform 2024-12-03 18:33:47 +01:00
Egor Shestakov
aa712e60c1 [fix] initialize next_id by zero in trie_new_from_cstring_array function 2024-06-06 00:34:55 +07:00
karanj
6e2311c217 Merge branch 'openvenues:master' into master 2023-10-07 21:49:39 +10:00
Al B
8f2066b1d3 Merge pull request #640 from openvenues/fix_aligned_resize
Fix aligned resize
2023-08-21 17:23:35 -04:00
Al
330bd2e158 [fix] Windows name 2023-08-18 22:37:50 -04:00
Al
e4982b733f [fix] memcpy in aligned vector/matrix resize needs to know the original size. Since this is an internal static function and does not affect client APIs, changing it to aligned_malloc, aligned_free, and aligned_resize, which takes the original pointer size as well as the new. On Windows it simply passes the pointer, new size, and alignment through _aligned_realloc, whereas on other platforms only the aligned_malloc is for new_size bytes and uses old_size bytes for memcpy 2023-08-18 13:39:36 -04:00
Marek Pruteanu
385d2ba09a add g-ral as a variant for general in Romanian personal titles 2023-07-27 20:00:39 +03:00
Al B
1fe1f0af1f Merge pull request #632 from openvenues/fix_memalign
fixes for SSE/memalign issue in crf_context test
2023-07-06 14:16:05 -07:00
Al
7bdcf96c9d [memalign] no more realloc on aligned pointers, just do an aligned malloc and copy to it. Slower but safe and this is not called that often in practice, usually to resize larger matrices. 2023-07-06 16:01:02 -04:00
Al
5a1f6df5a9 [sse] ok something about that PR breaks either way. Now trying it with SSE 2023-07-06 02:05:17 -04:00
Al
57eaa414ce [revert] reverting the commits from #578, leaving header file in repo for the moment 2023-07-06 01:54:46 -04:00
Al
c76d020c18 [fix] same result running test as a separate step 2023-07-06 01:36:32 -04:00
Al
d979fbb779 [test] trying make check in the same step, to see if that makes a difference 2023-07-06 01:28:49 -04:00
Al
59325c3b13 [test] testing with sse2 disabled to see if the build is working generally 2023-07-06 01:16:22 -04:00
Al
7a448b718d [crf] using 32 bytes for posix_memalign to align blocks of 4 doubles for remez algorithm to fix test which uses an odd-sized context 2023-07-05 21:02:41 -04:00
Al
b65e7d5bce [fix] no sudo on brew on Mac in github actions, just like on a regular machine/in the docs 2023-07-05 20:47:14 -04:00
Al
2b93af09d9 [build] removing travis build 2023-07-05 20:43:18 -04:00
Al
5669372a90 [fix] sudo in github actions for build tool installs 2023-07-05 20:42:50 -04:00
Al
2f20c9359e [github] adding Github action to run tests on mac and ubuntu initially 2023-07-05 20:38:48 -04:00
Al B
dc794b1b64 Merge pull request #631 from madrisan/libpostal_data_syntax
Fix dash syntax error in libpostal_data
2023-07-03 10:36:13 -07:00
Davide Madrisan
dcb63d8768 Fix dash syntax error in libpostal_data
Fix the syntax error reported by dash:

    ./src/libpostal_data: 39: [: ==: unexpected operatora

when the variable DATAMODEL is empty.

Signed-off-by: Davide Madrisan <davide.madrisan@gmail.com>
2023-06-29 14:36:10 +02:00
Al B
32d636f378 Merge pull request #630 from motiejus/patch-1
avoid UB in bit shifts
2023-06-24 19:35:45 -07:00
Motiejus Jakštys
5d77298e88 avoid UB in bit shifts
unsigned char* gets promoted to `int`, which cannot always be shifted by 24 bits.

Justine Tunney blogs about it here: https://justine.lol/endian.html

Example:

```deserialize.c
#include <stdlib.h>
#include <stdio.h>
#include <stdint.h>

uint32_t file_deserialize_uint32_ok(unsigned char *buf) {
    return ((uint32_t)buf[0] << 24) | ((uint32_t)buf[1] << 16) | ((uint32_t)buf[2] << 8) | (uint32_t)buf[3];
}

uint32_t file_deserialize_uint32(unsigned char *buf) {
    return (buf[0] << 24) | (buf[1] << 16) | (buf[2] << 8) | buf[3];
}

int main() {
    unsigned char arr[4] = {0xaa, 0xaa, 0xaa, 0xaa};

    printf("%d\n", file_deserialize_uint32_ok((unsigned char*)arr));
    printf("%d\n", file_deserialize_uint32((unsigned char*)arr));
}
```

Output:
```
$ clang-16 -fsanitize=undefined ./deserialize.c -o deserialize && ./deserialize
-1431655766
deserialize.c:10:20: runtime error: left shift of 170 by 24 places cannot be represented in type 'int'
SUMMARY: UndefinedBehaviorSanitizer: undefined-behavior deserialize.c:10:20 in 
-1431655766
```
2023-06-23 12:16:35 +03:00
ddelange
ef215786f1 Unify and clean the unofficial project references 2023-04-29 22:21:18 +02:00
Al B
4c98eaa7dc Merge pull request #625 from ddelange/patch-1
Add support for OpenBLAS
2023-04-18 14:31:49 -04:00
ddelange
6f95677427 Explicit -lopenblas 2023-04-18 12:00:10 +02:00
ddelange
8eb721f6a1 Fix typo 2023-04-18 11:19:31 +02:00
ddelange
0ad268f991 Add support for OpenBLAS 2023-04-18 10:57:53 +02:00
dependabot[bot]
5c83807702 Bump future from 0.15.2 to 0.18.3 in /scripts
Bumps [future](https://github.com/PythonCharmers/python-future) from 0.15.2 to 0.18.3.
- [Release notes](https://github.com/PythonCharmers/python-future/releases)
- [Changelog](https://github.com/PythonCharmers/python-future/blob/master/docs/changelog.rst)
- [Commits](https://github.com/PythonCharmers/python-future/compare/v0.15.2...v0.18.3)

---
updated-dependencies:
- dependency-name: future
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
2023-04-16 03:06:14 +00:00
PIT-Development
e2590bca97 docs: fix typos in contributing.md (#622)
* Respect typo

Repeect should be respect

* Update CONTRIBUTING.md

Also include guildelines to guidelines
2023-04-13 08:38:52 +02:00
Stefan Charsley
d881c82def Update gh.yaml 2023-03-13 15:39:36 +13:00
Stefan Charsley
ed38d7ff50 Update tw.yaml 2023-03-13 15:24:03 +13:00
Al B
9546eacb26 Merge pull request #616 from oskar700/ot-senzing-datamodel
Adding senzing model from @oskar700 and @brianmacy, along with a new MODEL switch in configure
2023-02-19 17:31:01 -05:00
karanj
18b298668b [dictionaries] adding common Hindi tokens
Reviewed the Hindi ngrams list, identified a number of common items which could be added to the model
2023-02-16 21:45:06 +11:00
Oskar Thorbjornsson
00568da290 Modifying README and config parameter, based on code review. 2023-02-14 21:02:51 -08:00
Oskar Thorbjornsson
0c0818c683 Update Senzing link. 2023-02-13 17:03:42 -08:00
Oskar Thorbjornsson
a11f33fb3d Add a link to info about Senzing data model. 2023-02-13 13:32:38 -08:00