Martin Kroeker
dc4fcb48df
Fix inverted conditional for caxpy/zaxpy
4 years ago
Martin Kroeker
7a48247761
fix c/zrot and sgemv for POWER5
4 years ago
Martin Kroeker
7dfc45e840
Remove casts for PPC/POWER and complete parameters for POWER3/4
4 years ago
Zhang Xianyi
9f3d903817
Merge pull request #3259 from zhaofengli/riscv64-fixes
riscv64 fixes
4 years ago
Zhaofeng Li
590be3fae3
riscv64: Add Makefile
4 years ago
Zhaofeng Li
3521cd48cb
RISCV64_GENERIC: Use generic kernel for DSDOT for better precision
The implementation in `riscv64/dot.c` fails the `test_dsdot` test, and
the generic kernel seems to have better precision. Tested on SiFive
FU740 (HiFive Unmatched) and QEMU.
Also see #1469 .
4 years ago
Zhaofeng Li
1e0192a5cc
riscv64/imin: Fix wrong comparison
Same as #1990 .
4 years ago
Martin Kroeker
fe9aff17fe
Merge pull request #3258 from martin-frbg/hbaction
revert "try to work around gcc update problems" in Homebrew workflow
4 years ago
Martin Kroeker
8c25b440a0
revert "try to work around gcc update problems"
...as homebrew has dropped at least gcc8 now
4 years ago
Martin Kroeker
42f048cf6c
Merge pull request #3249 from MikaelUrankar/develop
Fix typo
5 years ago
MikaelUrankar
4fbc0777f4
Fix typo
5 years ago
Martin Kroeker
d7472606d5
Merge pull request #3244 from martin-frbg/issue3237
Add fast path for small xSYR with INCX==1
5 years ago
Martin Kroeker
03297ff9f0
Add fast path for small xSYR with INCX==1
5 years ago
Martin Kroeker
2d8d0af0ea
Merge pull request #3243 from martin-frbg/lapack564
Fix spurious error exit test failures in the ?chktsqr tests (LAPACK564)
5 years ago
Martin Kroeker
5f677e782e
Merge pull request #3196 from guowangy/skylakex-gemm-batch-k
GEMM: skylake: improve the performance when m is small
5 years ago
Martin Kroeker
04c60cee5d
Merge pull request #3242 from martin-frbg/issue3239
Handle inadvertent use of DYNAMIC_ARCH=0
5 years ago
Martin Kroeker
3a53207cc9
Fix spurious error exit test failures in the ?chktsqr tests (LAPACK564)
5 years ago
Martin Kroeker
0e73d20629
Handle inadvertent use of DYNAMIC_ARCH=0
5 years ago
Martin Kroeker
02087a62e7
Merge pull request #3205 from intelmy/sgemv_n_opt
optimize on sgemv_n for small n
5 years ago
Martin Kroeker
03b4d79a7e
Merge pull request #3238 from martin-frbg/lapack555
Correct function name in error message from SLASQ2 (LAPACK PR555)
5 years ago
Martin Kroeker
5c729c6dce
Correct function name in error message from SLASQ2 (Reference-LAPACK PR 555)
5 years ago
Martin Kroeker
e1911b2e60
Merge pull request #3236 from martin-frbg/issue3234
Add -lm for FreeBSD on ARM/ARM64
5 years ago
Martin Kroeker
8f33da4f94
Merge pull request #3235 from dnoan/develop
Update Makefile.arm64
5 years ago
Martin Kroeker
26ccf643a3
Add -lm for FreeBSD on ARM/ARM64
5 years ago
Noan
32264ba496
Update Makefile.arm64
Added -march and -mtune flags for EMAG processors when GCC 9 or later
5 years ago
Martin Kroeker
4ecf631f95
Merge pull request #3228 from martin-frbg/issue3226
filter out -mavx flag on Sandybridge zgemm/ztrmm kernels
5 years ago
Martin Kroeker
5af510081d
Merge pull request #3233 from martin-frbg/issue3230
Add autodetection for Intel Ice Lake SP
5 years ago
Martin Kroeker
164551d5a2
Merge pull request #3232 from martin-frbg/lapack553
Reduce stack size requirements in the LAPACK LIN tests (LAPACK PR 553)
5 years ago
Martin Kroeker
310b76aad7
Merge pull request #3231 from martin-frbg/issue3227
Support compilation with pre-C99 versions of MSVC
5 years ago
Martin Kroeker
c4da892ba0
Only filter out -mavx on Sandybridge ZGEMM/ZTRMM kernels
5 years ago
Martin Kroeker
cbfd3c87e1
Recognize Intel Ice Lake SP as Cooper Lake
5 years ago
Martin Kroeker
26e87ac517
Support Intel Ice Lake SP as Cooper Lake
5 years ago
Martin Kroeker
15b9d6b4a7
Delete zchkaa.f
5 years ago
Martin Kroeker
f7bcd962c1
Delete schkaa.f
5 years ago
Martin Kroeker
93cc066921
Delete dchkaa.f
5 years ago
Martin Kroeker
2c7d4a7766
Delete cchkaa.f
5 years ago
Martin Kroeker
eef1c42f03
Convert ?chkaa to use dynamic allocation for the larger arrays
5 years ago
Martin Kroeker
73f637e584
Support compilation with pre-C99 versions of MSVC
5 years ago
Martin Kroeker
8b90e5f202
Drop redundant inclusion of complex.h
5 years ago
Martin Kroeker
bd60fb6ffc
filter out -mavx flag on zgemm kernels as it can cause problems with older gcc
5 years ago
Martin Kroeker
37ea8702ee
Merge pull request #3192 from damonyu1989/develop
Update the intrinsic api to the offical name.
5 years ago
Martin Kroeker
ec7d6c02bc
Add an Android crossbuild on OSX to Azure CI ( #3224 )
* Add an Android crossbuild on OSX
5 years ago
Martin Kroeker
c90c23e78f
Merge pull request #3223 from martin-frbg/develop
Use percent instead of ampersand as placeholder for substitutions
5 years ago
Martin Kroeker
bda8820da7
Use percent instead of ampersand as placeholder for substitutions
5 years ago
Martin Kroeker
c0ca63ea46
Fix missing conditionals for non-SKX kernels
5 years ago
Martin Kroeker
f497bb949b
Merge pull request #3219 from austinpagan/Gemm.ErrorFix
Add error message token for SBGEMM in gemm.c
5 years ago
Martin Kroeker
f86b1bc3da
Merge pull request #3220 from drhpc/drhpc-fixup
Delete lapack_wrappers.c.orig
5 years ago
drhpc
206e03fdac
Delete lapack_wrappers.c.orig
This looks like a leftover from patching and confuses further patching;-)
5 years ago
Gordon Fossum
8b599836db
Add error message token for SBGEMM in gemm.c
5 years ago
Martin Kroeker
9721b57ecf
Update version to 0.3.15.dev
5 years ago