Werner Saar
c2891330bc
updated KERNEL.ARMV6
10 years ago
Werner Saar
ceaa931e48
updated gemv kernel for armv6
10 years ago
Werner Saar
eaa63165df
updated cgemv and zgemv kernels for armv6
10 years ago
Werner Saar
c65357c566
updated trmm_kernels for armv6
10 years ago
Werner Saar
e63e9f9f26
updated gemm_kernels for armv6
10 years ago
Jerome Robert
1fe3aab047
Use GEMM_MULTITHREAD_THRESHOLD as a number of ops
...not a matrix size. For GEMM_MULTITHREAD_THRESHOLD=4
(the default value) this does not change anything but
for other values it make the GEMM and GEMV thresholds
changing in the same way.
Close #742
10 years ago
Werner Saar
aafd3ab60e
updated cdot and zdot on arm
10 years ago
Jerome Robert
1a1935507b
[z]ger: increase multithread threshold
The ones given in 3ae30cd was by far to low because I
mixed m and m*n in my measures. Note that the new ones
are closed to the [z]gemv ones which is comforting
that both are right.
10 years ago
Werner Saar
d2f84c9c8a
Ref #740 : updated nrm2_vfp.S
10 years ago
Werner Saar
ca32253f32
Ref #740 : updated asum_vfp.S and iamax_vfp.S
10 years ago
Werner Saar
9066d1f982
Ref #750 and Ref #740 : bugfix for sdot, dsdot and ddot on arm
10 years ago
Lauri Tirkkonen
8d85be770d
actually install the shared lib on illumos
10 years ago
Lauri Tirkkonen
7ba1d9b9ca
actually build the shared lib on illumos
10 years ago
Lauri Tirkkonen
31aff441ce
use $(AWK) in Makefile.install and switch it to nawk
10 years ago
Lauri Tirkkonen
e737e32fd1
RLIMIT_NPROC doesn't exist on illumos
10 years ago
Lauri Tirkkonen
8635d425c1
make parallel make work on illumos
10 years ago
Lauri Tirkkonen
97cd4b8aee
illumos fixes to memory.c
10 years ago
wernsaar
72390e3ffb
Merge pull request #747 from wernsaar/develop
Ref #730 : added performance updates for syrk and syr2k
10 years ago
Werner Saar
b07d733a71
added updates for syrk and syr2k
10 years ago
Zhang Xianyi
fa3018c30e
Merge pull request #745 from jakirkham/minor_fix_scipy_prof
BENCH: Minor fixes in SciPy benchmarks
10 years ago
Zhang Xianyi
6caa40302e
Merge pull request #744 from jeromerobert/bug731
Bug731
10 years ago
John Kirkham
a48b247e9e
benchmark/scripts/SCIPY/dsyrk.py: Overwrite will work on a Fortran array of the correct type.
10 years ago
John Kirkham
b1b115ecd6
benchmark/scripts/SCIPY/ssyrk.py: Overwrite will work on a Fortran array of the correct type.
10 years ago
John Kirkham
07bba933ff
benchmark/scripts/SCIPY/dsyrk.py: Arrays should be Fortran order.
10 years ago
John Kirkham
e85f8af519
benchmark/scripts/SCIPY/ssyrk.py: Arrays should be Fortran order.
10 years ago
John Kirkham
adfa0ab878
benchmark/scripts/SCIPY/ssyrk.py: Fix PEP8 issues.
10 years ago
John Kirkham
cbb6649e97
benchmark/scripts/SCIPY/dsyrk.py: Fix PEP8 issues.
10 years ago
John Kirkham
77abc9b280
benchmark/scripts/SCIPY/ssyrk.py: Write values into `C`.
10 years ago
John Kirkham
81e8690763
benchmark/scripts/SCIPY/dsyrk.py: Write values into `C`.
10 years ago
John Kirkham
dd04a8ac22
benchmark/scripts/SCIPY/ssyrk.py: Use the environment python.
10 years ago
John Kirkham
cb554b3a9c
benchmark/scripts/SCIPY/dsyrk.py: Use the environment python.
10 years ago
John Kirkham
1153459d1b
benchmark/scripts/SCIPY/ssyrk.py: Drop unneeded semicolons.
10 years ago
John Kirkham
1a73390ffe
benchmark/scripts/SCIPY/dsyrk.py: Drop unneeded semicolons.
10 years ago
John Kirkham
8b981e41a1
benchmark/scripts/SCIPY/ssyrk.py: Allocate `C` using zeros instead of randomly generating it.
10 years ago
John Kirkham
c10b1f555d
benchmark/scripts/SCIPY/dsyrk.py: Allocate `C` using zeros instead of randomly generating it.
10 years ago
Jerome Robert
14db1ca508
update CONTRIBUTORS.md
10 years ago
Jerome Robert
66eafb16cf
swap: disable multi-threading for small matrices
Close #731
10 years ago
Jerome Robert
3ae30cd6b9
Disable multi-threading for small matrices in [z]ger
Ref #731
10 years ago
Werner Saar
692d9c881c
Ref #740 : simple solution to clear floating point register on arm
10 years ago
Zhang Xianyi
055b481386
Fixed CMake bug for single core.
10 years ago
Zhang Xianyi
ce2b1edd4e
[av skip] Change test cmd on Travis.
10 years ago
Zhang Xianyi
8cf3657fb6
Refs #738 . Fix previous commit bug. Run BLAS and CBLAS test on Travis.
10 years ago
Zhang Xianyi
44222a7fe0
Refs #738 . Run test on Travis.
10 years ago
Zhang Xianyi
3ac153180c
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
10 years ago
Zhang Xianyi
96b486acee
Merge branch 'jeromerobert-bug736' into develop
10 years ago
Zhang Xianyi
3602a2cd1f
#736 Revert #733 patch to fix bus error on ARM.
10 years ago
Zhang Xianyi
b65de4947a
Merge pull request #739 from sebastien-villemot/develop
Fixes for old outstanding bugs in CBLAS test programs
10 years ago
Sébastien Villemot
04ad946fc8
Fix output descriptors of c_{s,d,c,z}blat3
The NTRA argument can be equal to -1 if one does not want a snapshot file
(and this is the case with sample data {s,d,c,z}in3).
The routines {S,D,C,Z}PRCN3 will try to use their first argument as an output
unit number, so we avoid calling them when NTRA < 0.
Patch originally written by Camm Maguire.
10 years ago
Sébastien Villemot
f704b8d32f
Fix CBLAS double complex level 2 tests
The SNAME variable contains names of C functions like "cblas_dgemv".
Apparently the code was not taking into account the 6-letter "cblas_"
prefix when determining the task to be done.
The issue does not affect c_{s,d,c}blat2.f, which use the correct
offsetting.
Patch originally written by Camm Maguire.
10 years ago
Jerome Robert
708ad330ac
stack alloc: Fix stack smashing detection in 32bits
* Fix commit 87a2ccc
* Close #736
10 years ago