Zhang Xianyi
fa3018c30e
Merge pull request #745 from jakirkham/minor_fix_scipy_prof
BENCH: Minor fixes in SciPy benchmarks
10 years ago
Zhang Xianyi
6caa40302e
Merge pull request #744 from jeromerobert/bug731
Bug731
10 years ago
John Kirkham
a48b247e9e
benchmark/scripts/SCIPY/dsyrk.py: Overwrite will work on a Fortran array of the correct type.
10 years ago
John Kirkham
b1b115ecd6
benchmark/scripts/SCIPY/ssyrk.py: Overwrite will work on a Fortran array of the correct type.
10 years ago
John Kirkham
07bba933ff
benchmark/scripts/SCIPY/dsyrk.py: Arrays should be Fortran order.
10 years ago
John Kirkham
e85f8af519
benchmark/scripts/SCIPY/ssyrk.py: Arrays should be Fortran order.
10 years ago
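For context on the "Fortran order" commits above: BLAS routines such as `ssyrk` and `dsyrk` expect column-major storage. The following is a minimal pure-Python sketch of how the two layouts differ. The helper names are hypothetical and are not taken from the benchmark scripts.

```python
def row_major_offset(i, j, n_rows, n_cols):
    """Offset of element (i, j) in C (row-major) order."""
    return i * n_cols + j


def col_major_offset(i, j, n_rows, n_cols):
    """Offset of element (i, j) in Fortran (column-major) order."""
    return i + j * n_rows


def flatten(matrix, order):
    """Flatten a list-of-rows matrix in the requested order ('C' or 'F')."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    offset = row_major_offset if order == "C" else col_major_offset
    flat = [None] * (n_rows * n_cols)
    for i in range(n_rows):
        for j in range(n_cols):
            flat[offset(i, j, n_rows, n_cols)] = matrix[i][j]
    return flat
```

The same matrix therefore produces two different flat buffers, which is why the layout of the arrays handed to the routine matters. As the later commit messages in this series put it, overwrite works on a Fortran array of the correct type.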
John Kirkham
adfa0ab878
benchmark/scripts/SCIPY/ssyrk.py: Fix PEP8 issues.
10 years ago
John Kirkham
cbb6649e97
benchmark/scripts/SCIPY/dsyrk.py: Fix PEP8 issues.
10 years ago
John Kirkham
77abc9b280
benchmark/scripts/SCIPY/ssyrk.py: Write values into `C`.
10 years ago
John Kirkham
81e8690763
benchmark/scripts/SCIPY/dsyrk.py: Write values into `C`.
10 years ago
John Kirkham
dd04a8ac22
benchmark/scripts/SCIPY/ssyrk.py: Use the environment python.
10 years ago
John Kirkham
cb554b3a9c
benchmark/scripts/SCIPY/dsyrk.py: Use the environment python.
10 years ago
John Kirkham
1153459d1b
benchmark/scripts/SCIPY/ssyrk.py: Drop unneeded semicolons.
10 years ago
John Kirkham
1a73390ffe
benchmark/scripts/SCIPY/dsyrk.py: Drop unneeded semicolons.
10 years ago
John Kirkham
8b981e41a1
benchmark/scripts/SCIPY/ssyrk.py: Allocate `C` using zeros instead of randomly generating it.
10 years ago
John Kirkham
c10b1f555d
benchmark/scripts/SCIPY/dsyrk.py: Allocate `C` using zeros instead of randomly generating it.
10 years ago
Jerome Robert
14db1ca508
update CONTRIBUTORS.md
10 years ago
Jerome Robert
66eafb16cf
swap: disable multi-threading for small matrices
Close #731
10 years ago
Jerome Robert
3ae30cd6b9
Disable multi-threading for small matrices in [z]ger
Ref #731
10 years ago
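The idea behind the two #731 commits above (run small problems single-threaded) can be sketched as follows. This is only an illustration: the function name and the cutoff value are hypothetical, and the log does not state the threshold OpenBLAS actually uses.

```python
# Hypothetical cutoff; the real value used in OpenBLAS is not given in this log.
SMALL_PROBLEM_THRESHOLD = 10_000


def choose_num_threads(n_elements, available_threads):
    """Return 1 for small problems, otherwise all available threads."""
    if n_elements < SMALL_PROBLEM_THRESHOLD:
        # Assumed rationale: for small inputs the cost of dispatching work
        # to threads can outweigh any gain from parallelism.
        return 1
    return max(1, available_threads)
```

For example, `choose_num_threads(100, 8)` falls back to a single thread, while a problem above the assumed cutoff keeps all eight.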
Werner Saar
692d9c881c
Ref #740: simple solution to clear floating-point register on ARM
10 years ago
Zhang Xianyi
055b481386
Fixed CMake bug for single core.
10 years ago
Zhang Xianyi
ce2b1edd4e
[av skip] Change test cmd on Travis.
10 years ago
Zhang Xianyi
8cf3657fb6
Refs #738. Fix bug in previous commit. Run BLAS and CBLAS tests on Travis.
10 years ago
Zhang Xianyi
44222a7fe0
Refs #738. Run tests on Travis.
10 years ago
Zhang Xianyi
3ac153180c
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
10 years ago
Zhang Xianyi
96b486acee
Merge branch 'jeromerobert-bug736' into develop
10 years ago
Zhang Xianyi
3602a2cd1f
#736 Revert #733 patch to fix bus error on ARM.
10 years ago
Zhang Xianyi
b65de4947a
Merge pull request #739 from sebastien-villemot/develop
Fixes for old outstanding bugs in CBLAS test programs
10 years ago
Sébastien Villemot
04ad946fc8
Fix output descriptors of c_{s,d,c,z}blat3
The NTRA argument can be equal to -1 if one does not want a snapshot file
(and this is the case with sample data {s,d,c,z}in3).
The routines {S,D,C,Z}PRCN3 will try to use their first argument as an output
unit number, so we avoid calling them when NTRA < 0.
Patch originally written by Camm Maguire.
10 years ago
Sébastien Villemot
f704b8d32f
Fix CBLAS double complex level 2 tests
The SNAME variable contains names of C functions like "cblas_dgemv".
Apparently the code was not taking into account the 6-letter "cblas_"
prefix when determining the task to be done.
The issue does not affect c_{s,d,c}blat2.f, which use the correct
offset.
Patch originally written by Camm Maguire.
10 years ago
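The prefix problem described in the commit above can be sketched in a few lines. This is an illustration only: the function name is hypothetical, and the choice of the third letter of the routine name as the inspected character is an assumption, not a transcription of the Fortran test code.

```python
CBLAS_PREFIX = "cblas_"  # the 6-letter prefix mentioned in the commit message


def variant_letter(sname):
    """Return the letter of the routine name that selects the variant.

    Assumption for this sketch: the third letter of the bare routine name
    is the one inspected (e.g. 'e' in "zgemv", 'b' in "zgbmv").
    """
    # Skip the prefix when present; plain Fortran names have none.
    bare = sname[len(CBLAS_PREFIX):] if sname.startswith(CBLAS_PREFIX) else sname
    return bare[2]


def variant_letter_buggy(sname):
    """Same lookup without accounting for the prefix."""
    return sname[2]
```

With a name such as `"cblas_zgemv"`, the unadjusted lookup reads a letter from the prefix itself rather than from the routine name, so the test cannot tell which task to run.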
Jerome Robert
708ad330ac
stack alloc: Fix stack smashing detection in 32-bit builds
* Fix commit 87a2ccc
* Close #736
10 years ago
Werner Saar
c6a27bbe64
added benchmark tests for ssyrk and dsyrk
10 years ago
Zhang Xianyi
f16b4f10b6
Merge pull request #734 from jeromerobert/common_stackalloc
Factor out MAX_STACK_ALLOC code into common_stackalloc.h
10 years ago
Jerome Robert
87a2ccc37c
Factor out MAX_STACK_ALLOC code into common_stackalloc.h
Ref #727
10 years ago
Zhang Xianyi
e3e20e2242
Merge pull request #733 from yuyichao/arm-asm
Do not use vsub to clear the register values
10 years ago
Yichao Yu
594b9f4c73
Do not use vsub to clear the register values since it doesn't work with non-normal numbers.
10 years ago
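Why subtracting a register from itself is not a reliable way to zero it can be shown with ordinary IEEE 754 arithmetic. The sketch below uses Python floats as a stand-in for the ARM register contents. Reading "non-normal numbers" as NaN and infinity is an assumption here, and the function names are hypothetical.

```python
def clear_by_subtraction(x):
    """Mimic clearing a register by subtracting it from itself."""
    return x - x


def clear_by_assignment(x):
    """Clear by writing zero, regardless of the previous content."""
    return 0.0


def is_cleared(value):
    """True only if the value is exactly zero (NaN compares unequal to 0.0)."""
    return value == 0.0
```

For a finite input the subtraction yields zero, but `inf - inf` and `nan - nan` both produce NaN, so a register holding leftover garbage stays "dirty". Writing zero directly does not depend on the old content.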
wernsaar
c96c6a26fd
Merge pull request #732 from wernsaar/develop
added optimized trsm_kernels
10 years ago
Werner Saar
c8f2c5d636
added optimized trsm_kernels
10 years ago
Werner Saar
5f2fa15e04
include sched.h if OS is Android
10 years ago
Zhang Xianyi
7d144aaabc
Merge pull request #728 from jeromerobert/fix-no-stack-alloc
Fix make MAX_STACK_ALLOC=0
10 years ago
Jerome Robert
f9890a6452
Fix compilation when MAX_STACK_ALLOC is not set
Close #722
10 years ago
Jerome Robert
2c7143459f
Let make MAX_STACK_ALLOC=0 do what is expected
It's no longer required to modify Makefile.rule to disable
stack allocation. It's now possible to run:
make MAX_STACK_ALLOC=0
10 years ago
Zhang Xianyi
3857581adf
Merge pull request #726 from jeromerobert/amd-e2-3200
Fix detection of AMD E2-3200
10 years ago
Zhang Xianyi
e9754e6250
Merge pull request #725 from jeromerobert/make-nb-jobs
Allow forcing the number of parallel make jobs
10 years ago
Jerome Robert
76398c3233
Fix detection of AMD E2-3200
10 years ago
Jerome Robert
ba024fcfc0
Allow forcing the number of parallel make jobs
This is particularly useful when using distcc
10 years ago
Zhang Xianyi
b9b52c295d
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
10 years ago
Zhang Xianyi
285d042b10
Fixed rotg bug on ARM.
10 years ago
Zhang Xianyi
01db7908b8
Merge pull request #713 from btracey/patch-2
Fix Dormbr to perform the correct size operations with RowMajor
10 years ago
Zhang Xianyi
5f75df40d5
Merge pull request #711 from btracey/patch-1
Fix Dormlq to perform the correct size operations with RowMajor
10 years ago