Zhang Xianyi
ef75be0e51
Merge pull request #487 from kortschak/dromtg-test
Add test for drotmg bug fixed by 692b14c
11 years ago
kortschak
5344f335a8
Add test for drotmg bug fixed by 692b14c
Test requested in issue xianyi/OpenBLAS#484 .
Run tests by applying the following change and then make:
diff --git a/Makefile.rule b/Makefile.rule
index bea1fe1..9852ff3 100644
--- a/Makefile.rule
+++ b/Makefile.rule
@@ -140,7 +140,7 @@ NO_AFFINITY = 1
-# UTEST_CHECK = 1
+UTEST_CHECK = 1
11 years ago
Hank Anderson
0f6bec0a32
cmake.prebuild now compiles getarch.
Doesn't actually run it yet.
11 years ago
Hank Anderson
92cdac5f87
Added MSVC functions to cpuid_x86.c to replace gcc-specific ASM.
11 years ago
Hank Anderson
1a41022e3e
Added MSVC defines to cpuid.h and getarch.c.
11 years ago
Zhang Xianyi
5cb5af9333
Add configuration options.
11 years ago
Zhang Xianyi
41aad0407f
Merge pull request #482 from jeromerobert/develop
Allow to do gemv and ger buffer allocation on the stack
11 years ago
Hank Anderson
e5c47e44f6
First pass at converting a few makefiles to CMake.
11 years ago
Zhang Xianyi
f8f2e84659
Merge pull request #486 from wernsaar/develop
Optimizations for steamroller
11 years ago
Werner Saar
34633fef01
Merge branch 'develop' of github.com:wernsaar/OpenBLAS into develop
11 years ago
Werner Saar
ddf983d643
added optimizations for steamroller
11 years ago
Zhang Xianyi
17b9db20f1
Merge pull request #483 from wernsaar/develop
added Steamroller as a cpu target
11 years ago
Werner Saar
0dc559ed30
bugfix in dynamic.c
11 years ago
Werner Saar
9566f5fdb0
added Steamroller as a target processor
11 years ago
Werner Saar
4319769b79
added target processor STEAMROLLER
11 years ago
Jerome Robert
e9d9a8eae3
Allow to do gemv and ger buffer allocation on the stack
ger and gemv call blas_memory_alloc/free which in their turn
call blas_lock. blas_lock create thread contention when matrices
are small and the number of thread is high enough. We avoid
call blas_memory_alloc by replacing it with stack allocation.
This can be enabled with:
make -DMAX_STACK_ALLOC=2048
The given size (in byte) must be high enough to avoid thread contention
and small enough to avoid stack overflow.
Fix #478
11 years ago
Zhang Xianyi
cbb3ab80e7
Merge pull request #481 from eschnett/develop
Correct ilaver C declaration
11 years ago
Erik Schnetter
cd9868b1b4
Correct ilaver C declaration
11 years ago
Zhang Xianyi
eb738148fe
Merge pull request #479 from wernsaar/develop
workaround for sandybridge zgemm kernel
11 years ago
Werner Saar
587e16fba3
Ref #458 : Backport, sandybrigde uses nehalem zgemm kernel
11 years ago
Werner Saar
4de7b9ae47
increased NMAX to 128
11 years ago
Werner Saar
887aed634d
modified sources for OS Darwin
11 years ago
Werner Saar
6261342de3
small optimization on dgemm_kernel for N=1
11 years ago
Werner Saar
1e566223ed
added code for the size of n
11 years ago
Werner Saar
113b48ca22
modified makefile for acml6.1
11 years ago
Zhang Xianyi
3e81c99b6b
Fixed installation bug on Mac OSX.
11 years ago
Werner Saar
ec85c4a51d
Increased the Threshold value in sep.in
11 years ago
Werner Saar
97de657d38
added tests to sep.as as workaround for gfortran-4.8.x
11 years ago
Zhang Xianyi
71966eba6c
Merge pull request #475 from xantares/patch-2
add OpenBLAS_VERSION to cmake config file
11 years ago
Zhang Xianyi
a359979e17
Merge pull request #474 from xantares/patch-1
set OPENBLAS_CMAKE_DIR to <prefix>/lib/cmake/<package_name>
11 years ago
xantares
7a6a141bc4
add OpenBLAS_VERSION to cmake config file
11 years ago
xantares
b8ff6892f6
set OPENBLAS_CMAKE_DIR to <prefix>/lib/cmake/<package_name>
usually these files are more often located in this subdir
11 years ago
Zhang Xianyi
8fe7a9ce6f
Merge pull request #473 from wernsaar/develop
changed inline assembler labels to short form
11 years ago
Werner Saar
bc5fff7085
changed inline assembler labels to short form
11 years ago
Zhang Xianyi
51ce5ef447
Merge branch 'develop'
11 years ago
Zhang Xianyi
1943ea91a8
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
11 years ago
Zhang Xianyi
37aee1f9b1
Merge branch 'develop'
11 years ago
Zhang Xianyi
f5424fc9de
Update the doc for 0.2.13 version.
11 years ago
Zhang Xianyi
0cf29ba6d2
Fixed a bug of sgemm sandy bridge kernel.
Reported by Julia project. JuliaLang/julia#9084
11 years ago
Zhang Xianyi
50e18033e6
Merge pull request #471 from nolta/patch-4
c_check: set $hostarch to x86_64 instead of amd64
11 years ago
Zhang Xianyi
551b55d1c7
Merge pull request #470 from nolta/patch-3
fix fortran compiler detection on FreeBSD
11 years ago
Mike Nolta
271ceb8bae
c_check: set $hostarch to x86_64 instead of amd64
`uname -m` returns "amd64" on some systems.
11 years ago
Mike Nolta
5f846be2e4
fix fortran compiler detection on FreeBSD
On FreeBSD, passing extra options to `which` causes it to report a non-zero status:
```
$ which gfortran48 -m64
/usr/local/bin/gfortran48
$ echo $?
1
```
```
$ which gfortran48
/usr/local/bin/gfortran48
$ echo $?
0
```
11 years ago
Zhang Xianyi
fe7dcf98f3
Refs #461 . Provide OpenBLASConfig.cmake to support CMake.
If you "make PREFIX=/path/to/OpenBLAS install" ,
The config file will be located in /path/to/OpenBLAS/cmake
Then, you can use "find_package(OpenBLAS)" at CMake.
cmake -DOpenBLAS_DIR=/path/to/OpenBLAS/cmake ..
11 years ago
Zhang Xianyi
2fb02626da
Update organization info.
11 years ago
Zhang Xianyi
a85c2785ae
Refs #467 . Added generic kernel file for x86_64.
11 years ago
Zhang Xianyi
4806715c97
Fixed #456 . Merged the optimizations for APM's
xgene-1 (aarch64).
Merge branch 'benedikt-huber-dave-patch' into develop
11 years ago
Benedikt Huber
58c90d5937
# The first commit's message is:
Optimizations for APM's xgene-1 (aarch64).
1) general system updates to support armv8 better. Make all did not work, one needed to supply TARGET=ARMV8.
2) sgem 4x4 kernel in assembler using SIMD, and configuration changes to use it.
3) strmm 4x4 kernel in C. Since the sgem kernel does 4x4, the trmm kernel must also do 4xN.
Added Dave Nuechterlein to the contributors list.
11 years ago
Zhang Xianyi
2987bc7b40
refs #464 . Fixed the bug of detecting L2 associative on x86.
11 years ago
Zhang Xianyi
695e0fa649
#463 fixed a compiling bug on AIX.
11 years ago