Werner Saar
91e1c5080c
modified configuration, to use power6 sgemm kernel for power8
10 years ago
Werner Saar
73f04c2c72
enabled hemv assemly function for power8
10 years ago
Werner Saar
3e633152c6
enabled symv assembly kernels on power8
10 years ago
Werner Saar
d5130ce7e3
enabled gemv assembly on power8
10 years ago
Werner Saar
4824b88fcb
enabled all level1 assembly kernels for power8
10 years ago
Werner Saar
cc26d888b8
BUGFIX: increased BUFFER_SIZE for POWER8
10 years ago
Werner Saar
b752858d6c
added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8
10 years ago
Zhang Xianyi
3e8d6ea74f
Init POWER8 kernels by POWER6.
10 years ago
Zhang Xianyi
be4e5fcd20
Fixed #778 . Merge branch 'buffer51-develop' into develop
10 years ago
buffer51
855e0cb700
Restored LAPACK_COMPLEX_STRUCTURE for Android prior to 21. Refs #682 .
10 years ago
buffer51
7f7d04dcd2
Fixed linking error when compiling ARMv7 for Android (disabled -lpthread and added -Wl,--no-warn-mismatch).
10 years ago
buffer51
4e1b521e27
Fix lapack complex implementation of lauu2 and potf2 for Android (use FLOAT instead of FLOAT[2] as imaginary part is not used).
10 years ago
Zhang Xianyi
a1a96589aa
Fixed #773 blas_quickdivide bug on CMake and Visual Studio x86 32-bit.
10 years ago
Zhang Xianyi
0e68beb89f
Fixed #711 , #698 . Merge branch 'byzhang-develop' into develop
10 years ago
Zhang Xianyi
926ba8b7ca
Merge branch 'develop' of https://github.com/byzhang/OpenBLAS into byzhang-develop
10 years ago
Zhang Xianyi
9f080c47e1
Merge pull request #743 from tkelman/patch-1
re enable Fortran optimization flag on windows
10 years ago
Zhang Xianyi
52eba814ce
Fixed #769 . Merge branch 'martin-frbg-develop' into develop
10 years ago
Martin Kroeker
935356c34f
Update dynamic.c and cpuid_x86.c for Intel Avoton.
Second part of "support Intel Avoton via Nehalem kernel"
10 years ago
Zhang Xianyi
ff9388d625
Refs #768 . Swap the result of zdot x87 fp kernel.
10 years ago
Martin Kroeker
4f05c23673
Update cpuid_x86.c
Add recognition of Intel Atom C27xx (Avoton, model code 4D)
10 years ago
Benyu Zhang
4a1263f609
Fix the source paths
10 years ago
Zhang Xianyi
962376664d
Refs #768 . Swap the result of zdot x87 fp kernel.
10 years ago
Tony Kelman
5fef0d1b75
re enable Fortran optimization flag on windows
partial revert of 299cdcdc29
from #696 , was not explained why that was needed
10 years ago
Zhang Xianyi
578f471808
Fix utest bug when INTERFACE64=1.
10 years ago
Zhang Xianyi
5a8447e97e
Use ctest.h for unit test. Enable unit test on travis CI.
10 years ago
Zhang Xianyi
be95bdaf47
Detect ARMV8 on 32-bit mode by using ARMV7 kernels.
10 years ago
Zhang Xianyi
c44ff4d648
Refs #714 . avoid compiling warnings.
10 years ago
Zhang Xianyi
e003a1294c
Merge pull request #764 from martin-frbg/develop
Update Makefile.system to fix awk/nawk issue #763
10 years ago
Martin Kroeker
44062517eb
Update Makefile.system
Define AWK as "nawk" for SunOS (actually Illumos) only - fixes #763
10 years ago
Zhang Xianyi
13f0f8c10e
Refs #723 . Avoid out of boundary for getf2.
10 years ago
Zhang Xianyi
f5df444ceb
Merge pull request #762 from jeromerobert/bug760
Let openblas_get_num_threads return the number of active threads
10 years ago
Zhang Xianyi
e382713423
Merge pull request #759 from jeromerobert/bug742
Bug742
10 years ago
Zhang Xianyi
aaa8551c57
Merge pull request #749 from lotheac/illumos_fixes
illumos fixes
10 years ago
Jerome Robert
0d87c1ffb6
Let openblas_get_num_threads return the number of active threads
... not the number of allocated threads.
Close #760
10 years ago
wernsaar
0b194426f8
Merge pull request #761 from wernsaar/develop
Ref #740 : all assembly codes now clear floating point register correctly
10 years ago
Werner Saar
63a7d7fb24
updated gemv_n_vfpv3.S for armv7
10 years ago
Werner Saar
b4ede558a5
updated nrm2 kernel for armv7
10 years ago
Werner Saar
de3e2d4349
updated trmm kernels for armv7
10 years ago
Werner Saar
a0e51e96f1
updated gemm kernels for armv7
10 years ago
Lauri Tirkkonen
d6afac9624
don't pass -Y at all to the linker on illumos
the illumos linker can't understand the "-Y/lib"... form that f_check
generates, and -Wl cannot pass options that include commas
10 years ago
Werner Saar
c2891330bc
updated KERNEL.ARMV6
10 years ago
Werner Saar
ceaa931e48
updated gemv kernel for armv6
10 years ago
Werner Saar
eaa63165df
updated cgemv and zgemv kernels for armv6
10 years ago
Werner Saar
c65357c566
updated trmm_kernels for armv6
10 years ago
Werner Saar
e63e9f9f26
updated gemm_kernels for armv6
10 years ago
Jerome Robert
1fe3aab047
Use GEMM_MULTITHREAD_THRESHOLD as a number of ops
...not a matrix size. For GEMM_MULTITHREAD_THRESHOLD=4
(the default value) this does not change anything but
for other values it make the GEMM and GEMV thresholds
changing in the same way.
Close #742
10 years ago
Werner Saar
aafd3ab60e
updated cdot and zdot on arm
10 years ago
Jerome Robert
1a1935507b
[z]ger: increase multithread threshold
The ones given in 3ae30cd was by far to low because I
mixed m and m*n in my measures. Note that the new ones
are closed to the [z]gemv ones which is comforting
that both are right.
10 years ago
Werner Saar
d2f84c9c8a
Ref #740 : updated nrm2_vfp.S
10 years ago
Werner Saar
ca32253f32
Ref #740 : updated asum_vfp.S and iamax_vfp.S
10 years ago