wernsaar
880597b301
segment violation in sgemv kernels
11 years ago
wernsaar
0884b73c69
Lapack-test Windows 32bit now error free
11 years ago
wernsaar
9bd9472ae9
Lapack-test: cleanup of x86 32bit KERNEL file
11 years ago
wernsaar
c4a423a642
bugfixes for lapack on ARM Platform
11 years ago
wernsaar
13348b2137
removed reference to daxpy_bulldozer kernel (Windows bug in lapack-test)
11 years ago
wernsaar
9964ed2f79
bugfix for CORE2
11 years ago
wernsaar
d5b976f92d
fallback to zgemm_kernel_4x2_sse.S
11 years ago
wernsaar
f7267d9b0e
added missing definition for DUNNINGTON
11 years ago
wernsaar
e0c080a28c
removed reference to zgemm_kernel_4x2_sse3.S (bug in lapack-test)
11 years ago
wernsaar
e80b144932
enabled compiling of *3M functions
11 years ago
wernsaar
be94db096c
disabled *3M functions for x86_64 platforms
11 years ago
wernsaar
b079df9ef4
added optimized sdot- and dsdot-kernel, written in C
11 years ago
wernsaar
01a119abfc
enabled SMP for sbmv and zsbmv, but only for 64bit binaries
11 years ago
Zhang Xianyi
99efbbbad5
Fixed #395 . Enable optimized cgemm for Sandybridge. Added optimized sdot kernel.
Fixed c/zgemm, zgemv computational error of haswell, piledriver, bullldozer, and
barcelona on Windows.
Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop
Conflicts:
kernel/Makefile.L1
kernel/x86_64/KERNEL
param.h
11 years ago
wernsaar
22e5aee2dd
fixed zgemv bug for older AMD Processors
11 years ago
wernsaar
35d37e124f
bugfix for barcelona zgemv-kernel
11 years ago
wernsaar
d8ba46efdb
bugfix for bulldozer cgemm-, zgemm- and zgemv-kernel
11 years ago
wernsaar
a15f22a1f6
bugfix for piledriver cgemm-, zgemm- and zgemv-kernel
11 years ago
wernsaar
b94ea89f52
bugfix for haswell cgemm- and zgemm-kernel
11 years ago
wernsaar
35f668bb14
bugfix for cgemm_kernel_8x2_sandy.S
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
wernsaar
365e8de346
added optimized cgemm-kernel for SANDYBRIDGE
11 years ago
wernsaar
578d1b6219
added DSDOT definition and enabled optimized sdot kernel
11 years ago
wernsaar
dabab2b5f4
added new optimized sgemm kernel for SANDYBRIGE
11 years ago
wernsaar
aa2709c4e0
enabled optimized dgemm kernel for NEHALEM
11 years ago
wernsaar
a13bcc1716
enabled optimized sgemv kernel for barcelona and piledriver
11 years ago
wernsaar
d2c82d7543
enabled optimized sgemv kernel for HASWELL
11 years ago
wernsaar
0517672dd0
enabled optimized sgemv kernels for nehalem, sandybridge and bulldozer
11 years ago
wernsaar
23203d52c1
Ref #380 : lowered stack usage for haswell kernels
11 years ago
wernsaar
73545a79cd
Ref #380 : lowered stack usage for piledriver and bulldozer kernels
11 years ago
wernsaar
ff9cfca24c
Ref #385 : added missing return instruction
11 years ago
wernsaar
cee257f384
Ref #51 : added blas extensions zomatcopy and comatcopy
11 years ago
wernsaar
7bfb3011e8
Ref #51 : added blas extension somatcopy
11 years ago
wernsaar
8c8f596238
Ref #51 : added blas extension domatcopy as not opimized reference
11 years ago
wernsaar
faf3ac0aad
Ref #285 : added axpby kernels
11 years ago
Zhang Xianyi
406f5bd22b
Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop
Conflicts:
kernel/arm/KERNEL.ARMV6
11 years ago
wernsaar
aaddb05411
bugfix for ARMV6
11 years ago
wernsaar
e826a5a6af
some modifications regarding lapack test
11 years ago
wernsaar
c38379c9dd
bugfixes for ARM regarding lapack tests
11 years ago
wernsaar
a0b07c1440
bugfixs for ARM regarding lapack tests
11 years ago
wernsaar
43fbdb7a5a
added ARMV5 as reference platform
11 years ago
wernsaar
777cebc8c7
added ZERO check to zscal.c because bug in lapack-testing
11 years ago
wernsaar
aa5c73e20f
added ZERO check to zscal.c because bug in lapack-test
11 years ago
wernsaar
5e5ef28ca0
added ZERO check because bug in lapack-test
11 years ago
wernsaar
650ed34336
added ZERO check because bug in lapack-test
11 years ago
wernsaar
5f3b68b4d4
replaced sgemm and cgemm kernels because lapack bugs
11 years ago
wernsaar
2424af62fd
replaced dgemm-kernel because bug in lapack
11 years ago
wernsaar
793509a3b5
replaced files for sdot, sgemv_n and sgemv_t for bug #348
11 years ago
wernsaar
47b22763f8
reduced stack usage on windows to 16K
11 years ago
wernsaar
9db0fb8b02
bugfix for sdsdot
12 years ago