wernsaar
6acbafe45b
added sgemv_n microkernel for haswell
11 years ago
wernsaar
5392d11b04
optimized sgemv_n_microk_sandy.c
11 years ago
wernsaar
c0fe95fb72
added sgemv_n microkernel for sandybridge
11 years ago
wernsaar
d9d4077c93
added sgemv_t microkernel for haswell
11 years ago
wernsaar
02eb72ac42
bugfix in sgemv_t_microk_sandy.c
11 years ago
wernsaar
c06f9986d4
added sgemv_t microkernel for sandybridge
11 years ago
wernsaar
2cce125c79
added optimized sgemv_t for bulldozer and piledriver
11 years ago
wernsaar
b3938fe371
don't use this sgemv_n on Windows
11 years ago
wernsaar
c8a4a56177
performance optimizations for sgemv_n
11 years ago
wernsaar
3c5732615d
added blocked sgemv_n and microkernel for bulldozer and piledriver
11 years ago
wernsaar
880597b301
segment violation in sgemv kernels
11 years ago
wernsaar
0884b73c69
Lapack-test Windows 32bit now error free
11 years ago
wernsaar
9bd9472ae9
Lapack-test: cleanup of x86 32bit KERNEL file
11 years ago
wernsaar
c4a423a642
bugfixes for lapack on ARM Platform
11 years ago
wernsaar
13348b2137
removed reference to daxpy_bulldozer kernel (Windows bug in lapack-test)
11 years ago
wernsaar
9964ed2f79
bugfix for CORE2
11 years ago
wernsaar
d5b976f92d
fallback to zgemm_kernel_4x2_sse.S
11 years ago
wernsaar
f7267d9b0e
added missing definition for DUNNINGTON
11 years ago
wernsaar
e0c080a28c
removed reference to zgemm_kernel_4x2_sse3.S (bug in lapack-test)
11 years ago
wernsaar
e80b144932
enabled compiling of *3M functions
11 years ago
wernsaar
be94db096c
disabled *3M functions for x86_64 platforms
11 years ago
wernsaar
b079df9ef4
added optimized sdot- and dsdot-kernel, written in C
11 years ago
wernsaar
01a119abfc
enabled SMP for sbmv and zsbmv, but only for 64bit binaries
11 years ago
Zhang Xianyi
99efbbbad5
Fixed #395. Enable optimized cgemm for Sandybridge. Added optimized sdot kernel.
Fixed c/zgemm, zgemv computational error of haswell, piledriver, bulldozer, and
barcelona on Windows.
Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop
Conflicts:
kernel/Makefile.L1
kernel/x86_64/KERNEL
param.h
11 years ago
wernsaar
22e5aee2dd
fixed zgemv bug for older AMD Processors
11 years ago
wernsaar
35d37e124f
bugfix for barcelona zgemv-kernel
11 years ago
wernsaar
d8ba46efdb
bugfix for bulldozer cgemm-, zgemm- and zgemv-kernel
11 years ago
wernsaar
a15f22a1f6
bugfix for piledriver cgemm-, zgemm- and zgemv-kernel
11 years ago
wernsaar
b94ea89f52
bugfix for haswell cgemm- and zgemm-kernel
11 years ago
wernsaar
35f668bb14
bugfix for cgemm_kernel_8x2_sandy.S
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
wernsaar
365e8de346
added optimized cgemm-kernel for SANDYBRIDGE
11 years ago
wernsaar
578d1b6219
added DSDOT definition and enabled optimized sdot kernel
11 years ago
wernsaar
dabab2b5f4
added new optimized sgemm kernel for SANDYBRIDGE
11 years ago
wernsaar
aa2709c4e0
enabled optimized dgemm kernel for NEHALEM
11 years ago
wernsaar
a13bcc1716
enabled optimized sgemv kernel for barcelona and piledriver
11 years ago
wernsaar
d2c82d7543
enabled optimized sgemv kernel for HASWELL
11 years ago
wernsaar
0517672dd0
enabled optimized sgemv kernels for nehalem, sandybridge and bulldozer
11 years ago
wernsaar
23203d52c1
Ref #380 : lowered stack usage for haswell kernels
11 years ago
wernsaar
73545a79cd
Ref #380 : lowered stack usage for piledriver and bulldozer kernels
11 years ago
wernsaar
ff9cfca24c
Ref #385 : added missing return instruction
11 years ago
wernsaar
cee257f384
Ref #51 : added blas extensions zomatcopy and comatcopy
11 years ago
wernsaar
7bfb3011e8
Ref #51 : added blas extension somatcopy
11 years ago
wernsaar
8c8f596238
Ref #51 : added blas extension domatcopy as non-optimized reference
11 years ago
wernsaar
faf3ac0aad
Ref #285 : added axpby kernels
11 years ago
Zhang Xianyi
406f5bd22b
Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop
Conflicts:
kernel/arm/KERNEL.ARMV6
11 years ago
wernsaar
aaddb05411
bugfix for ARMV6
11 years ago
wernsaar
e826a5a6af
some modifications regarding lapack test
11 years ago
wernsaar
c38379c9dd
bugfixes for ARM regarding lapack tests
11 years ago
wernsaar
a0b07c1440
bugfixes for ARM regarding lapack tests
11 years ago