wernsaar
ca6c8d06ce
enabled optimized sgemv kernels for windows
11 years ago
wernsaar
7aa43c8928
enabled optimized sgemv kernels for windows
11 years ago
wernsaar
891b960854
added optimized sgemv_t kernel for haswell
11 years ago
wernsaar
95a8caa2f3
added optimized sgemv_t kernel
11 years ago
wernsaar
8c05b8105b
bugfix in sgemv_n.c
11 years ago
wernsaar
c80084a98f
changed default x86_64 sgemv_n kernel to sgemv_n.c
11 years ago
wernsaar
2bab92961f
enabled optimized sgemv_n kernels for windows
11 years ago
wernsaar
9175b8bd5f
changed long to blaslong for windows compatibility
11 years ago
wernsaar
793f2d43b0
added optimized sgemv_n kernel for nehalem
11 years ago
wernsaar
a4dde45f87
optimized sgemv_n kernel for sandybridge
11 years ago
wernsaar
7fa7ea3e1e
updated haswell optimized sgmv_n kernel
11 years ago
wernsaar
3fbc13eb65
modified sgemv_n for haswell
11 years ago
wernsaar
db6917303f
added a better optimized sgemv_n kernel for bulldozer and piledriver
11 years ago
wernsaar
5087096711
optimization of sandybridge cgemm-kernel
11 years ago
wernsaar
46bc4fd50c
optimized cgemm kernel for haswell
11 years ago
wernsaar
1cc02b4337
optimized sgemm kernel for haswell
11 years ago
wernsaar
1d33547222
optimized zgemm kernel for haswell
11 years ago
wernsaar
125610d23b
allow to set custom value for ?GEMM_DEFAULT_UNROLL_MN, optimizations for syrk
11 years ago
wernsaar
6acbafe45b
added sgemv_n microkernel for haswell
11 years ago
wernsaar
5392d11b04
optimized sgemv_n_microk_sandy.c
11 years ago
wernsaar
c0fe95fb72
added sgemv_n microkernel for sandybridge
11 years ago
wernsaar
d9d4077c93
added sgemv_t microkernel for haswell
11 years ago
wernsaar
02eb72ac42
bugfix in sgemv_t_microk_sandy.c
11 years ago
wernsaar
c06f9986d4
added sgemv_t microkernel for sandybridge
11 years ago
wernsaar
2cce125c79
added optimized sgemv_t for bulldozer and piledriver
11 years ago
wernsaar
b3938fe371
don't use this sgemv_n on Windows
11 years ago
wernsaar
c8a4a56177
performance optimizations for sgemv_n
11 years ago
wernsaar
3c5732615d
added blocked sgemv_n and microkernel for bulldozer and piledriver
11 years ago
wernsaar
880597b301
segment violation in sgemv kernels
11 years ago
wernsaar
0884b73c69
Lapack-test Windows 32bit now error free
11 years ago
wernsaar
9bd9472ae9
Lapack-test: cleanup of x86 32bit KERNEL file
11 years ago
wernsaar
c4a423a642
bugfixes for lapack on ARM Platform
11 years ago
wernsaar
13348b2137
removed reference to daxpy_bulldozer kernel (Windows bug in lapack-test)
11 years ago
wernsaar
9964ed2f79
bugfix for CORE2
11 years ago
wernsaar
d5b976f92d
fallback to zgemm_kernel_4x2_sse.S
11 years ago
wernsaar
f7267d9b0e
added missing definition for DUNNINGTON
11 years ago
wernsaar
e0c080a28c
removed reference to zgemm_kernel_4x2_sse3.S (bug in lapack-test)
11 years ago
wernsaar
e80b144932
enabled compiling of *3M functions
11 years ago
wernsaar
be94db096c
disabled *3M functions for x86_64 platforms
11 years ago
wernsaar
b079df9ef4
added optimized sdot- and dsdot-kernel, written in C
11 years ago
wernsaar
01a119abfc
enabled SMP for sbmv and zsbmv, but only for 64bit binaries
11 years ago
Zhang Xianyi
99efbbbad5
Fixed #395 . Enable optimized cgemm for Sandybridge. Added optimized sdot kernel.
Fixed c/zgemm, zgemv computational error of haswell, piledriver, bullldozer, and
barcelona on Windows.
Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop
Conflicts:
kernel/Makefile.L1
kernel/x86_64/KERNEL
param.h
11 years ago
wernsaar
22e5aee2dd
fixed zgemv bug for older AMD Processors
11 years ago
wernsaar
35d37e124f
bugfix for barcelona zgemv-kernel
11 years ago
wernsaar
d8ba46efdb
bugfix for bulldozer cgemm-, zgemm- and zgemv-kernel
11 years ago
wernsaar
a15f22a1f6
bugfix for piledriver cgemm-, zgemm- and zgemv-kernel
11 years ago
wernsaar
b94ea89f52
bugfix for haswell cgemm- and zgemm-kernel
11 years ago
wernsaar
35f668bb14
bugfix for cgemm_kernel_8x2_sandy.S
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
wernsaar
365e8de346
added optimized cgemm-kernel for SANDYBRIDGE
11 years ago