Werner Saar
c99cc41cbd
Added optimized zgemv_n kernel for bulldozer, piledriver and steamroller
10 years ago
Werner Saar
c8f2c5d636
added optimized trsm_kernels
10 years ago
Werner Saar
95b1faf667
added optimized cscal and zscal kernels for steamroller and piledriver
10 years ago
Werner Saar
e00cccc41e
added optimized dscal kernel for piledriver
10 years ago
Werner Saar
34ba66606a
add optimized daxpy-kernel for piledriver
10 years ago
Werner Saar
331c417637
optimized saxpy for piledriver
10 years ago
Werner Saar
9299d8cfd6
added optimized cdot- and zdot-kernels for bulldozer
10 years ago
Werner Saar
60c6dec6e6
updated some lines for bulldozer
10 years ago
wernsaar
9908b6031c
bugfix in KERNEL.PILEDRIVER
11 years ago
wernsaar
8a39cdb1c1
added optimized zgemv_t kernel for haswell
11 years ago
wernsaar
80f7786875
enabled optimized sgemv kernels for piledriver
11 years ago
wernsaar
ca6c8d06ce
enabled optimized sgemv kernels for windows
11 years ago
wernsaar
95a8caa2f3
added optimized sgemv_t kernel
11 years ago
wernsaar
2bab92961f
enabled optimized sgemv_n kernels for windows
11 years ago
wernsaar
db6917303f
added a better optimized sgemv_n kernel for bulldozer and piledriver
11 years ago
wernsaar
2cce125c79
added optimized sgemv_t for bulldozer and piledriver
11 years ago
wernsaar
b3938fe371
don't use this sgemv_n on Windows
11 years ago
wernsaar
3c5732615d
added blocked sgemv_n and microkernel for bulldozer and piledriver
11 years ago
wernsaar
880597b301
segment violation in sgemv kernels
11 years ago
wernsaar
13348b2137
removed reference to daxpy_bulldozer kernel (Windows bug in lapack-test)
11 years ago
Zhang Xianyi
99efbbbad5
Fixed #395. Enable optimized cgemm for Sandybridge. Added optimized sdot kernel.
Fixed c/zgemm, zgemv computational error of haswell, piledriver, bulldozer, and
barcelona on Windows.
Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop
Conflicts:
kernel/Makefile.L1
kernel/x86_64/KERNEL
param.h
11 years ago
wernsaar
a15f22a1f6
bugfix for piledriver cgemm-, zgemm- and zgemv-kernel
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
wernsaar
a13bcc1716
enabled optimized sgemv kernel for barcelona and piledriver
11 years ago
wernsaar
5118a7f4d1
small optimizations on dgemm_kernel for Piledriver
12 years ago
wernsaar
e172b70ea2
added cgemm_kernel for Piledriver
12 years ago
wernsaar
1cf4b974b2
added zgemm_kernel for Piledriver
12 years ago
wernsaar
7bccff1512
added sgemm_kernel for PILEDRIVER
12 years ago
wernsaar
2840d56aeb
added dgemm_kernel for Piledriver
12 years ago
Zhang Xianyi
6c4a7d0828
Import AMD Piledriver DGEMM kernel generated by AUGEM.
So far, this kernel doesn't deal with edge.
AUGEM: Automatically Generate High Performance Dense Linear Algebra
Kernels on x86 CPUs.
Qian Wang, Xianyi Zhang, Yunquan Zhang, and Qing Yi. In the
International Conference for High Performance Computing, Networking,
Storage and Analysis (SC'13). Denver, CO. Nov, 2013.
12 years ago
Zhang Xianyi
886cbaf4e4
Support AMD Piledriver by bulldozer kernels.
12 years ago