wernsaar
|
53ec5789e2
|
bugfix for Makefile
|
11 years ago |
wernsaar
|
95a707ced3
|
update of KERNEL.BULLDOZER
|
11 years ago |
wernsaar
|
5d97b0754c
|
added optimized sdot kernel for nehalem
|
11 years ago |
wernsaar
|
8a9e868919
|
added optimized sdot for bulldozer
|
11 years ago |
wernsaar
|
7e404de3de
|
bugfix in Makefile
|
11 years ago |
wernsaar
|
e4472ad850
|
added sdot and ddot benchmarks
|
11 years ago |
wernsaar
|
fb0b4552a5
|
added hemv benchmark
|
11 years ago |
wernsaar
|
6f73ffc114
|
added benchmarks for csymv and zsymv
|
11 years ago |
wernsaar
|
c8b0645266
|
added optimized symv_L kernels for nehalem
|
11 years ago |
wernsaar
|
ec05ff3f64
|
added optimized ssymv_L kernel for bulldozer
|
11 years ago |
wernsaar
|
f6f9122660
|
added optimized dsymv_L kernel for bulldozer
|
11 years ago |
wernsaar
|
8247f38dc1
|
added optimized dsymv_U kernel for nehalem
|
11 years ago |
wernsaar
|
ef6374196d
|
updated optimized dsymv_U kernel for bulldozer
|
11 years ago |
wernsaar
|
f824c2b751
|
updated optimized ssymv_U for bulldozer
|
11 years ago |
wernsaar
|
4ba4ab623f
|
added optimized ssymv_U kernel for nehalem
|
11 years ago |
wernsaar
|
4f39447c05
|
added optimized ssymv_U kernel for bulldozer
|
11 years ago |
wernsaar
|
74c9465672
|
added optimized dsymv_U kernel for bulldozer
|
11 years ago |
wernsaar
|
101dd08173
|
add reference in C for symv_U
|
11 years ago |
wernsaar
|
493d4fe7e5
|
added reference in C for symv_L
|
11 years ago |
wernsaar
|
0a22816e70
|
Ref #433: removed obsolete lapack entries from common_interface.h
|
11 years ago |
Zhang Xianyi
|
c3cd6e7e32
|
Merge pull request #434 from wernsaar/develop
A lot of performance enhancements
|
11 years ago |
wernsaar
|
11eab4c019
|
added optimized cgemv_n for haswell
|
11 years ago |
wernsaar
|
4568d32b6b
|
added optimized cgemv_t kernel for haswell
|
11 years ago |
wernsaar
|
c1a6374c6f
|
optimized zgemv_n kernel for sandybridge
|
11 years ago |
wernsaar
|
dc05937313
|
added additional test values
|
11 years ago |
wernsaar
|
2470129132
|
added fast return, if m or n < 1
|
11 years ago |
wernsaar
|
8c582d362d
|
optimized zgemv_t_microk_haswell-2.c
|
11 years ago |
wernsaar
|
11e34ddd1b
|
bugfix for zgemv_n_microk_haswell-2.c
|
11 years ago |
wernsaar
|
9528f0d9ee
|
bugfix in zgemv_n_microk_sandy-2.c
|
11 years ago |
wernsaar
|
b06550519e
|
added optimized cgemv_t c-kernel
|
11 years ago |
wernsaar
|
6093ee5363
|
bugfix in zgemv_n_microk_haswell-2.c
|
11 years ago |
wernsaar
|
07c66b1960
|
modified algorithm for better numerical stability
|
11 years ago |
wernsaar
|
58b075daef
|
added optimized zgemv_t kernel for haswell
|
11 years ago |
wernsaar
|
09fcd3a341
|
add optimized zgemv_t kernel for bulldozer
|
11 years ago |
wernsaar
|
726ad085cb
|
added optimized zgemv_t for haswell
|
11 years ago |
wernsaar
|
6fe416976d
|
added optimimized zgemv_t c-kernel
|
11 years ago |
wernsaar
|
dbc2eff029
|
disabled optimized haswell zgemv_n kernel for windows ( bad rounding )
|
11 years ago |
wernsaar
|
462b4885ff
|
added optimized zgemv_n kernel for haswell
|
11 years ago |
wernsaar
|
aa54fe064c
|
added zgemv_n c-function
|
11 years ago |
wernsaar
|
006ef3ea01
|
added optimized dgemv_t kernel for haswell
|
11 years ago |
wernsaar
|
60f17628cc
|
added optimized dgemv_n kernel for haswell
|
11 years ago |
wernsaar
|
c9bad1403a
|
added optimized sgemv_t kernel for sandybridge
|
11 years ago |
wernsaar
|
2f8927376f
|
enabled optimized nehalem sgemv_t kernel for windows
|
11 years ago |
wernsaar
|
d945a2b06d
|
added optimized sgemv_t kernel for nehalem
|
11 years ago |
wernsaar
|
ca6c8d06ce
|
enabled optimized sgemv kernels for windows
|
11 years ago |
wernsaar
|
7aa43c8928
|
enabled optimized sgemv kernels for windows
|
11 years ago |
wernsaar
|
891b960854
|
added optimized sgemv_t kernel for haswell
|
11 years ago |
wernsaar
|
95a8caa2f3
|
added optimized sgemv_t kernel
|
11 years ago |
Zhang Xianyi
|
5c0d0ecbde
|
Merge pull request #430 from wernsaar/develop
added a better optimized sgemv_n kernel
|
11 years ago |
wernsaar
|
8c05b8105b
|
bugfix in sgemv_n.c
|
11 years ago |