Martin Kroeker
|
a6e9e0b94b
|
Remove explicit include of complex.h
|
9 years ago |
Zhang Xianyi
|
515bc56ea9
|
Refs #946. Use nrm2 reference implementation for Power8.
|
9 years ago |
Zhang Xianyi
|
ae70b916f4
|
Refs #929. Deal with zero and NaNs for scale.
|
9 years ago |
Werner Saar
|
412bcd187a
|
optimized dtrsm_logic_LT_16x4_power8.S and dtrsm_macros_LT_16x4_power8.S
|
9 years ago |
Werner Saar
|
8b140220c8
|
optimized dtrsm_kernel_LT for POWER8
|
9 years ago |
Werner Saar
|
8fb5a1aaff
|
added optimized dtrsm_LT kernel for POWER8
|
9 years ago |
Werner Saar
|
6a2bde7a2d
|
optimized dgemm and dgetrf for POWER8
|
9 years ago |
Werner Saar
|
8310d4d3f7
|
optimized dgemm for 20 threads
|
9 years ago |
Werner Saar
|
56948dbf0f
|
optimized dgemm for POWER8
|
9 years ago |
Werner Saar
|
0d0c6f7d7d
|
optimized dgemm for POWER8
|
9 years ago |
Werner Saar
|
a3da10662f
|
added sgemm_tcopy_8_power8.S
|
9 years ago |
Werner Saar
|
d46f07bb4e
|
added cgemm_tcopy_8_power8.S
|
9 years ago |
Werner Saar
|
879a51165f
|
Optimized zgemm and tested zgemm again
|
9 years ago |
Werner Saar
|
9276c9012f
|
Optimized sgemm and dgemm and tested again.
|
9 years ago |
Werner Saar
|
0001260f4b
|
optimized sgemm
|
9 years ago |
Werner Saar
|
3c6294ca3d
|
added optimized sgemm_tcopy for power8
|
9 years ago |
Werner Saar
|
e173c51c04
|
updated zgemm- and ztrmm-kernel for POWER8
|
9 years ago |
Werner Saar
|
9c42f0374a
|
Updated cgemm- and sgemm-kernel for POWER8 SMP
|
9 years ago |
Werner Saar
|
a51102e9b7
|
bugfixes for sgemm- and cgemm-kernel
|
9 years ago |
Werner Saar
|
c5b1fbcb2e
|
updated optimized cgemm- and ctrmm-kernel for POWER8
|
9 years ago |
Werner Saar
|
d4c0330967
|
updated cgemm- and ctrmm-kernel for POWER8
|
9 years ago |
Werner Saar
|
6a9bbfc227
|
updated sgemm- and strmm-kernel for POWER8
|
9 years ago |
Werner Saar
|
68a69c5b50
|
added optimized dgemv_n kernel for POWER8
|
9 years ago |
Werner Saar
|
c2464a7c4a
|
added optimized casum kernel for POWER8
|
9 years ago |
Werner Saar
|
294f933869
|
added optimized zasum kernel for POWER8
|
9 years ago |
Werner Saar
|
f59c9bd6ef
|
added optimized sasum kernel for POWER8
|
9 years ago |
Werner Saar
|
c53be46d78
|
added optimized dasum kernel for POWER8
|
9 years ago |
Werner Saar
|
659ed16591
|
added optimized cswap and zswap kernels for POWER8
|
9 years ago |
Werner Saar
|
35c98a3556
|
added optimized zscal kernel for POWER8
|
9 years ago |
Werner Saar
|
f1a5dd06c5
|
added optimized sscal kernel for POWER8
|
9 years ago |
Werner Saar
|
35f1f21a7f
|
added drot- and srot-kernel optimized for POWER8
|
9 years ago |
Werner Saar
|
3d9a50e841
|
added optimized sswap kernel for POWER8
|
9 years ago |
Werner Saar
|
828c849b44
|
added optimized ccopy kernel for POWER8
|
9 years ago |
Werner Saar
|
ecc0bc9813
|
added optimized scopy kernel for POWER8
|
9 years ago |
Werner Saar
|
12f209b7b0
|
added optimized zswap kernel for POWER8
|
9 years ago |
Werner Saar
|
7316a87930
|
added optimized dswap kernel for POWER8
|
9 years ago |
Werner Saar
|
0bff057a87
|
added optimized dcopy kernel for POWER8
|
9 years ago |
Werner Saar
|
1e6cf9808c
|
added optimized dscal kernel for POWER8
|
9 years ago |
Werner Saar
|
55eda3813b
|
added optimized zaxpy kernel for POWER8
|
10 years ago |
Werner Saar
|
0664ba4c97
|
added optimized daxpy kernel for POWER8
|
10 years ago |
Werner Saar
|
11c44dede1
|
added optimized sdot kernel for POWER8
|
10 years ago |
Werner Saar
|
9e4584d069
|
added optimized zdot kernel for POWER8
|
10 years ago |
Werner Saar
|
cd9fafc054
|
ddot for POWER8: updated licence information
|
10 years ago |
Werner Saar
|
84b92e6373
|
added optimized ddot kernel for POWER8
|
10 years ago |
Werner Saar
|
e1df5a6e23
|
fixed sgemm- and strmm-kernel
|
10 years ago |
Werner Saar
|
5c658f8746
|
add optimized cgemm- and ctrmm-kernel for POWER8
|
10 years ago |
Werner Saar
|
dcd15b546c
|
BUGFIX: KERNEL.POWER8
|
10 years ago |
Werner Saar
|
96284ab295
|
added sgemm- and strmm-kernel for POWER8
|
10 years ago |
Werner Saar
|
cd5241d0cf
|
modified KERNEL for power, to use the generic DSDOT-KERNEL
|
10 years ago |
Werner Saar
|
085f215257
|
Modified assembly label names so that they are hidden.
Added license information.
|
10 years ago |