Martin Kroeker
|
c7b55b6082
|
Merge pull request #1499 from quickwritereader/develop
Implemented missing vsx simd kernels for power8 blas1/2 double. z13 modifications
|
7 years ago |
QWR QWR
|
28ca97015d
|
power8:Added initial zgemv_(t|n) ,i(d|z)amax,i(d|z)amin,dgemv_t(transposed),zrot
z13: improved zgemv_(t|n)_4,zscal,zaxpy
|
8 years ago |
Martin Kroeker
|
22167170b3
|
Merge pull request #1477 from quickwritereader/develop
Power8 blas3 copy-pack routines
|
8 years ago |
Martin Kroeker
|
58f236ad73
|
Use generic/dot.c for DSDOT on zarch
|
8 years ago |
Martin Kroeker
|
e207107150
|
Use generic/dot.c for DSDOT on z13
The implementation in arm/dot.c has lower precision, as shown by the utest for dsdot.
|
8 years ago |
the mslm
|
c5425daa6b
|
power8 ?gemm_tcopy save/restore
|
8 years ago |
Abdelrauf
|
60596a1abc
|
Merge branch 'develop' into develop
|
8 years ago |
Abdelrauf
|
afd514c25d
|
small fix inside ifdef z13mvc . (z13mvc code is not used in production)
|
8 years ago |
Martin Kroeker
|
f45776ec1f
|
Merge pull request #1440 from quickwritereader/develop
small corrections
|
8 years ago |
Abdelrauf
|
f653e7a18d
|
small fix
small fix inside ifdef z13mvc . (z13mvc code is not used in production)
|
8 years ago |
the mslm
|
f946a89432
|
zscal (case: real alpha=0 ) mikrokernel shift&mem fix , da_i as input reg. small typo fixes
|
8 years ago |
Martin Kroeker
|
e4c71a799a
|
Merge pull request #1426 from quickwritereader/develop
(Z13 ) Blas1 mikrokernels can be inlined by gcc. Refactoring,fixes,tunings
|
8 years ago |
the mslm
|
2619ad7ea5
|
Blas1 mikrokernels can be inlined by gcc. Refactoring ( symbolic operan
names). Some fixes and tunings
|
8 years ago |
Martin Kroeker
|
3d23f45107
|
Merge pull request #1415 from quickwritereader/develop
(Z systems Z13) small fixes, some (i(dz)amin,i(dz)amax,(dz)dot,(dz)asum) mikrokernels…
|
8 years ago |
Abdelrauf
|
87669d1c0a
|
small fixes, some (i(dz)amin,i(dz)amax,(dz)dot,(dz)asum) mikrokernels can be inlined
|
8 years ago |
Andrew
|
7e9b29b9b8
|
fix spurious compiler warning (no code change)
|
8 years ago |
Abdurrauf
|
1cfdb2295d
|
Optimized standard Blas Level-1,2 (excluding nrm2 functions) for z13 (double precision)
|
8 years ago |
Abdurrauf
|
08786c4b95
|
strmm and ctrmm
|
9 years ago |
Abdurrauf
|
82e80fa82b
|
initial strmm(sgemm). not tuned yet
|
9 years ago |
Abdurrauf
|
e831d6924e
|
changed to conventional register save area
|
9 years ago |
Abdurrauf
|
848cb27b1e
|
ztrmm kernel.
|
9 years ago |
Abdurrauf
|
6418667818
|
dtrmm and dgemm for z13
|
9 years ago |
Zhang Xianyi
|
dd43661cfd
|
Init IBM z system (s390x) porting.
|
9 years ago |