Werner Saar
|
133c11a156
|
updated dgemv_n kernel for nehalem
|
10 years ago |
Werner Saar
|
30f52d53df
|
optimized dgemv_n kernel for haswell
|
10 years ago |
Werner Saar
|
5e83d80725
|
optimized dger kernel for sandybridge
|
10 years ago |
Werner Saar
|
b2e1797dc6
|
added optimized sger kernel for sandybridge
|
10 years ago |
Werner Saar
|
e216f686cb
|
optimized saxpy and daxpy for sandybridge
|
10 years ago |
Werner Saar
|
fc0e0391f3
|
bugfixes: replaced int with BLASLONG
|
10 years ago |
Werner Saar
|
c22068c406
|
optimized sdot.c for increments != 1
|
10 years ago |
Werner Saar
|
dee100d0e4
|
optimized saxpy.c for increments != 1
|
10 years ago |
Werner Saar
|
0273966abb
|
optimized daxpy kernel for increments != 1
|
10 years ago |
Werner Saar
|
3a67daa954
|
optimized ddot.c for increments != 1
|
10 years ago |
Werner Saar
|
b4f2153dcd
|
added optimized ssymv kernels for sandybridge
|
10 years ago |
Werner Saar
|
1c4b0eeae3
|
added optimized ssymv kernels for haswell
|
10 years ago |
Werner Saar
|
1bec9abb9a
|
added optimized dsymv kernels for sandybridge
|
10 years ago |
Werner Saar
|
3814bf60d3
|
added optimized dsymv kernels for haswell
|
10 years ago |
Werner Saar
|
6d0db0151f
|
added optimized zaxpy-kernels
|
10 years ago |
Zhang Xianyi
|
37b9033c90
|
Merge pull request #543 from jeromerobert/develop
Fix a buffer overflow with MAX_STACK_ALLOC size in dgemv_t
|
10 years ago |
Werner Saar
|
13889515b3
|
added optimized caxpy-kernel for sandybridge
|
10 years ago |
Werner Saar
|
248c9340c3
|
added optimized caxpy-kernel for haswell
|
10 years ago |
Werner Saar
|
e9f33b4ca7
|
added optimized caxpy-kernel for steamroller
|
10 years ago |
Werner Saar
|
f5d847122a
|
updated caxpy_microk_bulldozer-2.c and caxpy.c
|
10 years ago |
Jerome Robert
|
a4c96eca67
|
Fix a buffer overflow with MAX_STACK_ALLOC size in dgemv_t
Refs #478, #482, 9798481, fd9fd42
|
10 years ago |
Werner Saar
|
baa0363ea2
|
add optimized ddot-kernel for piledriver
|
10 years ago |
Werner Saar
|
34ba66606a
|
add optimized daxpy-kernel for piledriver
|
10 years ago |
Werner Saar
|
f615dc7603
|
added optimized saxpy kernel for steamroller
|
10 years ago |
Werner Saar
|
331c417637
|
optimized saxpy for piledriver
|
10 years ago |
Werner Saar
|
d7a17ad85d
|
optimized sdot-kernel for pilediver
|
10 years ago |
Werner Saar
|
d35f6c63c2
|
add optimized daxpy-kernel for steamroller
|
10 years ago |
Werner Saar
|
166d76e864
|
added optimized sdot-kernel for steamroller
|
10 years ago |
Werner Saar
|
f9f127d838
|
added optimized ddot kernel for steamroller
|
10 years ago |
wernsaar
|
62231ab337
|
Merge pull request #538 from wernsaar/develop
Added optimized cdot- and zdot-kernels
|
10 years ago |
Werner Saar
|
3119def9a7
|
updated cdot and zdot
|
10 years ago |
Werner Saar
|
33b332372a
|
add optimized cdot- and zdot-kernel for sandybridge
|
10 years ago |
Werner Saar
|
fd838c75bc
|
add optimized cdot- and zdot-kernel for haswell
|
10 years ago |
Werner Saar
|
b57a60dac8
|
updated cdot and zdot for piledriver
|
10 years ago |
Werner Saar
|
5c51163972
|
added optimized cdot- and zdot-kernel for steamroller
|
10 years ago |
Werner Saar
|
9299d8cfd6
|
added optimized cdot- and zdot-kernels for bulldozer
|
10 years ago |
Zhang Xianyi
|
0a3d3b945d
|
Refs #535. Fix the wrong vector instruction in sgemm sandy bridge kernel.
|
10 years ago |
Werner Saar
|
60c6dec6e6
|
updated some lines for bulldozer
|
10 years ago |
Werner Saar
|
47898cca35
|
added optimized saxpy- and daxpy-kernel for sandybridge
|
10 years ago |
Werner Saar
|
53bb924287
|
added optimized saxpy- and daxpy-kernel for haswell
|
10 years ago |
Werner Saar
|
a901b065d3
|
added optimized ddot-kernel for sandybridge
|
10 years ago |
Werner Saar
|
3937e2a0a0
|
add optimized sdot-kernel for sandybridge
|
10 years ago |
Werner Saar
|
9707d608d5
|
removed double definition line
|
10 years ago |
Werner Saar
|
701b9d7556
|
added optimized sdot- and ddot-kernel for HASWELL
|
10 years ago |
Zhang Xianyi
|
e5b96e55a7
|
Fix build bug for ARM64.
|
11 years ago |
Zhang Xianyi
|
ea7f9dacf4
|
Refs #509. Fixed geadd building bug with DYNAMIC_ARCH=1.
|
11 years ago |
Martin Koehler
|
39cc6b21d3
|
Add ATLAS-style ?geadd function
|
11 years ago |
Zhang Xianyi
|
229ce2ccd1
|
Add cortex-a9 and cortex-a15 targets.
|
11 years ago |
Zhang Xianyi
|
41aad0407f
|
Merge pull request #482 from jeromerobert/develop
Allow to do gemv and ger buffer allocation on the stack
|
11 years ago |
Werner Saar
|
ddf983d643
|
added optimizations for steamroller
|
11 years ago |