Werner Saar
30f52d53df
optimized dgemv_n kernel for haswell
10 years ago
Werner Saar
5e83d80725
optimized dger kernel for sandybridge
10 years ago
Werner Saar
b2e1797dc6
added optimized sger kernel for sandybridge
10 years ago
Werner Saar
e216f686cb
optimized saxpy and daxpy for sandybridge
10 years ago
Werner Saar
fc0e0391f3
bugfixes: replaced int with BLASLONG
10 years ago
Werner Saar
c22068c406
optimized sdot.c for increments != 1
10 years ago
Werner Saar
dee100d0e4
optimized saxpy.c for increments != 1
10 years ago
Werner Saar
0273966abb
optimized daxpy kernel for increments != 1
10 years ago
Werner Saar
3a67daa954
optimized ddot.c for increments != 1
10 years ago
Werner Saar
b4f2153dcd
added optimized ssymv kernels for sandybridge
10 years ago
Werner Saar
1c4b0eeae3
added optimized ssymv kernels for haswell
10 years ago
Werner Saar
1bec9abb9a
added optimized dsymv kernels for sandybridge
10 years ago
Werner Saar
3814bf60d3
added optimized dsymv kernels for haswell
10 years ago
Werner Saar
6d0db0151f
added optimized zaxpy-kernels
10 years ago
Zhang Xianyi
37b9033c90
Merge pull request #543 from jeromerobert/develop
Fix a buffer overflow with MAX_STACK_ALLOC size in dgemv_t
10 years ago
Werner Saar
13889515b3
added optimized caxpy-kernel for sandybridge
10 years ago
Werner Saar
248c9340c3
added optimized caxpy-kernel for haswell
10 years ago
Werner Saar
e9f33b4ca7
added optimized caxpy-kernel for steamroller
10 years ago
Werner Saar
f5d847122a
updated caxpy_microk_bulldozer-2.c and caxpy.c
10 years ago
Jerome Robert
a4c96eca67
Fix a buffer overflow with MAX_STACK_ALLOC size in dgemv_t
Refs #478 , #482 , 9798481, fd9fd42
10 years ago
Werner Saar
baa0363ea2
add optimized ddot-kernel for piledriver
10 years ago
Werner Saar
34ba66606a
add optimized daxpy-kernel for piledriver
10 years ago
Werner Saar
f615dc7603
added optimized saxpy kernel for steamroller
10 years ago
Werner Saar
331c417637
optimized saxpy for piledriver
10 years ago
Werner Saar
d7a17ad85d
optimized sdot-kernel for pilediver
10 years ago
Werner Saar
d35f6c63c2
add optimized daxpy-kernel for steamroller
10 years ago
Werner Saar
166d76e864
added optimized sdot-kernel for steamroller
11 years ago
Werner Saar
f9f127d838
added optimized ddot kernel for steamroller
11 years ago
wernsaar
62231ab337
Merge pull request #538 from wernsaar/develop
Added optimized cdot- and zdot-kernels
11 years ago
Werner Saar
3119def9a7
updated cdot and zdot
11 years ago
Werner Saar
33b332372a
add optimized cdot- and zdot-kernel for sandybridge
11 years ago
Werner Saar
fd838c75bc
add optimized cdot- and zdot-kernel for haswell
11 years ago
Werner Saar
b57a60dac8
updated cdot and zdot for piledriver
11 years ago
Werner Saar
5c51163972
added optimized cdot- and zdot-kernel for steamroller
11 years ago
Werner Saar
9299d8cfd6
added optimized cdot- and zdot-kernels for bulldozer
11 years ago
Zhang Xianyi
0a3d3b945d
Refs #535 . Fix the wrong vector instruction in sgemm sandy bridge kernel.
11 years ago
Werner Saar
60c6dec6e6
updated some lines for bulldozer
11 years ago
Werner Saar
47898cca35
added optimized saxpy- and daxpy-kernel for sandybridge
11 years ago
Werner Saar
53bb924287
added optimized saxpy- and daxpy-kernel for haswell
11 years ago
Werner Saar
a901b065d3
added optimized ddot-kernel for sandybridge
11 years ago
Werner Saar
3937e2a0a0
add optimized sdot-kernel for sandybridge
11 years ago
Werner Saar
9707d608d5
removed double definition line
11 years ago
Werner Saar
701b9d7556
added optimized sdot- and ddot-kernel for HASWELL
11 years ago
Zhang Xianyi
e5b96e55a7
Fix build bug for ARM64.
11 years ago
Hank Anderson
84d90d6ed8
Fixed some compiler errors/warnings for clang.
11 years ago
Hank Anderson
518e2424a8
Fixed bad filename for cpuid.S compile.
11 years ago
Zhang Xianyi
ea7f9dacf4
Refs #509 . Fixed geadd building bug with DYNAMIC_ARCH=1.
11 years ago
Hank Anderson
0d8e227ea7
Changed strategy for setting preprocessor definitions.
Instead of generating separate object files for each permutation of
defines for a source file, GenerateNamedObjects now writes an entirely
new source file and inserts the defines as #define c statements.
This solves a problem I ran into with ar.exe where it was refusing to
link objects that had the same filename despite having different paths.
11 years ago
Hank Anderson
12d1fb2e40
Fixed incorrect object name in kernel CMakeLists.txt
11 years ago
Hank Anderson
1b7f427401
Added conj gemv objects for complex build.
11 years ago