Hank Anderson
14fd3d35de
Added checks for missing defines in kernel.
11 years ago
Hank Anderson
cebc07cebd
ParseMakefileVars now recursively parses included makefiles.
11 years ago
Hank Anderson
33c5e8db7f
Added a helper function for setting the L1 kernel defaults.
Added loop to build objects with different KERNEL defines.
11 years ago
Hank Anderson
4662a0b13a
Changed generate functions to iterate through a list of float types.
This will generate obj files for SINGLE/DOUBLE/COMPLEX/DOUBLE COMPLEX.
11 years ago
Hank Anderson
162791e30e
Added common objects from kernel Makefile.
11 years ago
Hank Anderson
c0624a26be
Fixed some dgemm_copy function names.
11 years ago
Hank Anderson
4bfaf1ce66
Removed some list appends I missed.
11 years ago
Hank Anderson
e8c39138c6
Removed return value from GenerateNamedObjects.
It sets DBLAS_OBJS directly to save a bunch of list appending in the
CMakeLists.txt files.
11 years ago
Hank Anderson
f992799226
Added the rest of Makefile.L3.
11 years ago
Hank Anderson
4c65afcce1
Changed kernel filenames to vars. These will need to be read from KERNEL.
Added some kernel/L3 objects.
11 years ago
Hank Anderson
7fa5c4e2fd
Fixed some case issues with ARCH.
Added some kernel and driver/others objects.
11 years ago
Hank Anderson
fa0e6a6c93
Added the rest of the L1 kernel makefile.
11 years ago
Hank Anderson
38681fb1c6
Added more kernel files.
11 years ago
Hank Anderson
189fadfde0
Started implementing kernel/Makefile in cmake.
11 years ago
Werner Saar
ddf983d643
added optimizations for steamroller
11 years ago
Werner Saar
4319769b79
added target processor STEAMROLLER
11 years ago
Werner Saar
587e16fba3
Ref #458: Backport, sandybridge uses nehalem zgemm kernel
11 years ago
Werner Saar
6261342de3
small optimization on dgemm_kernel for N=1
11 years ago
Werner Saar
bc5fff7085
changed inline assembler labels to short form
11 years ago
Zhang Xianyi
0cf29ba6d2
Fixed a bug of sgemm sandy bridge kernel.
Reported by Julia project. JuliaLang/julia#9084
11 years ago
Zhang Xianyi
2fb02626da
Update organization info.
11 years ago
Zhang Xianyi
a85c2785ae
Refs #467. Added generic kernel file for x86_64.
11 years ago
Benedikt Huber
58c90d5937
Optimizations for APM's xgene-1 (aarch64).
1) General system updates to support armv8 better. `make all` did not work; one needed to supply TARGET=ARMV8.
2) sgemm 4x4 kernel in assembler using SIMD, and configuration changes to use it.
3) strmm 4x4 kernel in C. Since the sgemm kernel does 4x4, the trmm kernel must also do 4xN.
Added Dave Nuechterlein to the contributors list.
11 years ago
wernsaar
7aae4a62e7
enabled use of GEMM3M functions
11 years ago
wernsaar
b7c9566eea
removed obsolete gemv kernel files
11 years ago
wernsaar
6df1b0be81
optimized zgemv_n_microk_sandy-4.c
11 years ago
wernsaar
2ac1e076c1
added optimized zgemv_n kernel for sandybridge
11 years ago
wernsaar
9908b6031c
bugfix in KERNEL.PILEDRIVER
11 years ago
wernsaar
8f100a14f2
optimized cgemv_t kernel for haswell
11 years ago
wernsaar
53b5726b04
added optimized cgemv_t kernel for haswell
11 years ago
wernsaar
1a352b24e6
updated KERNEL.HASWELL
11 years ago
wernsaar
5194818d4b
updated zgemv_t_4.c
11 years ago
wernsaar
8a39cdb1c1
added optimized zgemv_t kernel for haswell
11 years ago
wernsaar
0a1390f2d8
enabled optimized zgemv_t kernel for bulldozer
11 years ago
wernsaar
a8b0812feb
optimized zgemv_t for bulldozer
11 years ago
wernsaar
a0fb68ab42
added optimized zgemv_t kernel for bulldozer
11 years ago
wernsaar
44c11165d5
bugfix in cgemv_t_4.c
11 years ago
wernsaar
564be4eb72
added optimized cgemv_t kernel
11 years ago
wernsaar
107c3ea7d5
added optimized zgemv_t routine
11 years ago
wernsaar
bb8d698335
optimized zgemv_n_microk_haswell-4.c for small size
11 years ago
wernsaar
e0192a6914
bugfix in zgemv_n_4.c
11 years ago
wernsaar
bced4594bb
added optimized zgemv_n kernel
11 years ago
wernsaar
cafba99b6b
bugfix in cgemv_n_microk_haswell-4.c
11 years ago
wernsaar
ac8f232b2a
more optimizations
11 years ago
wernsaar
f98e1244c4
optimized cgemv_n_4.c
11 years ago
wernsaar
be95700b30
added optimized cgemv_kernel for haswell
11 years ago
wernsaar
4aa534ae93
added cgemv_n kernel, optimized for small sizes
11 years ago
wernsaar
baa46e4fba
added and tested optimized dgemv_n kernel for haswell
11 years ago
wernsaar
faab7a181d
added optimized dgemv_n kernel for haswell
11 years ago
wernsaar
8109d8232c
optimized dgemv_t kernel for haswell
11 years ago