Arjan van de Ven
d321448a63
dgemm: use dgemm_ncopy_8_skylakex.c also for Haswell
The dgemm_ncopy_8_skylakex.c code is not avx512 specific and gives
a nice performance boost for medium sized matrices
7 years ago
Arjan van de Ven
c43331ad0a
dgemm: Use the skylakex beta function also for haswell
it's more efficient for certain tall/skinny matrices
7 years ago
Arjan van de Ven
0586899a10
Use sgemm_ncopy_4_skylakex.c also for Haswell
sgemm_ncopy_4_skylakex.c uses SSE transpose operations where the
real perf win happens; this also works great for Haswell.
This gives double digit percentage gains on small and skinny matrices
7 years ago
Arjan van de Ven
00dc09ad19
Use the skylake sgemm beta code also for haswell
with a few small changes it's possible to use the skylake sgemm code
also for haswell, this gives a modest gain (10% range) for smallish
matrixes but does wonders for very skinny matrixes
7 years ago
Martin Kroeker
28c3fa8950
Add dsdot
8 years ago
Werner Saar
c8f2c5d636
added optimized trsm_kernels
10 years ago
Werner Saar
e7c969e164
added optimized dtrmm_kernel for haswell
10 years ago
Werner Saar
9bd962f655
modified haswell parameter dgemm_unroll_n
10 years ago
Werner Saar
31c9e399e9
added optimized cscal kernel for haswell
10 years ago
Werner Saar
d63034303b
added optimized zscal kernel for haswell
10 years ago
Werner Saar
02e772c7e4
added optimized dscal kernel for haswell
10 years ago
Werner Saar
1c4b0eeae3
added optimized ssymv kernels for haswell
10 years ago
Werner Saar
3814bf60d3
added optimized dsymv kernels for haswell
10 years ago
Werner Saar
6d0db0151f
added optimized zaxpy-kernels
10 years ago
Werner Saar
248c9340c3
added optimized caxpy-kernel for haswell
10 years ago
Werner Saar
fd838c75bc
add optimized cdot- and zdot-kernel for haswell
11 years ago
Werner Saar
53bb924287
added optimized saxpy- and daxpy-kernel for haswell
11 years ago
Werner Saar
701b9d7556
added optimized sdot- and ddot-kernel for HASWELL
11 years ago
wernsaar
8f100a14f2
optimized cgemv_t kernel for haswell
11 years ago
wernsaar
1a352b24e6
updated KERNEL.HASWELL
11 years ago
wernsaar
e0192a6914
bugfix in zgemv_n_4.c
11 years ago
wernsaar
baa46e4fba
added and tested optimized dgemv_n kernel for haswell
11 years ago
wernsaar
debc6d1a05
bugfix in KERNEL.HASWELL
11 years ago
wernsaar
e73a0113ec
added optimized gemv kernels
11 years ago
wernsaar
80f7786875
enabled optimized sgemv kernels for piledriver
11 years ago
wernsaar
d143f84dd2
added optimized sgemv_n kernel for haswell
11 years ago
wernsaar
11eab4c019
added optimized cgemv_n for haswell
11 years ago
wernsaar
4568d32b6b
added optimized cgemv_t kernel for haswell
11 years ago
wernsaar
8c582d362d
optimized zgemv_t_microk_haswell-2.c
11 years ago
wernsaar
11e34ddd1b
bugfix for zgemv_n_microk_haswell-2.c
11 years ago
wernsaar
58b075daef
added optimized zgemv_t kernel for haswell
11 years ago
wernsaar
dbc2eff029
disabled optimized haswell zgemv_n kernel for windows ( bad rounding )
11 years ago
wernsaar
462b4885ff
added optimized zgemv_n kernel for haswell
11 years ago
wernsaar
006ef3ea01
added optimized dgemv_t kernel for haswell
11 years ago
wernsaar
60f17628cc
added optimized dgemv_n kernel for haswell
11 years ago
wernsaar
7aa43c8928
enabled optimized sgemv kernels for windows
11 years ago
wernsaar
95a8caa2f3
added optimized sgemv_t kernel
11 years ago
wernsaar
2bab92961f
enabled optimized sgemv_n kernels for windows
11 years ago
wernsaar
3fbc13eb65
modified sgemv_n for haswell
11 years ago
wernsaar
6acbafe45b
added sgemv_n microkernel for haswell
11 years ago
wernsaar
d9d4077c93
added sgemv_t microkernel for haswell
11 years ago
wernsaar
880597b301
segment violation in sgemv kernels
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
wernsaar
d2c82d7543
enabled optimized sgemv kernel for HASWELL
11 years ago
wernsaar
a77c71eaf5
added highly optimized dgemm_kernel for HASWELL
12 years ago
Zhang Xianyi
2638370844
Init code base for Intel Haswell.
12 years ago