1540 Commits (28d2dfe2b3bd6c779137fcb53451f97f47b78b37)

Author SHA1 Message Date
  Martin Kroeker 28d2dfe2b3
Fix macro name used in ifdef 5 years ago
  Rajalakshmi Srinivasaraghavan b435491885 Optimize caxpy for POWER10 5 years ago
  Chen, Guobing a7b1f9b1bb Implementation of BF16 based gemv 5 years ago
  Martin Kroeker 67f39ad813
Merge pull request #2939 from thrasibule/Makefile_cleanup 5 years ago
  Rajalakshmi Srinivasaraghavan c24ba8b1dd Optimize saxpy for POWER10 5 years ago
  Martin Kroeker 6f9460f0f6
Merge pull request #2937 from martin-frbg/pwr-buffersz 5 years ago
  Guillaume Horel 1917a4e7b8 reuse variables defined in Makefile.system 5 years ago
  Martin Kroeker 34c3c407ef
label always_inline function as inline to silence a gcc warning 5 years ago
  Martin Kroeker 2e48d560ba
Fix compiler version check 5 years ago
  Rajalakshmi Srinivasaraghavan ad745c0bae Optimize scopy/ccopy for POWER10 5 years ago
  İsmail Dönmez 4a1d00f589
Fix build with -Werror=return-type 5 years ago
  Bart Oldeman b073d759d0 x86_64: clobber all xmm registers after vzeroupper 5 years ago
  Martin Kroeker dc6e44c3f8
Merge pull request #2916 from martin-frbg/issue2911 5 years ago
  Martin Kroeker a61c086408
Fix spurious trailing whitespace in comment 5 years ago
  Bart Oldeman 03e781b766 sgemm_direct_skylakex: fix 75eeb26 regression. 5 years ago
  Martin Kroeker f1a4071d8c
Clean up STACKSIZE redefinition 5 years ago
  Martin Kroeker 97cf10062f
Clean up STACKSIZE redefinition 5 years ago
  Martin Kroeker 17e288e18d
Clean up STACKSIZE redefinition 5 years ago
  Martin Kroeker c1422f3e46
Clean up STACKSIZE redefinition 5 years ago
  Martin Kroeker d85b24e103
Clean up STACKSIZE redefinition 5 years ago
  Martin Kroeker df70667043
fix core list for sse/sse2 5 years ago
  Martin Kroeker f071d1207a
add sse2 5 years ago
  Martin Kroeker dc6cefd2f5
Expressly enable -msse for 32bit DYNAMIC_ARCH kernels 5 years ago
  Martin Kroeker c339c40c01
Silence a redefinition warning 5 years ago
  Martin Kroeker 10379fc83b
Use ifdef instead of if 5 years ago
  Martin Kroeker 4c25910da0
Merge pull request #2896 from martin-frbg/intrin-double 5 years ago
  Martin Kroeker ae6ac83991
Revert "add double precision SSE" 5 years ago
  Qiyu8 4fac91ef37 adapt arm platform 5 years ago
  Qiyu8 bfdf4b56da Add double precision universal intrinsics for X86/ARM 5 years ago
  Martin Kroeker ebf0470fc2
add sse4.1 for DYNAMIC_ARCH kernels 5 years ago
  Martin Kroeker c9c3ae07af
Add double precision operations 5 years ago
  Martin Kroeker 756802df61
Merge pull request #2890 from martin-frbg/s-d-sum 5 years ago
  Rajalakshmi Srinivasaraghavan 0826d68f93 POWER10: Change the packing format for bfloat16 5 years ago
  Rajalakshmi Srinivasaraghavan b5d30b390d Fix build issues with bfloat16 5 years ago
  Martin Kroeker fecedc9c69
Add -mssse3 5 years ago
  Martin Kroeker 0eacbca85f
Add Haswell and Zen to temporary sse3 whitelist 5 years ago
  Martin Kroeker 6999086a2b
whitelist SANDYBRIDGE for SSE3 5 years ago
  Martin Kroeker 8d2df7d066
Revert special handling of Windows xNRM2 and enable C+intrinsics kernel for SSUM/DSUM 5 years ago
  Martin Kroeker 08929430cd
Merge pull request #2886 from martin-frbg/issue_2767 5 years ago
  Martin Kroeker 0c84ffe05f
Merge pull request #2881 from mattip/fninit 5 years ago
  Matti Picus 403eb513a0 use emms instead, add WIN guards 5 years ago
  Qiyu8 0ed1f07660 Optimize the performance of sum by using universal intrinsics 5 years ago
  Martin Kroeker 3aecafad80
Change "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Martin Kroeker 756062afa5
Rename "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Martin Kroeker 2061f7fdff
Rename "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Martin Kroeker dc8a1afa63
Rename "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Martin Kroeker fd94236042
Rename "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Martin Kroeker 68ce719fac
Rename shdot_microk_cooperlake.c to sbdot_microk_cooperlake.c 5 years ago
  Martin Kroeker d7dd9b396c
Rename shdot.c to sbdot.c 5 years ago
  Martin Kroeker 9ae80490e0
rename "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago