2531 Commits (cdebb4fd4b2bbbf856e5abdcedbe9a5cf348ef8e)

Author SHA1 Message Date
  Martin Kroeker 1fe96f8da7 Fix failures to handle increments of zero 2 years ago
  Martin Kroeker 73b30b1dec Fix VLEV_FLOAT/VSEV_FLOAT macros to compile with t-head 2.6.1 2 years ago
  Martin Kroeker c3a2d407a0 Merge pull request #4048 from imzhuhl/spr_sbgemm_fix 2 years ago
  Manjul Mohan 58b88aa5f0 POWER10: Fix compiler warnings 2 years ago
  ZhengSh 2a8bc38cdc Merge branch 'xianyi:risc-v' into risc-v 2 years ago
  Heller Zheng 0954746380 remove argument unused during compilation. 2 years ago
  sh-zheng d3bf5a5401 Combine two reduction operations of zhe/symv into one, with tail undisturbed setted. 3 years ago
  Honglin Zhu 9e80a194d6 Fix dynamic_list build and gcc version check error 3 years ago
  Honglin Zhu a76afdc047 Compatible with older version of GNU make 3 years ago
  sh-zheng 18d7afe69d Add rvv support for zsymv and active rvv support for zhemv 3 years ago
  Honglin Zhu 90f041e348 Invoke the syscall to allow the use of amx tiles 3 years ago
  Honglin Zhu 0b83088887 spr dynamic arch support 3 years ago
  Honglin Zhu f249ccb741 Fix spr sbgemm error 3 years ago
  Martin Kroeker e9a8d5b45f Merge pull request #4015 from martin-frbg/issue4013-2 3 years ago
  Martin Kroeker 72caceb324 Merge pull request #4009 from Mousius/sve-gemm 3 years ago
  Martin Kroeker 84bcf6639f Disable gcc's tree-vectorizer pass on all operating systems 3 years ago
  Martin Kroeker c9174ae8d7 Disable gcc's tree-vectorizer pass on all operating systems 3 years ago
  Martin Kroeker c2fe9cb91f Disable gcc's tree-vectorizer pass on all operating systems 3 years ago
  Martin Kroeker 66b39b835c Disable gcc's tree-vectorizer pass on all operating systems 3 years ago
  Martin Kroeker bb6d6735bf Disable gcc's tree-vectorizer pass on all operating systems 3 years ago
  Martin Kroeker d18efaed20 Disable gcc's tree-vectorizer pass on all operating systems 3 years ago
  Martin Kroeker 99f6d31ed5 Disable gcc's tree-vectorizer pass on all operating systems 3 years ago
  Martin Kroeker 7de9335c56 Disable gcc's tree-vectorizer pass on all operating systems 3 years ago
  Martin Kroeker 437c0bf2b4 Merge pull request #3843 from Mousius/switch-ratio 3 years ago
  Chris Sidebottom ec334e69dc Use SVE kernel for SGEMM/DGEMM on Arm(R) Neoverse(TM) V1 3 years ago
  Chris Sidebottom 32f2fafde7 Propagate SWITCH_RATIO to DYNAMIC_ARCH builds 3 years ago
  Martin Kroeker 44164e3a3d revert "move alpha out of register 18" (out of PR scope, no SVE on Apple hw) 3 years ago
  Martin Kroeker 8be68fa7f4 move declaration of sca to really keep the compiler from throwing it out (for now) 3 years ago
  Martin Kroeker 3727672a74 Improve workaround and keep compilers from optimizing it out 3 years ago
  Martin Kroeker 108a21e47a Move ALPHA out of register 18 (reserved on OSX) 3 years ago
  Martin Kroeker 0b1acb0ba3 Move ALPHA_I out of register 18 (reserved on OSX) 3 years ago
  Martin Kroeker c7bbad09ad Move ALPHA_I out of register 18 (reserved on OSX) 3 years ago
  Martin Kroeker cda29633a3 move ALPHA_I out of register 18 (reserved on OSX) 3 years ago
  Martin Kroeker 09ace3cf23 Merge pull request #3846 from lilh9598/sbgemm_opt 3 years ago
  Heller Zheng 1374a2d08b This PR adapts latest spec changes 3 years ago
  Zhang Xianyi 19f17c8bc6 Merge pull request #3893 from HellerZheng/develop 3 years ago
  Sergei Lewis cb0a70e0e2 dot.c early bail fix 3 years ago
  Sergei Lewis 9b61be4545 factoring riscv64/dot.c fix into separate PR as requested 3 years ago
  Sergei Lewis 2406958629 * update intrinsics to match latest spec at https://github.com/riscv-non-isa/rvv-intrinsic-doc (in particular, __riscv_ prefixes for rvv intrinsics) 3 years ago
  Martin Kroeker 38d6fb4225 Fix dependencies in builds with specified subsets of precision types 3 years ago
  Martin Kroeker e412bee313 fix GEMM kernel dependencies in builds that use only a subset of precisions 3 years ago
  Martin Kroeker d80adf253e make SSYMV available to BUILD_DOUBLE-only builds 3 years ago
  Martin Kroeker 5481c328e8 fix DYNAMIC_ARCH builds that use only a subset of precisions 3 years ago
  Heller Zheng 63cf4d0166 add riscv level3 C,Z kernel functions. 3 years ago
  Xianyi Zhang c19dff0a31 Fix T-Head RVV intrinsic API changes. 3 years ago
  Martin Kroeker 5a9cd87794 Merge pull request #3868 from Mousius/sve-prefetch 3 years ago
  Chris Sidebottom 1361229291 Remove prefetches from SVE kernels 3 years ago
  Bart Oldeman 60e49b851c Fix typo in clobber list, should be xmm14 instead of ymm14. 3 years ago
  Bart Oldeman 4afe1439a1 Fix skylake fallback kernel name for old compilers. 3 years ago
  Bart Oldeman 5ceca1a4d8 Add sscal.c + microkernels for Haswell, Zen, Skylake and newer. 3 years ago