10 Commits (develop)

Author SHA1 Message Date
  Chip Kerchner 64401b4417 Disable vectorized packing for DGEMM - since it is slower than scalar. 8 months ago
  Chip Kerchner c00afc86a6 Add and use vectorized packing to ZVL128B and ZVL256B. Up to 3x+ faster than generic scalar functions. 8 months ago
  Srangrang ec14e1648c fix: resolve non-RISCV host build failed issue 10 months ago
  Srangrang fb89820f20 Merge branch 'develop' of https://github.com/Srangrang/OpenBLAS into develop 11 months ago
  gkdddd 670ec6f757 Added shgemm_kernel_8x8 for RISCV64_ZVL128B and shgemm_kernel_16x8 for RISCV64_ZVL256B 11 months ago
  guoyuanplct d2003dc886 del lines 11 months ago
  guoyuanplct 45fd2d9b07 Optimized the axpby function. 11 months ago
  guoyuanplct 4d213653d8 kernel/riscv64:Added support for omatcopy on riscv64. 11 months ago
  tingbo.liao 3c8df6358f Further rearranged the rotm kernel for the different architectures. 1 year ago
  Sergei Lewis 2406958629 * update intrinsics to match latest spec at https://github.com/riscv-non-isa/rvv-intrinsic-doc (in particular, __riscv_ prefixes for rvv intrinsics) 3 years ago