2103 Commits (304a9b60afbc79be77193cc3bdb5bb5d503aa533)

Author SHA1 Message Date
  Dirreke ec89466e14 Add CSKY support 2 years ago
  Martin Kroeker 0d2e486edf Handle NAN and INF 2 years ago
  Martin Kroeker 5f5b7c4f45 Merge pull request #4423 from martin-frbg/issue4422 2 years ago
  Martin Kroeker f31bea07dd Merge pull request #4419 from martin-frbg/issue4413 2 years ago
  Martin Kroeker 20413ee6ec Update zscal.c 2 years ago
  Martin Kroeker b57627c27f Handle NAN and INF 2 years ago
  Martin Kroeker 995a990e24 Make AVX512 BFLOAT16 kernels conditional on compiler capability 2 years ago
  Martin Kroeker 7df363e1e2 temporarily disable the MSA C/ZSCAL kernels 2 years ago
  Chip-Kerchner 058dd2a4cb Replace two vector loads with one vector pair load and fix endianess of stores - DGEMM versions. 2 years ago
  Martin Kroeker 1c31f56e5a Handle NAN 2 years ago
  Martin Kroeker 7ee1ee38e2 Handle NaN in input 2 years ago
  Martin Kroeker f637e12713 Handle INF and NAN 2 years ago
  Martin Kroeker 25b0c48082 Update zscal.c 2 years ago
  Martin Kroeker 5e7f714e93 Update zscal.c 2 years ago
  Martin Kroeker cf8b03ae8b Use NAN rather than SNAN for portability 2 years ago
  Martin Kroeker f0808d856b Handle NAN in input 2 years ago
  Martin Kroeker acf17a825d Handle NAN in input 2 years ago
  Martin Kroeker c9df62e883 Fix handling of NAN 2 years ago
  Martin Kroeker def4996170 Fix handling of NAN and INF arguments 2 years ago
  Martin Kroeker 519b40fad9 Merge pull request #4398 from yinshiyou/la-dev 2 years ago
  pengxu a5d0d21378 loongarch64: Add zgemm and cgemm optimization 2 years ago
  gxw 546f13558c loongarch64: Add {c/z}swap and {c/z}sum optimization 2 years ago
  Hao Chen edabb93668 loongarch64: Refine axpby optimization functions. 2 years ago
  Hao Chen 1ec5dded43 loongarch64: Add c/zrot optimization functions. 2 years ago
  Hao Chen 3c53ded315 loongarch64: Add c/znrm2 optimization functions. 2 years ago
  Hao Chen fbd612f8c4 loongarch64: Add ic/zamin optimization functions. 2 years ago
  Hao Chen d97272cb35 loongarch64: Add c/zdot optimization functions. 2 years ago
  Hao Chen 65a0aeb128 loongarch64: Add c/zcopy optimization functions. 2 years ago
  Hao Chen 2a34fb4b80 loongarch64: Add and refine scal optimization functions. 2 years ago
  Hao Chen 8785e948b5 loongarch64: Add camin optimization function. 2 years ago
  Hao Chen 0753848e03 loongarch64: Refine and add axpy optimization functions. 2 years ago
  Hao Chen 06fd5b5995 loongarch64: Add and Refine asum optimization functions. 2 years ago
  guxiwei e771be185e Optimize copy functions with lsx. 2 years ago
  Hao Chen 179ed51d3b Add dgemm_kernel_8x4.S file. 2 years ago
  Hao Chen 173a65d4e6 loongarch64: Add and refine iamax optimization functions. 2 years ago
  zhoupeng ea70e165c7 loongarch64: Refine rot optimization. 2 years ago
  zhoupeng 116aee7527 loongarch64: Refine imin optimization. 2 years ago
  zhoupeng 8be2654193 loongarch64: Refine imax optimization. 2 years ago
  zhoupeng 154baad454 loongarch64: Refine iamin optimization. 2 years ago
  Shiyou Yin 36c12c4971 loongarch64: Refine copy,swap,nrm2,sum optimization. 2 years ago
  Shiyou Yin c6996a80e9 loongarch64: Refine amax,amin,max,min optimization. 2 years ago
  Chris Sidebottom ecae1389df Reduce duplication in kernel definitions 2 years ago
  Chris Sidebottom 60e66725e4 Use numeric labels to allow repeated inlining 2 years ago
  Chris Sidebottom 7a4fef4f60 Tweak SVE dot kernel 2 years ago
  Martin Kroeker f06b535566 Use C kernel for dgemv_t due to limitations of the old assembly one 2 years ago
  barracuda156 d9653af018 KERNEL.PPC970, KERNEL.PPCG4: unbreak CMake parsing 2 years ago
  Chip-Kerchner 93747fb377 Merge remote-tracking branch 'origin/develop' into power10Copies 2 years ago
  Chip-Kerchner 4e738e561a Replace two vector loads with one vector pair load and fix endianess of stores. 2 years ago
  yancheng d32f38fb37 loongarch64: Add optimizations for nrm2. 2 years ago
  yancheng f9b468990e loongarch64: Add optimizations for rot. 2 years ago