7938 Commits (500ac4de5e20596d5cd797d745db97dd0a62ff86)
 

Author SHA1 Message Date
  Martin Kroeker 500ac4de5e
fix incompatible pointer types 2 years ago
  Martin Kroeker b3fa16345d
fix prototype for c/zaxpby 2 years ago
  Martin Kroeker e9cfb7fd30
Merge pull request #4491 from martin-frbg/fixup-4488 2 years ago
  Martin Kroeker e9f480111e
fix sbgemm bfloat16 conversion errors introduced in PR 4488 2 years ago
  Martin Kroeker 22b487b622
Merge pull request #4488 from martin-frbg/issue4475-2 2 years ago
  Martin Kroeker 818bf30628
Merge pull request #4490 from ChipKerchner/missingCPUIDsForAIX 2 years ago
  Martin Kroeker 344763331a
Merge pull request #4484 from martin-frbg/lapack981 2 years ago
  Chip Kerchner 08ce6b1c1c Add missing CPU ID definitions for old versions of AIX. 2 years ago
  Martin Kroeker fb99fc2e6e
fix type conversion warnings 2 years ago
  Martin Kroeker 08e479f956
Merge pull request #4487 from ErnstPeng/feature-branch 2 years ago
  Martin Kroeker d4db6a9f16
Separate the interface for SBGEMMT from GEMMT due to differences in GEMV arguments 2 years ago
  pengxu fe3da43b7d Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch 2 years ago
  Martin Kroeker e5d2725e5a
Merge pull request #4185 from XiWeiGu/mips_enable_msa 2 years ago
  Martin Kroeker 479e4af089
Rescale input vector more often to minimize relative error (Reference-LAPACK PR 981) 2 years ago
  Martin Kroeker a4fde2c5ac
Merge pull request #4451 from martin-frbg/overflow_reset 2 years ago
  Martin Kroeker b537528feb
Merge pull request #4480 from XiWeiGu/loongarch64-fixed-{s/d}amin-lsx 2 years ago
  Martin Kroeker bc7154a80d
Merge pull request #4482 from martin-frbg/issue4476 2 years ago
  Martin Kroeker 6d8a273cca
Handle zero increment(s) in C910V ?AXPBY (#4483) 2 years ago
  Martin Kroeker dbcf4f8b7d
Merge pull request #4479 from XiWeiGu/loongarch-opt-axpby 2 years ago
  Martin Kroeker dc802dd637
Merge pull request #4474 from ChipKerchner/sgemmIncopy_PR 2 years ago
  Martin Kroeker e307675222
Merge pull request #4478 from martin-frbg/issue4475 2 years ago
  Martin Kroeker 033168cdf0
Merge pull request #4481 from martin-frbg/cpuid_riscv 2 years ago
  Martin Kroeker a29f91ae9a
Merge pull request #4471 from ChipKerchner/fixMakefileAIXOpenMP 2 years ago
  Martin Kroeker e61d96303d
Fix missing NO_AVX2 fallback for SapphireRapids 2 years ago
  Martin Kroeker d02c61e82e
Update lowercase cpunames for RISC-V 2 years ago
  Martin Kroeker 7228c708d7
Merge pull request #4461 from markdryan/cpuid_riscv64_crash 2 years ago
  gxw adde725321 LoongArch64: Fixed {s/d}amin LSX optimization 2 years ago
  gxw 7bc93d95a1 LoongArch64: Opt {c/z}axpby 2 years ago
  gxw 1e1f487dc7 LoongArch64: Fixed {s/d}axpby 2 years ago
  gxw 3597827c93 utest: add axpby 2 years ago
  Martin Kroeker 68d354814f
Fix incompatible pointer type in BFLOAT16 mode 2 years ago
  Martin Kroeker 3848d4e9f4
Merge pull request #4477 from martin-frbg/c910caxpy 2 years ago
  Martin Kroeker 4d8dee508c
temporarily disable the CAXPY/ZAXPY kernels 2 years ago
  Martin Kroeker 27816fa929
Merge pull request #4472 from sergei-lewis/dev/slewis/merge-from-riscv 2 years ago
  Chip Kerchner 2bb7ea64a1 Only vectorize 64-bit version for Power8. 2 years ago
  Sergei Lewis 3ffd6868d7 Merge branch 'develop' into dev/slewis/merge-from-riscv 2 years ago
  Sergei Lewis a3b0ef6596 Restore riscv64 fixes from develop branch: dot product double precision accumulation, zscal NaN handling 2 years ago
  Martin Kroeker ec74dcd213
Merge pull request #4470 from martin-frbg/issue4455 2 years ago
  Chip Kerchner 61c8e19f95 Fix Makefile to support OpenMP on AIX for xlc (clang) with xlf. 2 years ago
  Martin Kroeker 47bd064763
Fix names in build rules 2 years ago
  Martin Kroeker a7d004e820
Fix CBLAS prototype 2 years ago
  Martin Kroeker b54cda8490
Unify creation of CBLAS interfaces for ?AMIN/?AMAX and C/ZAXPYC between gmake and cmake builds 2 years ago
  Martin Kroeker 1a6fdb0353
Add prototypes for extensions ?AMIN/?AMAX and CAXPYC/ZAXPYC 2 years ago
  Martin Kroeker d1343302bd
Merge pull request #4465 from XiWeiGu/utest-zscal 2 years ago
  gxw 969601a1dc X86_64: Fixed bug in zscal 2 years ago
  Martin Kroeker 98c9ff3194
Merge pull request #4464 from XiWeiGu/loongarch64-zscal 2 years ago
  Martin Kroeker 9f0630187a
Merge pull request #4463 from XiWeiGu/loongarch64-zamax-zamin 2 years ago
  Chip Kerchner 09bb48d1b9 Vectorize in-copy packing/copying for SGEMM - 4X faster. 2 years ago
  gxw bb043a021f utest: Add tests for zscal 2 years ago
  gxw 83ce97a4ca LoongArch64: Handle NAN and INF 2 years ago