1039 Commits (76a66eaac8aaa795dddc26af4e43acb455654a18)

Author SHA1 Message Date
  Ashwin Sekhar T K d5aeff636f ARM64: Enable DYNAMIC_ARCH 7 years ago
  Ashwin Sekhar T K e7b66cd36e ARM64: Fix DYNAMIC_ARCH compilation for cores which dont use GEMM3M 7 years ago
  Ashwin Sekhar T K d50abc8903 ARM64: Move parameters from parameter.c to param.h 7 years ago
  Ashwin Sekhar T K 351a0c777c ARM64: Remove XGENE1 references 7 years ago
  Ashwin Sekhar T K 21f46a1cf2 ARM64: Use THUNDERX2T99 Neon Kernels for ARMV8 7 years ago
  Ashwin Sekhar T K caf339412f ARM64: Remove dependency of THUNDERX2T99 Makefile on CORTEXA57 Makefile 7 years ago
  Ashwin Sekhar T K 8001fdcd2a ARM64: Remove dependency of THUNDERX Makefile on ARMV8 Makefile 7 years ago
  Ashwin Sekhar T K 162e312832 ARM64: Remove dependency of CORTEXA57 Makefile on ARMV8 Makefile 7 years ago
  Ashwin Sekhar T K c3d93caa8d ARM64: Remove dependency of XGENE1 Makefile on ARMV8 Makefile 7 years ago
  Arjan van de Ven 55b244ca0d enable the SGEMM/SKX C based kernel 7 years ago
  Arjan van de Ven d4bad73834 Add a C+intrinsics version of the SGEMM/skylakex kernel 7 years ago
  Arjan van de Ven 582c589727 dgemm/skylakex: replace discrete mul/add with fma 7 years ago
  Arjan van de Ven adbf6afa25 Add vector optimizations for ncopy as well for dgemm/skylakex 7 years ago
  Arjan van de Ven 32bec8afbb add a skylakex optimized dgemm beta function 7 years ago
  Arjan van de Ven 20c5d668fe dgemm/avx512 simplify and speed up the 4x4 kernel 7 years ago
  Arjan van de Ven 6d43c51ccf undo slow dgemm/skylake microoptimization 7 years ago
  Arjan van de Ven d74dc39b0f Add optimized *copy versions for skylakex 7 years ago
  Arjan van de Ven 66b43affbc Add a 24x8 kernel to the skylakex dgemm implementation 7 years ago
  Arjan van de Ven 1938819c25 skylake dgemm: Add a 16x8 kernel 7 years ago
  Martin Kroeker b7496c3638
Function name needs to be CNAME, set from outside to allow suffixing for dynamic_arch 7 years ago
  Arjan van de Ven 45fe8cb0c5 Create a AVX512 enabled version of DGEMM 7 years ago
  Martin Kroeker 544b069e85
Merge pull request #1780 from martin-frbg/issue1774-2 7 years ago
  Martin Kroeker 9b2a7ad40d
Convert fldmia/fstmia instructions to UAL syntax for clang7 7 years ago
  fengruilin 6fc85a6359 test_axpy work error on LOONGSON3A platform #1777 7 years ago
  Martin Kroeker 7e5df34e6a
Convert fldmia/fstmia instructions to UAL syntax for clang7 7 years ago
  Andrew 1e531701b7 fix small typo 7 years ago
  Martin Kroeker ba4f433321
Merge pull request #1749 from martin-frbg/issue1531 7 years ago
  Martin Kroeker 1cb7b9015e
Conditional compilation of assembly files that IOS does not like 7 years ago
  Martin Kroeker a4bd41e9f2
Fix paths to C kernels for nrm2 7 years ago
  Martin Kroeker e11126b26a
Merge pull request #1745 from martin-frbg/issue1743 7 years ago
  Martin Kroeker f3fd44a731
Set USE_TRMM for all ZARCH variants to fix TRMM faults with zarch-generic 7 years ago
  Martin Kroeker 375dff54fc
Merge pull request #1733 from fenrus75/dsymv 7 years ago
  Martin Kroeker a5f165275a
Merge pull request #1732 from fenrus75/dgemv 7 years ago
  Martin Kroeker 8c13aa495a
Merge pull request #1730 from fenrus75/fix-sdot 7 years ago
  Arjan van de Ven 9bec34cb67 Add an AVX512 enabled DSYMV (L) function 7 years ago
  Arjan van de Ven 87bebdbd8a Add an AVX512 enabled DGEMV (n) function 7 years ago
  Arjan van de Ven 36add7570a Fix typo in sdot function 7 years ago
  Arjan van de Ven cacacc8007 Add an AVX512 enabled DSCAL function 7 years ago
  Martin Kroeker 1a00ef3d27
Merge pull request #1725 from fenrus75/axpy 7 years ago
  Arjan van de Ven 2e99873ff7 Add a AVX512 enabled SAXPY/DAXPY functions 7 years ago
  Arjan van de Ven 00abaa865b Add an AVX512 enabled SDOT function 7 years ago
  Arjan van de Ven 7932ff3ea9 Add an AVX512 enabled DDOT function 7 years ago
  Martin Kroeker 4e103c822c
typo fix 7 years ago
  Martin Kroeker d2142760e0
Fix precision problem in DSDOT 7 years ago
  Martin Kroeker 2fbfc64da8
Use C kernels for default c/zAXPY, xROT, c/zSWAP 7 years ago
  Martin Kroeker ba8388cee0
Merge pull request #1651 from martin-frbg/avx512-nodgemm 7 years ago
  Martin Kroeker 6e54b0a027
Disable the 16x2 DTRMM kernel on SkylakeX as well 7 years ago
  Martin Kroeker 40c8cbc3bf
Merge pull request #1650 from martin-frbg/avx512-nodgemm 7 years ago
  Martin Kroeker f0a8dc2eec
Disable the AVX512 DGEMM kernel for now 7 years ago
  Martin Kroeker b83e4c60c7
Remove premature exit for INC_X or INC_Y zero 7 years ago