4049 Commits (92b10212de6972c808ebeccfe9fac0a82012e94e)
 

Author SHA1 Message Date
  wjc404 92b10212de
optimize AVX2 SGEMM 6 years ago
  wjc404 b73bf01378
optimize AVX2 SGEMM 6 years ago
  wjc404 eb3c9f1db9
optimize AVX2 SGEMM 6 years ago
  wjc404 a0f0a802fc
Update zgemm3m_kernel_4x4_haswell.c 6 years ago
  wjc404 700fe5b5ee
Add files via upload 6 years ago
  wjc404 bb2729c855
Update CONTRIBUTORS.md 6 years ago
  wjc404 aae44d040d
Update CONTRIBUTORS.md 6 years ago
  wjc404 6362c34ee6
Update param.h 6 years ago
  wjc404 f60840c420
Update KERNEL.ZEN 6 years ago
  wjc404 109e18cd96
Update KERNEL.HASWELL 6 years ago
  wjc404 ae1579be13
Create zgemm3m_kernel_4x4_haswell.c 6 years ago
  wjc404 312060d0d6
Update CONTRIBUTORS.md 6 years ago
  wjc404 cd765f094b
Update cgemm3m_kernel_8x4_haswell.c 6 years ago
  wjc404 64639f440f
Update param.h 6 years ago
  wjc404 3a66c8cac1
Update KERNEL.ZEN 6 years ago
  wjc404 4c35b8dbaa
Update gemm3m_level3.c 6 years ago
  wjc404 ed9af2f7da
Update KERNEL.HASWELL 6 years ago
  wjc404 5fd1edead9
Create cgemm3m_kernel_8x4_haswell.c 6 years ago
  wjc404 eeecd623d8
Update cgemm_kernel_8x2_haswell.c 6 years ago
  wjc404 3ce6bcdb5f
Update CONTRIBUTORS.md 6 years ago
  wjc404 6fbe51072b
Update CONTRIBUTORS.md 6 years ago
  wjc404 611445c7f8
Update param.h 6 years ago
  wjc404 2cd9306bb5
Update KERNEL.ZEN 6 years ago
  wjc404 c418c81224
Update KERNEL.HASWELL 6 years ago
  wjc404 025741f16a
Fast Haswell CGEMM kernel 6 years ago
  wjc404 105e26e12a
Adjust Haswell ZGEMM blocking parameters 6 years ago
  wjc404 f41d52665d
Fast Haswell ZGEMM kernel 6 years ago
  wjc404 d573d24de7
Fast Haswell ZGEMM kernel 6 years ago
  Martin Kroeker 31d6c2eb7d
Merge pull request #2340 from Zeyiii/develop 6 years ago
  w00421467 b7cc69ee62 declare DGEMM_BETA in KERNEL.ARMV8 rather than the generic KERNEL 6 years ago
  w00421467 aeef942c4f use arm neon instructions to optimize gemm beta operation 6 years ago
  Martin Kroeker 445ca2f418
Merge pull request #2339 from Jehan/wip/Jehan/fix-timeout 6 years ago
  Jehan 13226e3101 driver: more reasonable thread wait timeout on Windows. 6 years ago
  Martin Kroeker 1a6ea8ee6d
Merge pull request #2338 from kavanabhat/aix_mod 6 years ago
  Martin Kroeker c6ecb195e6
Merge pull request #2337 from martin-frbg/issue2336 6 years ago
  Martin Kroeker b28db31429
Support two-digit version numbers in gcc version check 6 years ago
  Kavana Bhat 6baa9b07d7 AIX changes for Power8 6 years ago
  Martin Kroeker a4896b5538
Update DYNAMIC_ARCH support for ARM64 and PPC (#2332) 6 years ago
  Kavana Bhat 3938e59569 AIX changes for Power8 6 years ago
  Martin Kroeker 9d5079008f
Merge pull request #2334 from martin-frbg/fix2228 6 years ago
  Martin Kroeker 3518617f5b
Add Intel Goldmont+ cpuid 6 years ago
  Martin Kroeker 715f4650d9
Delete stray copy of dynamic.c from PR 2228 6 years ago
  Martin Kroeker 10705183ce
Merge pull request #20 from xianyi/develop 6 years ago
  Martin Kroeker 235599f17a
Merge pull request #2329 from isuruf/patch-1 6 years ago
  Isuru Fernando b863b32ac5 Workaround an ICE in clang 9.0.0 6 years ago
  Martin Kroeker dd04143d4a
Merge pull request #2328 from martin-frbg/ppc9 6 years ago
  Martin Kroeker f3a6164bff
Merge pull request #2324 from antonblanchard/power9_segv 6 years ago
  Martin Kroeker dedd822d1a
Fix caxpy/caxpyc naming in localentry 6 years ago
  Martin Kroeker 2181fb7047
Fix caxpy/caxpyc naming in localentry 6 years ago
  Martin Kroeker a9b62c03f8
Substitute precompiled gcc7 codes only when gcc is older than 9.x 6 years ago