1093 Commits (3f1719a98da89f0a6f1d435d3f705aa083702ac7)

Author SHA1 Message Date
  Martin Kroeker 3f1719a98d
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker dc15f3b5a7
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker 00aff05c40
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker c9078eb8b4
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker de207d10c1
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker c23c17163f
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker c18c2c9d9b
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker ca02ac724f
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker 9d46f84f24
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker 6008f65318
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker d94e7da701
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker 7af8f34df4
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker bb16456fe1
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker 2f5a7c1656
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker 30a7bd8e15
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker 47e2b4592e
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker a671e19dd2
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker 663eef3b66
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker 4e6f8fec31
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker 8a6bbf5a5b
Tag operands 0 and 1 as both input and output 7 years ago
  Martin Kroeker f0dd058430
Tag operands 0 and 1 as both input and output 7 years ago
  Abdelrauf a034e65512
Merge branch 'develop' into develop 7 years ago
  Ubuntu 8c3386be87 Added missing Blas1 single fp {saxpy, caxpy, cdot, crot(refactored version of srot),isamax ,isamin, icamax, icamin}, 7 years ago
  Arjan van de Ven 795285c587 Fix thinko in skylake beta handling 7 years ago
  Arjan van de Ven d321448a63 dgemm: use dgemm_ncopy_8_skylakex.c also for Haswell 7 years ago
  Arjan van de Ven c43331ad0a dgemm: Use the skylakex beta function also for haswell 7 years ago
  Martin Kroeker c4e23dd016
Update Makefile 7 years ago
  Martin Kroeker cfc4acc221
typo 7 years ago
  Martin Kroeker 545c2b1bbb
Add -mavx2 on Haswell only if the compiler supports it 7 years ago
  Arjan van de Ven 69d206440a Make the skylakex/haswell sgemm code compile and run even with compilers without avx2 support 7 years ago
  Martin Kroeker 3843e3e017
use -maxv2 on haswell 7 years ago
  Martin Kroeker fbcb14a74b
should be core-avx2 7 years ago
  Martin Kroeker 2a3190dc76
fix elseifeq and use older option core2-avx for compatibility 7 years ago
  Martin Kroeker 1ebe5c0f49
Add -march=haswell to HASWELL part of DYNAMIC_ARCH build 7 years ago
  Arjan van de Ven 0586899a10 Use sgemm_ncopy_4_skylakex.c also for Haswell 7 years ago
  Arjan van de Ven 00dc09ad19 Use the skylake sgemm beta code also for haswell 7 years ago
  Arjan van de Ven cdc668d82b Add a "sgemm direct" mode for small matrixes 7 years ago
  Martin Kroeker 87718807f0
Merge pull request #1910 from martin-frbg/issue1909 7 years ago
  Martin Kroeker 51aec8e96b
make sure the added march=skylake-avx512 does not cause problems on Windows 7 years ago
  Martin Kroeker 06f7d78d70
Add -march=skylake-avx512 to SkylakeX part of DYNAMIC_ARCH builds 7 years ago
  Martin Kroeker 7639f2e1f0
Rewrite the conditional for OSX to fix cmake parsing on others 7 years ago
  Martin Kroeker 2fc712469d
Avoid creating spurious non-suffixed c/zgemm_kernels 7 years ago
  Martin Kroeker 6ba30e270d
Fix typo that broke CNRM2 on ARMV8 since 0.3.0 7 years ago
  Martin Kroeker 701ea88347
Use p2align instead of align for OSX compatibility 7 years ago
  Martin Kroeker 6c7b691083
Really revert xDOT changes from 1832 7 years ago
  Martin Kroeker 5f4c550c27
Merge pull request #1892 from martin-frbg/mipsdot 7 years ago
  Martin Kroeker 95a5542e3c
Revert DOT kernel changes from #1834 7 years ago
  Martin Kroeker 7a2e1bc804
Use generic kernel for DSDOT/SDSDOT 7 years ago
  Martin Kroeker 35653e38b3
Merge pull request #1834 from fengrl/develop 7 years ago
  Andrew 19c4bdd8b3 Add return value so that freebsd system clang does not err out 7 years ago