421 Commits (165f00c159cf0c4e7e6eef8f656fa68e7cda4ea2)

Author SHA1 Message Date
  Martin Kroeker 6e54b0a027 Disable the 16x2 DTRMM kernel on SkylakeX as well 7 years ago
  Martin Kroeker f0a8dc2eec Disable the AVX512 DGEMM kernel for now 7 years ago
  Craig Donner c2545b0fd6 Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
  Arjan van de Ven 89372e0993 Use AVX512 also for DGEMM 7 years ago
  Arjan van de Ven 99c7bba8e4 Initial support for SkylakeX / AVX512 7 years ago
  Martin Kroeker 840e01061f Merge pull request #1491 from martin-frbg/ddot_mt 7 years ago
  Martin Kroeker a55694dd5b Declare dot_compute static to avoid conflicts in multiarch builds 7 years ago
  Martin Kroeker 85a41e9cdb Add multithreading support for Haswell DDOT 7 years ago
  Martin Kroeker 81215711a2 Re-enable DAXPY microkernels for x86_64 8 years ago
  Martin Kroeker 497f0c3d8a Replace .align with .p2align in the Nehalem microkernels 8 years ago
  Martin Kroeker ea37db828e Convert .align to .p2align for OSX compatibility 8 years ago
  Martin Kroeker 7c1925acec Use .p2align instead of .align for compatibility on Sandybridge as well 8 years ago
  Martin Kroeker 2359c7c1a9 Use .p2align instead of .align for portability 8 years ago
  Martin Kroeker e388459a27 Merge pull request #1419 from brada4/develop 8 years ago
  Andrew 4938faa822 core.IdenticalExpr clang501 checker 8 years ago
  Martin Kroeker 42285d8e70 Merge pull request #1410 from brada4/develop 8 years ago
  Andrew 4d0b005e5b Eliminate remaining unused results in kernels (clang5 analyzer) 8 years ago
  Martin Kroeker b81656936f Merge pull request #1409 from martin-frbg/issue1292-2 8 years ago
  Martin Kroeker b973990df2 Tag %1 and %2 as both input and output operands 8 years ago
  Martin Kroeker 1e31124eb0 Merge pull request #1406 from martin-frbg/issue1292 8 years ago
  Martin Kroeker 723f396a20 Tag %1 and %2 as both input and output 8 years ago
  Martin Kroeker 43c0622e7b Retire Piledriver/Steamroller/Excavator daxpy microkernels as well 8 years ago
  Martin Kroeker 0623636c98 Use Sandybridge daxpy kernel on Haswell and Zen for now 8 years ago
  Andrew 281a2b952f warning cleanup (#1380) 8 years ago
  Martin Kroeker 6c77b5f267 Merge pull request #1369 from martin-frbg/dsdot 8 years ago
  Martin Kroeker c92cd6d162 Add trivially optimized dsdot based on sdot 8 years ago
  Martin Kroeker cae5d9a20b Add trivially optimized dsdot based on sdot 8 years ago
  Martin Kroeker 3d891c3106 Add trivially optimized dsdot based on sdot 8 years ago
  Martin Kroeker 4fbdcfa823 Add trivially optimized dsdot based on sdot 8 years ago
  Martin Kroeker 1bb6a96ebc Add trivially optimized dsdot based on sdot 8 years ago
  Martin Kroeker 6bd163f37a Add trivially optimized dsdot based on sdot 8 years ago
  Martin Kroeker f0333333d1 Add trivially optimized dsdot based on sdot 8 years ago
  Andrew e89b979b2c fix spurious compiler warning fix (no code change) 8 years ago
  Andrew 7e9b29b9b8 fix spurious compiler warning (no code change) 8 years ago
  Martin Kroeker 6157d0902a Merge pull request #1358 from martin-frbg/unused_vars 8 years ago
  Martin Kroeker 3fea849bbf Remove unused variables from Haswell dtrmm and Bulldozer dtrsm 8 years ago
  Martin Kroeker 8f177621bc Remove unused variables at0...at3 from ?symv_U 8 years ago
  Martin Kroeker 5f402b7759 Remove unused (loop?) variable j from the gemv_n_4 implementations 8 years ago
  Martin Kroeker a07807caac Eliminate loop code when called as/from dsdot 8 years ago
  Martin Kroeker 5e3e91d0fc Split the microkernel workload into chunks of 32 floats for dsdot mode to limit loss of precision 8 years ago
  Martin Kroeker 28c3fa8950 Add dsdot 8 years ago
  Martin Kroeker 8ac87c1cb6 Implement DSDOT with unchanged sdot microkernels 8 years ago
  Isuru Fernando 505b218829 Merge remote-tracking branch 'upstream/develop' into dyn 8 years ago
  Isuru Fernando 1d1854032b Add missing EXCAVATOR 8 years ago
  Isuru Fernando 2c51a990ac Fix extra whitespaces. CMake parser macro fails with it 8 years ago
  Isuru Fernando ca17b4b75c Fix complex support for MSVC headers 8 years ago
  Denis Steckelmacher c9ff735da6 Add ZEN support (tested for auto-detected static backend) 9 years ago
  Martin Kroeker a6efabf155 Replace gnu _real_ , _imag_ extensions in initializers 9 years ago
  Martin Kroeker dc34a0da96 Merge pull request #915 from mdong/small_fix_for_icc 9 years ago
  Martin Kroeker 4998e19869 Change file comments to work around clang 3.9 assembler bug 9 years ago