504 Commits (c275290ea62079aebbf0c8d98c331d1defe07464)

Author SHA1 Message Date
  Martin Kroeker dccff2e785
Merge pull request #2206 from martin-frbg/zen-dtrmm 6 years ago
  Martin Kroeker 5c3458a6e7
Merge pull request #2199 from martin-frbg/zen-dtrsm 6 years ago
  Martin Kroeker acf6002ab2
Replace most vpermpd calls in the Haswell DTRSM_RN kernel 6 years ago
  Martin Kroeker 2dfb804cb9
Replace vpermpd with vpermilpd in the Haswell DTRMM kernel 6 years ago
  Martin Kroeker 4c153ec9da
Merge pull request #2196 from wjc404/develop 6 years ago
  wjc404 7eecd8e39c
Add files via upload 6 years ago
  Martin Kroeker 7b0b7c11d2
Merge pull request #2190 from martin-frbg/zdot-zen 6 years ago
  Martin Kroeker 28e96458e5
Replace vpermpd with vpermilpd 6 years ago
  wjc404 95fb98f556
Update dgemm_kernel_4x8_haswell.S 6 years ago
  wjc404 4801c6d36b
Update dgemm_kernel_4x8_haswell.S 6 years ago
  wjc404 9440fa607d
Add files via upload 6 years ago
  wjc404 94db259e5b
Add files via upload 6 years ago
  wjc404 f49f8047ac
Add files via upload 6 years ago
  wjc404 825777faab
Update dgemm_kernel_4x8_haswell.S 6 years ago
  wjc404 9c89757562
Add files via upload 6 years ago
  wjc404 9b04baeaee
Update dgemm_kernel_4x8_haswell.S 6 years ago
  wjc404 8a074b3965
Update dgemm_kernel_4x8_haswell.S 6 years ago
  wjc404 211ab03b14
Update dgemm_kernel_4x8_haswell.S 6 years ago
  wjc404 1733f927e6
Update dgemm_kernel_4x8_haswell.S 6 years ago
  wjc404 182b06d6ad
Update dgemm_kernel_4x8_haswell.S 6 years ago
  wjc404 7a9050d681
Update dgemm_kernel_4x8_haswell.S 6 years ago
  wjc404 0ba29fd262
Update dgemm_kernel_4x8_haswell.S for zen2 6 years ago
  Martin Kroeker 9ea30f3788
Replace ISMIN and ISAMIN kernels on all x86_64 platforms (#2125) 6 years ago
  Martin Kroeker b1561ecc68
Disable DGEMMINCOPY as well for now 6 years ago
  Martin Kroeker 7ed8431527
Disable the SkyLakeX DGEMMITCOPY kernel as well 6 years ago
  Martin Kroeker c04a729081
Add ?sum definitions for generic kernel 6 years ago
  Martin Kroeker 9d717cb5ee
Add x86_64 implementation of ?sum 6 years ago
  Martin Kroeker 32c7063cb0
Merge pull request #2061 from martin-frbg/martin-frbg-patch-1 6 years ago
  Martin Kroeker e608d4f7fe
Disable the AVX512 DGEMM kernel (again) 6 years ago
  Celelibi b7f59da42d
Fix crash in sgemm SSE/nano kernel on x86_64 7 years ago
  Andrew 6eee1beac5
move fix to right place 7 years ago
  Martin Kroeker e12cdf58ef
Merge pull request #2024 from martin-frbg/gcc9fixes4 7 years ago
  Martin Kroeker 1860c9456d
Merge pull request #2023 from martin-frbg/gcc9fixes3 7 years ago
  Martin Kroeker f9bb76d29a
Fix inline assembly constraints in Bulldozer TRSM kernels 7 years ago
  Martin Kroeker efb9038f72
Fix inline assembly constraints 7 years ago
  Martin Kroeker e976557d29
Fix inline assembly constraints 7 years ago
  Martin Kroeker 9d8be15789
Fix inline assembly constraints 7 years ago
  Martin Kroeker d752799a0f
Merge pull request #2021 from martin-frbg/gcc9fixes2 7 years ago
  Martin Kroeker c26c0b77a7
Fix wrong constraints in inline assembly 7 years ago
  Martin Kroeker 1c6da2d03c
Merge pull request #2019 from martin-frbg/gcc9fixes 7 years ago
  Martin Kroeker 4255a58cd2
Rename operands to put lda on the input/output constraint list 7 years ago
  Martin Kroeker 46e415b140
Save and restore input argument 8 (lda4) 7 years ago
  Bart Oldeman 69a97ca7b9
dgemv_kernel_4x4(Haswell): add missing clobbers for xmm0,xmm1,xmm2,xmm3 7 years ago
  Martin Kroeker ab1630f9fa
Fix declaration of arguments in inline assembly 7 years ago
  Martin Kroeker b824fa70eb
Fix declaration of assembly arguments in SSYMV and DSYMV microkernels 7 years ago
  Martin Kroeker 91481a3e4e
Fix declaration of input arguments in inline assembly 7 years ago
  Martin Kroeker dc6ac9eab0
Fix declaration of input arguments in the x86_64 s/dGEMV_T and s/dGEMV_N kernels 7 years ago
  Martin Kroeker 32b0f1168e
Fix declaration of input arguments in the Sandybridge GER microkernels (#1967) 7 years ago
  Martin Kroeker b495e54310
Fix declaration of input arguments in the x86_64 SCAL microkernels (#1966) 7 years ago
  Martin Kroeker d5e6940253
Fix declaration of input arguments in the x86_64 microkernels for DOT and AXPY (#1965) 7 years ago