1158 Commits (be09551cdf2efdddb3d671c1355c46560e4610f2)

Author SHA1 Message Date
  AbdelRauf be09551cdf aligned 6 years ago
  AbdelRauf a97b301aaa cgemm/ctrmm power9 6 years ago
  AbdelRauf cdbfb891da new sgemm 8x16 6 years ago
  AbdelRauf 148c4cc5fd conflict resolve 6 years ago
  AbdelRauf d0c3543c3f power9 zgemm ztrmm optimized 6 years ago
  AbdelRauf a469b32cf4 sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52 6 years ago
  AbdelRauf 8fe794f059 improved zgemm power9 based on power8 6 years ago
  AbdelRauf 47f892198c conflict resolve 6 years ago
  AbdelRauf 628b335e83 Merge branch 'develop' of https://github.com/quickwritereader/OpenBLAS into develop 6 years ago
  AbdelRauf 0f105dd8a5 sgemm/strmm 6 years ago
  Martin Kroeker 7c51cc8527 Merge branch 'develop' into develop 6 years ago
  AbdelRauf 853a18bc17 power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself 6 years ago
  Martin Kroeker 03d7110900 Merge pull request #2042 from maomao194313/develop 6 years ago
  Martin Kroeker f18ab6c17b Merge pull request #2051 from martin-frbg/issue2048 6 years ago
  Martin Kroeker 5b95534afc Make TARGET=GENERIC compatible with DYNAMIC_ARCH=1 6 years ago
  Celelibi b7f59da42d Fix crash in sgemm SSE/nano kernel on x86_64 7 years ago
  maomao194313 783ba8058f HiSilicon tsv110 CPUs optimization branch 7 years ago
  Andrew 6eee1beac5 move fix to right place 7 years ago
  Martin Kroeker e12cdf58ef Merge pull request #2024 from martin-frbg/gcc9fixes4 7 years ago
  Martin Kroeker 1860c9456d Merge pull request #2023 from martin-frbg/gcc9fixes3 7 years ago
  Martin Kroeker f9bb76d29a Fix inline assembly constraints in Bulldozer TRSM kernels 7 years ago
  Martin Kroeker efb9038f72 Fix inline assembly constraints 7 years ago
  Martin Kroeker e976557d29 Fix inline assembly constraints 7 years ago
  Martin Kroeker 9d8be15789 Fix inline assembly constraints 7 years ago
  Martin Kroeker d752799a0f Merge pull request #2021 from martin-frbg/gcc9fixes2 7 years ago
  Martin Kroeker c26c0b77a7 Fix wrong constraints in inline assembly 7 years ago
  Martin Kroeker 1c6da2d03c Merge pull request #2019 from martin-frbg/gcc9fixes 7 years ago
  Martin Kroeker 4255a58cd2 Rename operands to put lda on the input/output constraint list 7 years ago
  Martin Kroeker 46e415b140 Save and restore input argument 8 (lda4) 7 years ago
  Bart Oldeman 69a97ca7b9 dgemv_kernel_4x4(Haswell): add missing clobbers for xmm0,xmm1,xmm2,xmm3 7 years ago
  Martin Kroeker 056917d616 Merge pull request #2013 from martin-frbg/issue2011 7 years ago
  Martin Kroeker 718efcec6f Fix out-of-bounds memory access in gemm_beta 7 years ago
  Martin Kroeker f9d67bb5e8 Fix out-of-bounds memory access in gemm_beta 7 years ago
  Martin Kroeker 76bb74fcd4 Merge pull request #2012 from maamountki/z14 7 years ago
  maamountki 0a54c98b9d [ZARCH] Modify constraints 7 years ago
  maamountki bec54ae366 [ZARCH] Fix caxpy 7 years ago
  Martin Kroeker ab1630f9fa Fix declaration of arguments in inline assembly 7 years ago
  Martin Kroeker b824fa70eb Fix declaration of assembly arguments in SSYMV and DSYMV microkernels 7 years ago
  Martin Kroeker 91481a3e4e Fix declaration of input arguments in inline assembly 7 years ago
  Martin Kroeker dc6ac9eab0 Fix declaration of input arguments in the x86_64 s/dGEMV_T and s/dGEMV_N kernels 7 years ago
  maamountki f583674109 [ZARCH] Fix cgemv_t_4 7 years ago
  maamountki 77fe70019f [ZARCH] Fix constraints and source code formatting 7 years ago
  maamountki 7039770165 [ZARCH] Undo the last commit 7 years ago
  maamountki 11a43e8116 [ZARCH] Set alignment hint for vl/vst 7 years ago
  maamountki 61526480f9 [ZARCH] Fix copy constraint 7 years ago
  maamountki 81daf6bc38 [ZARCH] Format source code, Fix constraints 7 years ago
  Martin Kroeker 729e925174 Merge pull request #1996 from quickwritereader/develop 7 years ago
  Ubuntu 498ac98581 Note for unused kernels 7 years ago
  Ubuntu cd9ea45463 NBMAX=4096 for gemvn, added sgemvn 8x8 for future 7 years ago
  Martin Kroeker f9c5023e04 Merge pull request #1994 from quickwritereader/develop 7 years ago