1156 Commits (cdbe0f0235b0d23b19daeb40fab98ec83260197c)

Author SHA1 Message Date
  Martin Kroeker cdbe0f0235
Add MIPS implementation of ?sum 6 years ago
  Martin Kroeker f8b82bc6dc
Add ia64 implementation of ?sum 6 years ago
  Martin Kroeker 3e3ccb9011
Add ARM64 implementations of ?sum 6 years ago
  Martin Kroeker 94ab4e6fb2
Add ARM implementations of ?sum 6 years ago
  Martin Kroeker c3cfc6986b
Add implementations of ssum/dsum and csum/zsum 6 years ago
  Martin Kroeker b9f4943a14
Add ?sum 6 years ago
  Martin Kroeker 32c7063cb0
Merge pull request #2061 from martin-frbg/martin-frbg-patch-1 6 years ago
  Martin Kroeker 7c51cc8527
Merge branch 'develop' into develop 6 years ago
  AbdelRauf 853a18bc17 power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself 6 years ago
  Martin Kroeker e608d4f7fe
Disable the AVX512 DGEMM kernel (again) 6 years ago
  Martin Kroeker 03d7110900
Merge pull request #2042 from maomao194313/develop 6 years ago
  Martin Kroeker f18ab6c17b
Merge pull request #2051 from martin-frbg/issue2048 6 years ago
  Martin Kroeker 5b95534afc
Make TARGET=GENERIC compatible with DYNAMIC_ARCH=1 6 years ago
  Celelibi b7f59da42d Fix crash in sgemm SSE/nano kernel on x86_64 7 years ago
  maomao194313 783ba8058f
HiSilicon tsv110 CPUs optimization branch 7 years ago
  Andrew 6eee1beac5 move fix to right place 7 years ago
  Martin Kroeker e12cdf58ef
Merge pull request #2024 from martin-frbg/gcc9fixes4 7 years ago
  Martin Kroeker 1860c9456d
Merge pull request #2023 from martin-frbg/gcc9fixes3 7 years ago
  Martin Kroeker f9bb76d29a
Fix inline assembly constraints in Bulldozer TRSM kernels 7 years ago
  Martin Kroeker efb9038f72
Fix inline assembly constraints 7 years ago
  Martin Kroeker e976557d29
Fix inline assembly constraints 7 years ago
  Martin Kroeker 9d8be15789
Fix inline assembly constraints 7 years ago
  Martin Kroeker d752799a0f
Merge pull request #2021 from martin-frbg/gcc9fixes2 7 years ago
  Martin Kroeker c26c0b77a7
Fix wrong constraints in inline assembly 7 years ago
  Martin Kroeker 1c6da2d03c
Merge pull request #2019 from martin-frbg/gcc9fixes 7 years ago
  Martin Kroeker 4255a58cd2
Rename operands to put lda on the input/output constraint list 7 years ago
  Martin Kroeker 46e415b140
Save and restore input argument 8 (lda4) 7 years ago
  Bart Oldeman 69a97ca7b9 dgemv_kernel_4x4(Haswell): add missing clobbers for xmm0,xmm1,xmm2,xmm3 7 years ago
  Martin Kroeker 056917d616
Merge pull request #2013 from martin-frbg/issue2011 7 years ago
  Martin Kroeker 718efcec6f
Fix out-of-bounds memory access in gemm_beta 7 years ago
  Martin Kroeker f9d67bb5e8
Fix out-of-bounds memory access in gemm_beta 7 years ago
  Martin Kroeker 76bb74fcd4
Merge pull request #2012 from maamountki/z14 7 years ago
  maamountki 0a54c98b9d
[ZARCH] Modify constraints 7 years ago
  maamountki bec54ae366
[ZARCH] Fix caxpy 7 years ago
  Martin Kroeker ab1630f9fa
Fix declaration of arguments in inline assembly 7 years ago
  Martin Kroeker b824fa70eb
Fix declaration of assembly arguments in SSYMV and DSYMV microkernels 7 years ago
  Martin Kroeker 91481a3e4e
Fix declaration of input arguments in inline assembly 7 years ago
  Martin Kroeker dc6ac9eab0
Fix declaration of input arguments in the x86_64 s/dGEMV_T and s/dGEMV_N kernels 7 years ago
  maamountki f583674109
[ZARCH] Fix cgemv_t_4 7 years ago
  maamountki 77fe70019f
[ZARCH] Fix constraints and source code formatting 7 years ago
  maamountki 7039770165
[ZARCH] Undo the last commit 7 years ago
  maamountki 11a43e8116
[ZARCH] Set alignment hint for vl/vst 7 years ago
  maamountki 61526480f9
[ZARCH] Fix copy constraint 7 years ago
  maamountki 81daf6bc38
[ZARCH] Format source code, Fix constraints 7 years ago
  Martin Kroeker 729e925174
Merge pull request #1996 from quickwritereader/develop 7 years ago
  Ubuntu 498ac98581 Note for unused kernels 7 years ago
  Ubuntu cd9ea45463 NBMAX=4096 for gemvn, added sgemvn 8x8 for future 7 years ago
  Martin Kroeker f9c5023e04
Merge pull request #1994 from quickwritereader/develop 7 years ago
  Ubuntu 4abc375a91 sgemv cgemv pairs 7 years ago
  Martin Kroeker 874df65491
Fix incorrect sgemv results for IBM z14 7 years ago