Browse Source

Merge pull request #5348 from hideaki-motoki/issue5343_prefered_size_for_a64fx

Setting `GEMM_PREFERED_SIZE` parameter for `A64FX`
pull/5350/head
Martin Kroeker GitHub 10 months ago
parent
commit
334cd242d4
No known key found for this signature in database GPG Key ID: B5690EEEBB952194
1 changed files with 6 additions and 0 deletions
  1. +6
    -0
      param.h

+ 6
- 0
param.h View File

@@ -3701,6 +3701,12 @@ is a big desktop or server with abundant cache rather than a phone or embedded d

#elif defined(A64FX) // 512-bit SVE

#if defined(XDOUBLE) || defined(DOUBLE)
#define GEMM_PREFERED_SIZE 8
#else
#define GEMM_PREFERED_SIZE 16
#endif

/* When all BLAS3 routines are implemeted with SVE, SGEMM_DEFAULT_UNROLL_M should be "sve_vl".
Until then, just keep it different than DGEMM_DEFAULT_UNROLL_N to keep copy routines in both directions seperated. */
#define SGEMM_DEFAULT_UNROLL_M 4


Loading…
Cancel
Save