20 Commits (develop)

Author SHA1 Message Date
  Martin Kroeker c2a8ebfe69 Add workaround for NVIDIA HPC mishandling of the asm DOT kernels 5 years ago
  Qiyu8 60e6c68e38 Adapt ARM architect 5 years ago
  shengyang 80db5f11e1 update 6 years ago
  Martin Kroeker 44028581cc Merge pull request #2355 from Zeyiii/dev-zeyi2 6 years ago
  shengyang 8d84403205 Use arm neon instructions to optimize ncopy operation 6 years ago
  w00421467 0833a4846a Use arm neon instructions to optimize sgemm_beta operation 6 years ago
  zq 50f7fc1401 [WIP] Use arm neon instructions to optimize tcopy operation 6 years ago
  w00421467 b7cc69ee62 declare DGEMM_BETA in KERNEL.ARMV8 rather than the generic KERNEL 6 years ago
  Martin Kroeker 85ccdce8c4 Remove the IOS fallbacks to generic C kernels 6 years ago
  Martin Kroeker 7639f2e1f0 Rewrite the conditional for OSX to fix cmake parsing on others 7 years ago
  Martin Kroeker 6ba30e270d Fix typo that broke CNRM2 on ARMV8 since 0.3.0 7 years ago
  Renato Golin 310ea55f29 Simplifying ARMv8 build parameters 7 years ago
  Ashwin Sekhar T K d5aeff636f ARM64: Enable DYNAMIC_ARCH 7 years ago
  Ashwin Sekhar T K 21f46a1cf2 ARM64: Use THUNDERX2T99 Neon Kernels for ARMV8 7 years ago
  Martin Kroeker 1cb7b9015e Conditional compilation of assembly files that IOS does not like 7 years ago
  Martin Kroeker 288d1a3f6e Use dot.S also for DSDOT on ARMV8 8 years ago
  Martin Kroeker b47e6822aa Enable most assembly kernels in the generic ARMV8 target 8 years ago
  Benedikt Huber 58c90d5937 # The first commit's message is: 11 years ago
  Timothy Gu 6c2ead30f0 Remove all trailing whitespace except lapack-netlib 11 years ago
  wernsaar fe5f46c330 added experimental support for ARMV8 12 years ago