90 Commits (abea977ded8729c6dcfcfbee51a18eceef8d8440)

Author SHA1 Message Date
  Martin Kroeker 3e3ccb9011
Add ARM64 implementations of ?sum 6 years ago
  maomao194313 783ba8058f
HiSilicon tsv110 CPUs optimization branch 7 years ago
  Martin Kroeker 7639f2e1f0
Rewrite the conditional for OSX to fix cmake parsing on others 7 years ago
  Martin Kroeker 6ba30e270d
Fix typo that broke CNRM2 on ARMV8 since 0.3.0 7 years ago
  Renato Golin 310ea55f29 Simplifying ARMv8 build parameters 7 years ago
  Ashwin Sekhar T K d5aeff636f ARM64: Enable DYNAMIC_ARCH 7 years ago
  Ashwin Sekhar T K d50abc8903 ARM64: Move parameters from parameter.c to param.h 7 years ago
  Ashwin Sekhar T K 351a0c777c ARM64: Remove XGENE1 references 7 years ago
  Ashwin Sekhar T K 21f46a1cf2 ARM64: Use THUNDERX2T99 Neon Kernels for ARMV8 7 years ago
  Ashwin Sekhar T K caf339412f ARM64: Remove dependency of THUNDERX2T99 Makefile on CORTEXA57 Makefile 7 years ago
  Ashwin Sekhar T K 8001fdcd2a ARM64: Remove dependency of THUNDERX Makefile on ARMV8 Makefile 7 years ago
  Ashwin Sekhar T K 162e312832 ARM64: Remove dependency of CORTEXA57 Makefile on ARMV8 Makefile 7 years ago
  Ashwin Sekhar T K c3d93caa8d ARM64: Remove dependency of XGENE1 Makefile on ARMV8 Makefile 7 years ago
  Martin Kroeker 1cb7b9015e
Conditional compilation of assembly files that IOS does not like 7 years ago
  Martin Kroeker a4bd41e9f2
Fix paths to C kernels for nrm2 7 years ago
  Craig Donner c2545b0fd6 Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
  Ashwin Sekhar T K fa9ca65c0e ARM64: Fix utest dsdot errors 8 years ago
  Martin Kroeker c9d408064a
Use dot.S also for DSDOT on CORTEXA57 8 years ago
  Martin Kroeker 288d1a3f6e
Use dot.S also for DSDOT on ARMV8 8 years ago
  Martin Kroeker b47e6822aa
Enable most assembly kernels in the generic ARMV8 target 8 years ago
  Ashwin Sekhar T K a0128aa489 ARM64: Convert all labels to local labels 8 years ago
  Ashwin Sekhar T K 4899d67f7d THUDNERX2T99: Fix clang compilation 8 years ago
  Ashwin Sekhar T K 67473d09dd THUNDERX2T99: Bug Fixes in D/Z NRM2 and ZGEMM 9 years ago
  Ashwin Sekhar T K 19ba133383 THUNDERX2T99: Add Optimized ZGEMM Implementation 9 years ago
  Ashwin Sekhar T K a3935f0dfb THUNDERX2T99: Add Optimized D/Z NRM2 Implementation 9 years ago
  Ashwin Sekhar T K 738628e9a8 ARM64: Remove unused code 9 years ago
  Ashwin Sekhar T K ab3ffab96a THUNDERX2T99: Add Optimized C/Z DOT Implementation 9 years ago
  Ashwin Sekhar T K f036be9ce2 THUNDERX2T99: Add Optimized SDOT Implementation 9 years ago
  Ashwin Sekhar T K faba876fda THUNDERX2T99: Bug fix in C/Z IAMAX 9 years ago
  Ashwin Sekhar T K 172a62d73e THUNDERX2T99: Add Optimized C/Z IAMAX Implementation 9 years ago
  Ashwin Sekhar T K 228c75a69c THUNDERX2T99: Add parallel SCNRM2 Implementation 9 years ago
  Ashwin Sekhar T K 8e89668f62 THUNDERX2T99: Fix bug in SNRM2 9 years ago
  Ashwin Sekhar T K f63deae9de THUNDERX2T99: Add Optimized S/D IAMAX Implementation 9 years ago
  Ashwin Sekhar T K 071a830e8b THUNDERX2T99: Add optimized S/D/C/Z SWAP Implementations 9 years ago
  Ashwin Sekhar T K d09f88192c THUNDERX2T99: Add optimized S/D/C/Z COPY Implementations 9 years ago
  Ashwin Sekhar T K e58233460a THUDNERX2T99: Add optimized D/C/Z ASUM Implementations 9 years ago
  Ashwin Sekhar T K 99bd2892bf THUNDERX2T99: Add optimized CASUM Implementation 9 years ago
  Ashwin Sekhar T K ff6f572f2e THUNDERX2T99: Rename labels in for DDOT and SNRM2 9 years ago
  Ashwin Sekhar T K e0dc5f58c5 THUNDERX2T99: Remove Duplicate Code 9 years ago
  Ashwin Sekhar T K 2757b49767 THUNDERX2T99: Add Optimized CGEMM Implementation 9 years ago
  Ashwin Sekhar T K 907e286eb6 THUNDERX2T99: Add threaded SNRM2 Implementation 9 years ago
  Ashwin Sekhar T K cde3aee08b ARM64: Rename kernel files to have consistent naming 9 years ago
  Ashwin Sekhar T K ee6ea7e988 THUNDERX2T99: Add Optimized CNRM2 Implementation 9 years ago
  Ashwin Sekhar T K ca0b36b012 THUNDERX2T99: Add Optimized SNRM2 Implementation 9 years ago
  Ashwin Sekhar T K d0a79ca6e0 THUNDERX2T99: Add threaded DDOT Implementation 9 years ago
  Ashwin Sekhar T K 0c07003ccf THUNDERX2T99: Add Optimized DDOT Implementation 9 years ago
  Ashwin Sekhar T K f33fcedb30 THUNDERX2T99: Improve SGEMM 9 years ago
  Ashwin Sekhar T K 0f1d6e8b39 THUNDERX2T99: Improve DGEMM 9 years ago
  Ashwin Sekhar T K 981064acc6 THUNDERX2T99: Add Optimized DAXPY Implementation 9 years ago
  Ashwin Sekhar T K f279ff4789 THUNDERX2T99: Add Optimized SGEMM Implementation 9 years ago