1440 Commits (5fa581c87e0f3979d0fc70b4ea485fc0d898ffb3)

Author SHA1 Message Date
  Ashwin Sekhar T K 4e1be0e481 ARM64: Add THUNDERX3T110 Target 5 years ago
  Rajalakshmi Srinivasaraghavan 9be2688c78 Fix to store results in correct order for POWER10 GEMM kernels 5 years ago
  Martin Kroeker 6a2a60038c Merge pull request #2720 from martin-frbg/issue2694 5 years ago
  Martin Kroeker 251a09ec90 Typo fix 5 years ago
  Martin Kroeker 95d37e1575 Regroup the 32 and 64bit sections and restore 64bit CAXPY 5 years ago
  Martin Kroeker 3523bb778e Merge pull request #2721 from martin-frbg/p8align 5 years ago
  Martin Kroeker bf1f0734ff Use OPENBLAS_MAKE_COMPLEX_FLOAT on PPC only 5 years ago
  Martin Kroeker ca3561cab9 Add ifdefs around call to altivec microkernel 5 years ago
  Martin Kroeker 21072e502a Typo fix 5 years ago
  Martin Kroeker 7c6e56b5df Rewrite assignment to complex for better portability 5 years ago
  Martin Kroeker 661c6bfa5a Exclude altivec code paths if the compiler does not support them 5 years ago
  Martin Kroeker 0033f8be0d Use vec_vsx_ld/st to fix misaligned accesses flagged by asan 5 years ago
  Martin Kroeker f308e741b2 remove debug output and revert changes to cdot and crot 5 years ago
  Martin Kroeker da17abec87 fix trailing whitespace 5 years ago
  Martin Kroeker f8c2697701 Use POWER6 GEMM, TRMM and DTRSM on 32bit POWER8 5 years ago
  Martin Kroeker b144423f0f Do not define USE_TRMM for 32bit POWER8 5 years ago
  Martin Kroeker ed7e155c35 Merge branch 'develop' into aix 5 years ago
  EGuesnet 634e1305f9 Update cgemm_kernel_8x4_power8.S 5 years ago
  Martin Kroeker 28d69e0097 Merge pull request #2687 from martin-frbg/utfbom 5 years ago
  Martin Kroeker c2467c9619 Merge pull request #2686 from RajalakshmiSR/p10_shgemm 5 years ago
  Martin Kroeker d199c2787d Merge pull request #2680 from kavanabhat/aix_makefile_fix 5 years ago
  Martin Kroeker e30ad0e521 Strip UTF8 byte order marker from source 5 years ago
  Rajalakshmi Srinivasaraghavan d23419accc powerpc: Optimized SHGEMM kernel for POWER10 5 years ago
  Martin Kroeker c854ef5471 Fix variable names in conditional 5 years ago
  Martin Kroeker c0afc11742 Fix POWERPC builds on AIX (gcc/gfortran 7) 5 years ago
  Gordon Fossum bb2f52844b powerpc: Optimized ZGEMM kernel for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 571eadb880 powerpc: Optimized SGEMM/DGEMM/CGEMM for POWER10 5 years ago
  Kavana Bhat df4ade070f Fix for #2671 5 years ago
  Martin Kroeker 93592d1260 Merge pull request #2675 from wjc404/develop 5 years ago
  wjc404 086d87a302 AVX512 dgemm tcopy_16 function 5 years ago
  Rajalakshmi Srinivasaraghavan 9fe930f205 powerpc: Add support for future processor 5 years ago
  ZhangDanfeng bc6fd20a40 fix INIT8x4 5 years ago
  Martin Kroeker 89091e6b64 Merge pull request #2645 from martin-frbg/misc_fixes 5 years ago
  Martin Kroeker c3574ffe53 Merge pull request #2646 from wjc404/develop 5 years ago
  wjc404 0e3ac4a06b Add files via upload 5 years ago
  Martin Kroeker 7f60fb6b91 Delete spurious copy of common_param.h 5 years ago
  ZhangDanfeng 9b7877ccf1 sgemm copy source init 5 years ago
  ZhangDanfeng f82fa802d1 Insert prefetch 5 years ago
  Martin Kroeker b1ee81228a Change complex DOT and ROT to generic kernels and switch CGEMM 5 years ago
  张丹枫 9df79ae9a3 update sgemm and strmm kernel selecting strategy 5 years ago
  张丹枫 a1fc6041cd use general register to speedup 5 years ago
  张丹枫 edb423d772 align general register using to strmm_kernel_8x8 5 years ago
  zhangdanfeng 0e6eb8c247 sgemm kernel use sgemm_kernel_8x8_cortexa53 5 years ago
  zhangdanfeng d475db29c6 optimized for cortex-a53 5 years ago
  Marius Hillenbrand 89fe17f20e s390x: Use new sgemm kernel also for DGEMM and DTRMM on Z14 5 years ago
  Marius Hillenbrand bdd795ed03 s390x/GEMM: replace 0-init with peeled first iteration 5 years ago
  Marius Hillenbrand 2840432e49 s390x: improvise vector alignment hints for older compilers 5 years ago
  Marius Hillenbrand 1b0b4349a1 s390x/Z14: Change register blocking for SGEMM to 16x4 5 years ago
  Marius Hillenbrand 71b6eaf459 s390x: Use new sgemm kernel also for strmm on Z14 and newer 5 years ago
  Marius Hillenbrand 43c0d4f312 s390x: Add vectorized sgemm kernel for Z14 and newer 5 years ago