4557 Commits (456dc04441fa72b11e600a69c8d752a0d93ecc8a)
 

Author SHA1 Message Date
  Martin Kroeker 456dc04441
Update sgemm_kernel_16x4_skylakex_3.c 5 years ago
  Martin Kroeker 89323458a9
preset optimization level for apple clang 5 years ago
  Martin Kroeker e153bdeb70
Update dynamic_arch.yml 5 years ago
  Martin Kroeker c2001f7756
Make cmake build verbose to see options in use 5 years ago
  Martin Kroeker c2b3f0b3f6
Revert "keep Apple Clang from optimizing this" 5 years ago
  Martin Kroeker 9f7358d7dc
Keep Apple Clang from optimizing this 5 years ago
  Martin Kroeker 54fa90fb25
Keep apple clang 11.0.3 from trying to optimize this (and running out of registers) 5 years ago
  Leonard Lausen 5a709b8340 Print CPU info in output 5 years ago
  Leonard Lausen b31a68b835 Add Github Actions test for DYNAMIC_ARCH builds 5 years ago
  Martin Kroeker a349d48d89
Merge pull request #2636 from martin-frbg/issue2634 5 years ago
  Martin Kroeker 4db00121dc
Disable EXPRECISION and add -lm on OSX (same as the BSDs and Linux) 5 years ago
  Martin Kroeker 909897f13b
Document option USE_LOCKING 5 years ago
  Martin Kroeker e79245acd9
Merge pull request #2635 from ilayn/patch-1 5 years ago
  Ilhan Polat 76d2612e0c
BUG: Fix the loop range in ZHEEQUB.f 5 years ago
  Martin Kroeker dd7a650792
Merge pull request #59 from xianyi/develop 5 years ago
  Martin Kroeker 4a4c50a7ce
Merge pull request #2627 from pkubaj/patch-1 5 years ago
  Martin Kroeker d069780e63
Merge pull request #2626 from docularxu/working-gcc-version-detections 5 years ago
  pkubaj 33c8790603
Add powerpc (32-bit) 5 years ago
  Guodong Xu 06387ac0e6 make GCC version detection OS-independent 5 years ago
  Martin Kroeker f1a18d245b
Merge pull request #2618 from craft-zhang/cortex-A53 5 years ago
  张丹枫 2a3aa91354 update CONTRIBUTORS.md, adding myself 5 years ago
  张丹枫 ea5bdc3f72 split cortex-a53 param to match 8x8 kernel 5 years ago
  张丹枫 9df79ae9a3 update sgemm and strmm kernel selecting strategy 5 years ago
  张丹枫 a1fc6041cd use general register to speedup 5 years ago
  张丹枫 edb423d772 align general register using to strmm_kernel_8x8 5 years ago
  zhangdanfeng 0e6eb8c247 sgemm kernel use sgemm_kernel_8x8_cortexa53 5 years ago
  zhangdanfeng d475db29c6 optimized for cortex-a53 5 years ago
  Martin Kroeker 729ac6bd4a
Merge pull request #2623 from mhillenibm/zarch_dgemm_z14 5 years ago
  Marius Hillenbrand 89fe17f20e s390x: Use new sgemm kernel also for DGEMM and DTRMM on Z14 5 years ago
  Marius Hillenbrand bdd795ed03 s390x/GEMM: replace 0-init with peeled first iteration 5 years ago
  Martin Kroeker e1038ea836
Merge pull request #2622 from martin-frbg/issue2619 5 years ago
  Martin Kroeker 6baa9a778d
Improve declaration of LAPACKE_get_nancheck 5 years ago
  Martin Kroeker cf46c9f84e
Merge pull request #2617 from martin-frbg/issue2616 5 years ago
  Martin Kroeker 55602fce56
Ignore spurious all-numeric library names derived from mishandled jobserver flags 5 years ago
  Martin Kroeker 3d5e159e7a
Ignore spurious all-numeric library names derived from mishandled jobserver flags 5 years ago
  Martin Kroeker 2931feb575
Merge pull request #58 from xianyi/develop 5 years ago
  Martin Kroeker 20245ded5f
Merge pull request #2615 from mhillenibm/z14_alignment_hints 5 years ago
  Marius Hillenbrand 2840432e49 s390x: improvise vector alignment hints for older compilers 5 years ago
  Martin Kroeker ea78106c71
Merge pull request #2614 from mhillenibm/gemm_vec_z14 5 years ago
  Marius Hillenbrand cb9dc36dd5 Update CONTRIBUTORS.md 5 years ago
  Marius Hillenbrand 1b0b4349a1 s390x/Z14: Change register blocking for SGEMM to 16x4 5 years ago
  Marius Hillenbrand 71b6eaf459 s390x: Use new sgemm kernel also for strmm on Z14 and newer 5 years ago
  Marius Hillenbrand 43c0d4f312 s390x: Add vectorized sgemm kernel for Z14 and newer 5 years ago
  Marius Hillenbrand d7c1677c20 Update CONTRIBUTORS.md, adding myself 5 years ago
  Marius Hillenbrand 0dbe61a612 s390x: choose SIMD kernels at run-time based on OS and compiler support 5 years ago
  Marius Hillenbrand 62cf391cbb s390x: only build kernels supported by gcc with dynamic arch support 5 years ago
  Marius Hillenbrand 8c338616f9 s390x: gate dynamic arch detection on gcc version and add generic 5 years ago
  Martin Kroeker f94c53ec0a
Merge pull request #2612 from RajalakshmiSR/testshgemm 5 years ago
  Rajalakshmi Srinivasaraghavan 8efba9b7c0 Improve shgemm test 5 years ago
  Martin Kroeker 4fffa556d8
Merge pull request #2611 from RajalakshmiSR/bench_half 5 years ago