Commit Graph

  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • |\
  • | *
  • | |\
  • | | | *
  • | | |/
  • | |/|
  • | | *
  • | | *
  • | | *
  • | | *
  • | | *
  • | | *
  • | | *
  • | * |
  • | |\|
  • | | | *
  • | | |/
  • | |/|
  • | | *
  • | | *
  • | | *
  • | | *
  • | | *
  • | | *
  • | | *
  • | * |
  • | |\|
  • | | *
  • | | *
  • | | *
  • | |/
  • | *
  • | |\
  • | | *
  • | | *
  • | |/
  • | *
  • | |\
  • | | *
  • | |/
  • | *
  • | |\
  • | * \
  • | |\ \
  • | | * |
  • | |/ /
  • |/| |
  • | | *
  • | * |
  • | | *
  • | * |
  • | |\ \
  • | | | | *
  • | | | | *
  • | | | | *
  • | | | | *
  • | | | | *
  • | | | | *
  • | | * | |
  • | | * | |
  • | |/ / /
  • |/| | |
  • * | | |
  • |\| | |
  • | | | *
  • | | | | *
  • | | | | *
  • | | | | *
  • | | | | *
  • | | | * |
  • | | | * |
  • | | | * |
  • | | | * |
  • | |_|/ /
  • |/| | |
  • | | | *
  • | | | *
  • | | | *
  • | | | *
  • | | | *
  • | | |/
  • | |/|
  • | * |
  • | |\ \
  • | | * |
  • | |/ /
  • | * |
  • | |\ \
  • | * \ \
  • | |\ \ \
  • | | | * |
  • | | | * |
  • | | | * |
  • | | | * |
  • | |_|/ /
  • |/| | |
  • * | | |
  • |\| | |
  • | * | |
  • | |\ \ \
  • | | * | |
  • | | * | |
  • | * | | |
  • | |\ \ \ \
  • | * \ \ \ \
  • | |\ \ \ \ \
  • | * \ \ \ \ \
  • | |\ \ \ \ \ \
  • | | | * | | | |
  • | |_|/ / / / /
  • |/| | | | | |
  • | | * | | | |
  • | |/ / / / /
  • | * | | | |
  • | |\ \ \ \ \
  • | | * | | | |
  • | |/ / / / /
  • |/| | | | |
  • | | | | * |
  • | | |_|/ /
  • | |/| | |
  • | | | * |
  • | | |/ /
  • | |/| |
  • | * | |
  • | | * |
  • | |/ /
  • |/| |
  • | * |
  • | * |
  • | |\ \
  • | * \ \
  • | |\ \ \
  • | | | | | *
  • | | |_|_|/
  • | |/| | |
  • | | * | |
  • | |/ / /
  • | | * /
  • | |/ /
  • | * |
  • | |\ \
  • | | * |
  • | * | |
  • | |\ \ \
  • 079137350 Update LAPACK to 3.9.0 by Martin Kroeker 2019-12-29 18:39:06 +0100
  • e6ab4b0ca Update LAPACK to 3.9.0 by Martin Kroeker 2019-12-29 18:37:49 +0100
  • 6d9ff1191 Update LAPACK to 3.9.0 by Martin Kroeker 2019-12-29 18:36:50 +0100
  • 9854d7fc0 Update LAPACK to 3.9.0 by Martin Kroeker 2019-12-29 18:35:21 +0100
  • ce0d38cc9 Update LAPACK to 3.9.0 by Martin Kroeker 2019-12-29 18:33:19 +0100
  • 6f2f06547 Update LAPACK to 3.9.0 by Martin Kroeker 2019-12-29 18:30:23 +0100
  • 82e45b23e Update LAPACK to 3.9.0 by Martin Kroeker 2019-12-29 18:28:33 +0100
  • 26c55db4e Update LAPACK to 3.9.0 by Martin Kroeker 2019-12-29 18:27:34 +0100
  • 4ee9b48e5 Update LAPACK to 3.9.0 by Martin Kroeker 2019-12-29 18:24:58 +0100
  • 4a62c8f66 Update make.inc entries for LAPACK 3.9.0 by Martin Kroeker 2019-12-29 18:15:43 +0100
  • 0257f2648 Merge pull request #21 from xianyi/develop by Martin Kroeker 2019-12-29 18:08:55 +0100
  • c45b7aef1 Merge pull request #2348 from wjc404/develop by Martin Kroeker 2019-12-28 20:07:56 +0100
  • 0a72d67e7 add in runtime cpu detection for zarch by nk521 2019-12-28 21:19:38 +0530
  • 312060d0d (refs/pull/2348/head) Update CONTRIBUTORS.md by wjc404 2019-12-27 23:36:13 +0800
  • cd765f094 Update cgemm3m_kernel_8x4_haswell.c by wjc404 2019-12-27 18:23:29 +0800
  • 64639f440 Update param.h by wjc404 2019-12-27 18:06:42 +0800
  • 3a66c8cac Update KERNEL.ZEN by wjc404 2019-12-27 18:04:08 +0800
  • 4c35b8dba Update gemm3m_level3.c by wjc404 2019-12-27 18:03:01 +0800
  • ed9af2f7d Update KERNEL.HASWELL by wjc404 2019-12-27 18:01:38 +0800
  • 5fd1edead Create cgemm3m_kernel_8x4_haswell.c by wjc404 2019-12-27 18:00:55 +0800
  • 26478eb0d Merge pull request #2345 from wjc404/develop by Martin Kroeker 2019-12-25 22:26:41 +0100
  • e9ed67ed7 (refs/pull/2346/head) LAPACK: avoid out-of-bound write in ?LANTR by Vladimir Chalupecky 2019-12-19 12:17:14 +0100
  • eeecd623d (refs/pull/2345/head) Update cgemm_kernel_8x2_haswell.c by wjc404 2019-12-24 00:40:16 +0800
  • 3ce6bcdb5 Update CONTRIBUTORS.md by wjc404 2019-12-24 00:30:16 +0800
  • 6fbe51072 Update CONTRIBUTORS.md by wjc404 2019-12-24 00:24:40 +0800
  • 611445c7f Update param.h by wjc404 2019-12-23 23:44:55 +0800
  • 2cd9306bb Update KERNEL.ZEN by wjc404 2019-12-23 23:42:30 +0800
  • c418c8122 Update KERNEL.HASWELL by wjc404 2019-12-23 23:41:44 +0800
  • 025741f16 Fast Haswell CGEMM kernel by wjc404 2019-12-23 23:40:03 +0800
  • 0ae49d299 Merge pull request #2344 from wjc404/develop by Martin Kroeker 2019-12-21 12:16:55 +0100
  • 105e26e12 (refs/pull/2344/head) Adjust Haswell ZGEMM blocking parameters by wjc404 2019-12-21 14:38:51 +0800
  • f41d52665 Fast Haswell ZGEMM kernel by wjc404 2019-12-21 14:37:06 +0800
  • d573d24de Fast Haswell ZGEMM kernel by wjc404 2019-12-21 14:35:15 +0800
  • 31d6c2eb7 Merge pull request #2340 from Zeyiii/develop by Martin Kroeker 2019-12-20 08:38:57 +0100
  • b7cc69ee6 (refs/pull/2340/head) declare DGEMM_BETA in KERNEL.ARMV8 rather than the generic KERNEL by w00421467 2019-12-20 10:11:50 +0800
  • aeef942c4 use arm neon instructions to optimize gemm beta operation by w00421467 2019-12-17 10:00:13 +0800
  • 445ca2f41 Merge pull request #2339 from Jehan/wip/Jehan/fix-timeout by Martin Kroeker 2019-12-13 14:57:26 +0100
  • 13226e310 (refs/pull/2339/head) driver: more reasonable thread wait timeout on Windows. by Jehan 2019-12-11 17:51:42 +0100
  • 1a6ea8ee6 Merge pull request #2338 from kavanabhat/aix_mod by Martin Kroeker 2019-12-09 17:54:49 +0100
  • c6ecb195e Merge pull request #2337 from martin-frbg/issue2336 by Martin Kroeker 2019-12-07 09:38:06 +0100
  • b28db3142 (refs/pull/2337/head) Support two-digit version numbers in gcc version check by Martin Kroeker 2019-12-06 21:23:56 +0100
  • 6baa9b07d (refs/pull/2338/head) AIX changes for Power8 by Kavana Bhat 2019-12-06 04:33:32 -0600
  • a4896b553 Update DYNAMIC_ARCH support for ARM64 and PPC (#2332) by Martin Kroeker 2019-12-04 11:06:03 +0100
  • 3938e5956 AIX changes for Power8 by Kavana Bhat 2019-12-04 00:23:46 -0600
  • 9d5079008 Merge pull request #2334 from martin-frbg/fix2228 by Martin Kroeker 2019-12-03 22:23:52 +0100
  • 8be499114 (refs/pull/2332/head) remove spurious copypasta by Martin Kroeker 2019-12-03 21:52:24 +0100
  • 8269288aa Add back the additions to ARM64 dynamic_core by Martin Kroeker 2019-12-03 20:27:27 +0100
  • 80f219e12 Update dynamic_arm64.c by Martin Kroeker 2019-12-03 17:05:07 +0100
  • c240abaff Fix typo by Martin Kroeker 2019-12-03 10:07:12 +0100
  • 8ca7d5a4f Add test for gcc >=9 by Martin Kroeker 2019-12-03 09:41:48 +0100
  • ef752b993 Need at least gcc9 for tsv110 support by Martin Kroeker 2019-12-03 09:41:06 +0100
  • 3518617f5 (refs/pull/2334/head) Add Intel Goldmont+ cpuid by Martin Kroeker 2019-12-03 08:32:29 +0100
  • 715f4650d Delete stray copy of dynamic.c from PR 2228 by Martin Kroeker 2019-12-03 08:24:10 +0100
  • 10705183c Merge pull request #20 from xianyi/develop by Martin Kroeker 2019-12-03 08:22:40 +0100
  • 26799ccbf Fix typos by Martin Kroeker 2019-12-03 08:18:14 +0100
  • f2d787429 (refs/pull/2331/head) Update zgemm3m_kernel_4x4_haswell.c by wjc404 2019-12-03 13:51:27 +0800
  • 4275c7df3 Update cgemm3m_kernel_8x4_haswell.c by wjc404 2019-12-03 13:49:41 +0800
  • 2458f9ec3 update Haswell GEMM3M parameters by wjc404 2019-12-03 13:41:34 +0800
  • fa93ec9ad AVX2 CGEMM3M & ZGEMM3M kernels by wjc404 2019-12-03 13:40:14 +0800
  • ba3eba180 Add prototypes by Martin Kroeker 2019-12-02 23:02:01 +0100
  • 84695e63c Update list of ARM64 targets for DYNAMIC_ARCH and add PPC targets by Martin Kroeker 2019-12-02 20:23:55 +0100
  • fab6361ba Update cpu list by Martin Kroeker 2019-12-02 20:22:36 +0100
  • 4432f96fe Update DYNAMIC_ARCH list of ARM64 targets by Martin Kroeker 2019-12-02 20:21:13 +0100
  • c49a0740b Update zgemm3m_kernel_8x4_skylakex.c by wjc404 2019-12-02 16:29:15 +0800
  • 7c52e0a56 update avx512 zgemm3m kernel by wjc404 2019-12-02 16:01:35 +0800
  • 87773b9be AVX512 ZGEMM3M kernel by wjc404 2019-12-02 15:56:34 +0800
  • 685fb38ba adjust some thresholds to improve performance by wjc404 2019-12-02 15:52:40 +0800
  • b1934ace2 adjust avx512 zgemm3m parameters by wjc404 2019-12-02 15:50:08 +0800
  • 235599f17 Merge pull request #2329 from isuruf/patch-1 by Martin Kroeker 2019-12-02 08:30:43 +0100
  • b863b32ac (refs/pull/2329/head) Workaround an ICE in clang 9.0.0 by Isuru Fernando 2019-12-01 11:55:49 -0600
  • dd04143d4 Merge pull request #2328 from martin-frbg/ppc9 by Martin Kroeker 2019-11-30 12:23:57 +0100
  • f3a6164bf Merge pull request #2324 from antonblanchard/power9_segv by Martin Kroeker 2019-11-30 00:03:42 +0100
  • dedd822d1 (refs/pull/2328/head) Fix caxpy/caxpyc naming in localentry by Martin Kroeker 2019-11-29 23:56:57 +0100
  • 2181fb704 Fix caxpy/caxpyc naming in localentry by Martin Kroeker 2019-11-29 23:54:15 +0100
  • a9b62c03f Substitute precompiled gcc7 codes only when gcc is older than 9.x by Martin Kroeker 2019-11-29 23:49:50 +0100
  • 97762234f Add variable for gcc >=9 test by Martin Kroeker 2019-11-29 23:47:23 +0100
  • 948d11fc5 Merge pull request #19 from xianyi/develop by Martin Kroeker 2019-11-29 23:44:09 +0100
  • c815b8fb8 Merge pull request #2323 from wjc404/develop by Martin Kroeker 2019-11-28 20:55:16 +0100
  • e20709e97 (refs/pull/2323/head) Update param.h by wjc404 2019-11-28 19:57:50 +0800
  • 934e601e9 Update dgemm_kernel_4x8_skylakex_2.c by wjc404 2019-11-28 19:56:35 +0800
  • a4c3668f9 Merge pull request #2321 from martin-frbg/issue2319 by Martin Kroeker 2019-11-28 09:30:24 +0100
  • 867232c6a Merge pull request #2327 from martin-frbg/travisosx by Martin Kroeker 2019-11-28 08:43:45 +0100
  • 5aaf70ef9 Merge pull request #2326 from xianyi/revert-2325-travisosx by Martin Kroeker 2019-11-28 00:17:19 +0100
  • ae2a0995c (refs/pull/2327/head) Cleanup IOS build and disable FORTRAN on 32bit and ios builds for now by Martin Kroeker 2019-11-28 00:15:36 +0100
  • 83dae28ae (refs/pull/2326/head, revert-2325-travisosx) Revert "Cleanup Travis IOS xbuild and disable FORTRAN on 32bit and ios builds for now" by Martin Kroeker 2019-11-28 00:09:06 +0100
  • da986d2e8 Merge pull request #2325 from martin-frbg/travisosx by Martin Kroeker 2019-11-27 21:59:36 +0100
  • 6bc487de3 (refs/pull/2325/head) Cleanup IOS build and disable FORTRAN on 32bit and ios builds for now by Martin Kroeker 2019-11-27 15:10:57 +0100
  • cf2a8e410 (refs/pull/2324/head) Fix SEGV in cdot_power9 by Anton Blanchard 2019-11-26 21:55:04 -0700
  • eb1e9c8c9 some optimizations by wjc404 2019-11-26 14:12:20 +0800
  • f95989cbc Fix AVX512 capability test (always returning zero) by Martin Kroeker 2019-11-23 22:38:07 +0100
  • f3065a0ee (refs/pull/2321/head) Fix race conditions in multithreaded GEMM3M by Martin Kroeker 2019-11-23 19:54:56 +0100
  • 04226f1e9 Add the cpuid of the business/rackmount version of z15 as well by Martin Kroeker 2019-11-21 18:14:29 +0100
  • 0925ef70d Merge pull request #2316 from sharkcz/s390x by Martin Kroeker 2019-11-21 18:03:00 +0100
  • 371e6f73d Merge pull request #2317 from aarnez/develop by Martin Kroeker 2019-11-21 17:59:21 +0100
  • 8fd019723 (refs/pull/2318/head) Correct Inline Assembly name mismatches by Detrez 2019-11-21 14:19:26 +0100
  • d117dfd50 (refs/pull/2317/head) Change bad usage of "asum" to "sum" in ZARCH versions of ?sum by Andreas Arnez 2019-09-20 18:32:47 +0200
  • 883c39773 (refs/pull/2316/head) zarch: treat z15 as z14 instead of generic by Dan Horák 2019-11-21 12:49:54 +0100
  • b09b5be0a Merge pull request #2315 from ewanglong/develop by Martin Kroeker 2019-11-21 05:06:44 +0100
  • bfb5fbdb4 (refs/pull/2315/head) revised fix windows compatible for #2313 by Wang, Long 2019-11-21 10:19:40 +0800
  • 3da6d66da Merge pull request #2314 from Jehan/wip/Jehan/fix-openblas-crash by Martin Kroeker 2019-11-20 16:16:35 +0100