Commit Graph

  • *
  • *
  • *
  • *
  • |\
  • | | *
  • | | |\
  • | |_|/
  • |/| |
  • * | |
  • |\ \ \
  • | | | | *
  • | |_|_|/
  • |/| | |
  • | * | |
  • | * | |
  • | * | |
  • |/ / /
  • * | |
  • |\ \ \
  • | | * |
  • | |/ /
  • |/| |
  • | * |
  • |/ /
  • | *
  • | *
  • | |\
  • | |/
  • |/|
  • | | *
  • | | *
  • | | *
  • | * |
  • | * |
  • * | |
  • |\ \ \
  • * \ \ \
  • |\ \ \ \
  • * \ \ \ \
  • |\ \ \ \ \
  • | * | | | |
  • | | |_|/ /
  • | |/| | |
  • * | | | |
  • |\ \ \ \ \
  • | | | | * |
  • | |_|_|/ /
  • |/| | | |
  • | * | | |
  • | * | | |
  • | * | | |
  • | | | * |
  • | |_|/ /
  • |/| | |
  • | * | |
  • | * | |
  • | * | |
  • | * | |
  • | * | |
  • | * | |
  • | * | |
  • | |/ /
  • | * |
  • | |\ \
  • | |/ /
  • |/| |
  • * | |
  • |\ \ \
  • | * | |
  • |/ / /
  • * | |
  • |\ \ \
  • * \ \ \
  • |\ \ \ \
  • * \ \ \ \
  • |\ \ \ \ \
  • * \ \ \ \ \
  • |\ \ \ \ \ \
  • * \ \ \ \ \ \
  • |\ \ \ \ \ \ \
  • * \ \ \ \ \ \ \
  • |\ \ \ \ \ \ \ \
  • | | | | | | * | |
  • | |_|_|_|_|/ / /
  • |/| | | | | | |
  • | | | | | * | |
  • | |_|_|_|/ / /
  • |/| | | | | |
  • | | | | * | |
  • | | | | |/ /
  • | | | * / /
  • | | | |/ /
  • | | * / /
  • | |/ / /
  • |/| | |
  • | | | | *
  • | * | | |
  • | * | | |
  • | * | | |
  • | |/ / /
  • | * | |
  • | |\ \ \
  • | |/ / /
  • |/| | |
  • * | | |
  • * | | |
  • | | | | *
  • | | | | |\
  • | |_|_|_|/
  • |/| | | |
  • * | | | |
  • |\ \ \ \ \
  • | | |_|_|/
  • | |/| | |
  • * | | | |
  • |\ \ \ \ \
  • | * | | | |
  • | | |/ / /
  • | |/| | |
  • | * | | |
  • | |\ \ \ \
  • | |/ / / /
  • |/| | | |
  • * | | | |
  • * | | | |
  • |\ \ \ \ \
  • * \ \ \ \ \
  • |\ \ \ \ \ \
  • | | * | | | |
  • | | |/ / / /
  • | * | | | |
  • * | | | | |
  • |\ \ \ \ \ \
  • | | * | | | |
  • | | * | | | |
  • * | | | | | |
  • |\ \ \ \ \ \ \
  • | | |_|_|_|/ /
  • | |/| | | | |
  • | | | * | | |
  • | | | * | | |
  • | | | * | | |
  • | | | |/ / /
  • | | | * | |
  • | | | |\ \ \
  • | |_|_|/ / /
  • |/| | | | |
  • | * | | | |
  • | * | | | |
  • | * | | | |
  • | | * | | |
  • | | * | | |
  • | |/ / / /
  • |/| | | |
  • | * | | |
  • | * | | |
  • | * | | |
  • | * | | |
  • | * | | |
  • | * | | |
  • | * | | |
  • | * | | |
  • | * | | |
  • |/ / / /
  • * | | |
  • |\ \ \ \
  • | * | | |
  • | |/ / /
  • * | | |
  • |\ \ \ \
  • * \ \ \ \
  • |\ \ \ \ \
  • | * | | | |
  • | | * | | |
  • | |/ / / /
  • |/| | | |
  • * | | | |
  • |\ \ \ \ \
  • * \ \ \ \ \
  • |\ \ \ \ \ \
  • | |_|_|_|_|/
  • |/| | | | |
  • | | * | | |
  • | |/ / / /
  • |/| | | |
  • | * | | |
  • | | |/ /
  • | |/| |
  • 1a88c4ab2 (refs/pull/2426/head) Fix bottle upload problem & typo by Izaak Beekman 2020-02-17 13:32:33 -0500
  • 0b4480216 Test push & PRs only when workflow file changes by Izaak Beekman 2020-02-17 13:12:50 -0500
  • 2c242b4ce Add Github Action to build development branch nightly with Homebrew by Izaak Beekman 2020-02-17 11:49:53 -0500
  • 0bfb7336d Merge pull request #2424 from isuruf/osx by Martin Kroeker 2020-02-17 17:00:08 +0100
  • 403cde104 Merge pull request #30 from xianyi/develop by Martin Kroeker 2020-02-17 14:53:46 +0100
  • 634f2bddd Merge pull request #2414 from marxin/fix-iamax_sse-implementation by Martin Kroeker 2020-02-17 14:50:18 +0100
  • e94140bdc (refs/pull/2425/head) Use -DUSE_MAX macro instead of -UUSE_MIN. by Martin Liska 2020-02-17 09:46:12 +0100
  • aeea14ee4 (refs/pull/2414/head) Come up with LOAD_AND_COMPARE_TO_MXX macro in iamax_sse.S. by Martin Liska 2020-02-13 14:42:45 +0100
  • 18bcc36a6 Fix implementation of iamax_sse.S as reported in #2116. by Martin Liska 2020-02-13 14:32:24 +0100
  • 0e7f43c89 Add missing USE_MIN in kernel/CMakeLists.txt. by Martin Liska 2020-02-14 10:35:51 +0100
  • 79e201fbb Merge pull request #2423 from xianyi/issue2419 by Martin Kroeker 2020-02-17 07:24:02 +0100
  • 4326dcb46 (refs/pull/2424/head) Pass CFLAGS from env to Makefile.prebuild and remove iOS hack by Isuru Fernando 2020-02-16 15:11:40 -0600
  • e32f3b144 (refs/pull/2423/head, issue2419) Restore -march flag for Android builds by Martin Kroeker 2020-02-16 17:32:13 +0100
  • d483e9270 Update KERNEL.POWER8 by Martin Kroeker 2020-02-16 17:29:35 +0100
  • 01834aee3 Merge pull request #29 from xianyi/develop by Martin Kroeker 2020-02-16 17:28:10 +0100
  • b0558c11b Update param.h by wjc404 2020-02-16 23:01:31 +0800
  • f566787e6 Update KERNEL.SKYLAKEX by wjc404 2020-02-16 22:58:44 +0800
  • e3368cbf1 AVX512 STRMM kernel by wjc404 2020-02-16 22:58:00 +0800
  • d92bd5be2 Update KERNEL.POWER8 by Martin Kroeker 2020-02-15 23:07:50 +0100
  • 46e4b1294 Update KERNEL.POWER8 by Martin Kroeker 2020-02-15 23:06:51 +0100
  • 5e94aa487 Merge pull request #2417 from marxin/make-ctest-verbose-for-drone by Martin Kroeker 2020-02-15 21:57:41 +0100
  • 93f3e2757 Merge pull request #2415 from marxin/add-cmake-to-gitignore by Martin Kroeker 2020-02-15 21:57:03 +0100
  • 785c389b0 Merge pull request #2420 from martin-frbg/issue2396 by Martin Kroeker 2020-02-15 21:56:16 +0100
  • c222b25b8 (refs/pull/2420/head) Correct generation of GETRF files by the CMAKE build by Martin Kroeker 2020-02-15 19:29:14 +0100
  • 221da8bf0 Merge pull request #2411 from martin-frbg/fix2254-038 by Martin Kroeker 2020-02-14 23:07:43 +0100
  • eb285b4d2 (refs/pull/2417/head) Make ctest verbose for drone builder. by Martin Liska 2020-02-14 10:45:31 +0100
  • cafdd999b (refs/pull/2411/head) Update caxpy_power8.S by Martin Kroeker 2020-02-13 22:44:09 +0100
  • 92ca92a46 Update caxpy_power8.S by Martin Kroeker 2020-02-13 21:24:54 +0100
  • 486c35c5d Update icamin_power8.S by Martin Kroeker 2020-02-13 18:38:43 +0100
  • 0e05ea9ba (refs/pull/2415/head) Add CMake related files to .gitignore. by Martin Liska 2020-02-13 14:51:55 +0100
  • 5ba3699f4 Update isamin_power8.S by Martin Kroeker 2020-02-13 00:00:32 +0100
  • 8eefa530c Update isamax_power8.S by Martin Kroeker 2020-02-12 23:59:50 +0100
  • de40d47ed Update isamin_power8.S by Martin Kroeker 2020-02-12 23:57:48 +0100
  • 7c162b8a2 Update isamax_power8.S by Martin Kroeker 2020-02-12 23:56:57 +0100
  • 0544cbc80 Fix syntax of endianness conditional by Martin Kroeker 2020-02-12 20:00:29 +0100
  • 120d20731 Fix syntax of endianness conditional by Martin Kroeker 2020-02-12 19:58:42 +0100
  • dc345d84d Fix syntax of endianness conditional and add gcc version check for workaround by Martin Kroeker 2020-02-12 19:56:52 +0100
  • 616921fd9 Merge pull request #27 from xianyi/develop by Martin Kroeker 2020-02-12 19:16:14 +0100
  • 8a9e9a82a Merge pull request #2410 from bartoldeman/fix-dscal-inline-asm by Martin Kroeker 2020-02-12 15:38:37 +0100
  • 7ea5e07d1 (refs/pull/2410/head) Fix inline asm in dscal: mark x, x1 as clobbered. Fixes #2408 by Bart Oldeman 2020-02-12 14:11:44 +0000
  • cb6ef4985 Merge pull request #2407 from susilehtola/patch-2 by Martin Kroeker 2020-02-11 13:04:44 +0100
  • 63994e1cd Merge pull request #2405 from susilehtola/patch-1 by Martin Kroeker 2020-02-11 13:03:35 +0100
  • 496e3019b Merge pull request #2404 from martin-frbg/issue2395 by Martin Kroeker 2020-02-11 13:00:36 +0100
  • 169be3f09 Merge pull request #2403 from martin-frbg/issue2400 by Martin Kroeker 2020-02-11 13:00:16 +0100
  • 6ccbb089c Merge pull request #2402 from gxw-loongson/develop by Martin Kroeker 2020-02-11 12:59:53 +0100
  • 59ebe3636 Merge pull request #2399 from martin-frbg/buffersize by Martin Kroeker 2020-02-11 12:56:56 +0100
  • 5a6bba306 (refs/pull/2407/head) Patch out instances of Z15 in dynamic_zarch.c by Susi Lehtola 2020-02-11 15:07:33 +1300
  • dff173e50 (refs/pull/2405/head) Fix typo in dynamic_zarch.c by Susi Lehtola 2020-02-11 14:46:30 +1300
  • 7e5cbb6f3 (refs/pull/2404/head) Fix bad conditional syntax that caused spurious application of USE_TRMM by Martin Kroeker 2020-02-10 21:17:39 +0100
  • 303bdb673 (refs/pull/2403/head) Fix coretype detection for Intel extended models 6 and 7 by Martin Kroeker 2020-02-10 19:17:32 +0100
  • 754433f42 (refs/pull/2402/head) Avoid printing the following information on mips and mips64 when check msa: "unrecognized command line option ‘-mmsa’" by gxw 2020-02-10 19:11:45 +0800
  • 137fd21fe (refs/pull/2401/head) Avoid printing the following information on mips and mips64 platform when check msa: "unrecognized command line option ‘-mmsa’" by gxw 2020-02-10 18:49:50 +0800
  • 7f0d523b4 (refs/pull/2399/head) Make BUFFER_SIZE configurable by Martin Kroeker 2020-02-09 23:32:57 +0100
  • c353d8b10 Make BUFFER_SIZE configurable by Martin Kroeker 2020-02-09 23:30:22 +0100
  • 579be3aa9 Add configuration option for BUFFER_SIZE by Martin Kroeker 2020-02-09 23:28:04 +0100
  • 449e8ea44 Merge pull request #26 from xianyi/develop by Martin Kroeker 2020-02-09 23:23:55 +0100
  • 3bec250cf Increment version to 0.3.9.dev by Martin Kroeker 2020-02-09 23:18:44 +0100
  • f03dd23e9 Increment version to 0.3.9.dev by Martin Kroeker 2020-02-09 23:18:07 +0100
  • fb5eb4755 (tag: v0.3.8) Merge pull request #2398 from xianyi/develop by Martin Kroeker 2020-02-09 23:16:28 +0100
  • fa93d6336 (refs/pull/2398/head) Merge branch 'release-0.3.0' into develop by Martin Kroeker 2020-02-09 23:16:06 +0100
  • 90e6c66a5 Merge pull request #2397 from martin-frbg/038changes by Martin Kroeker 2020-02-09 23:01:52 +0100
  • 32d97330b (refs/pull/2397/head) Update with changes from 0.3.8 by Martin Kroeker 2020-02-09 23:00:36 +0100
  • 29eaf4b6d Merge pull request #25 from xianyi/develop by Martin Kroeker 2020-02-09 22:48:15 +0100
  • 47c1bf7f4 typo fixes by Martin Kroeker 2020-02-09 01:06:40 +0100
  • 2b55f0ad3 Merge pull request #2393 from martin-frbg/issue2388 by Martin Kroeker 2020-02-09 01:00:33 +0100
  • a5b32ab06 Merge pull request #2390 from martin-frbg/pgi by Martin Kroeker 2020-02-09 00:13:40 +0100
  • 50545b19d (refs/pull/2393/head) Update CPU and OS support and document DYNAMIC_ARCH option in README.md by Martin Kroeker 2020-02-09 00:06:07 +0100
  • b3cbd60d7 (refs/pull/2390/head) Remove PGI from list again as it is actually still not capable by Martin Kroeker 2020-02-08 10:20:13 +0100
  • 70199d190 Merge pull request #2389 from Zeyiii/develop by Martin Kroeker 2020-02-07 16:05:46 +0100
  • cfe63d8cc Remove OpenMP libraries from link list by Martin Kroeker 2020-02-07 16:03:51 +0100
  • d55b10830 Remove OpenMP libraries from link list by Martin Kroeker 2020-02-07 16:02:17 +0100
  • c1c10cbb2 Merge pull request #2384 from wjc404/develop by Martin Kroeker 2020-02-07 13:47:12 +0100
  • 598984152 Add PGI to avx512-supporting compilers by Martin Kroeker 2020-02-07 13:01:31 +0100
  • 68a43db35 Fix utest compilation with PGI by Martin Kroeker 2020-02-07 10:15:18 +0100
  • 9694037b2 Set SUFFIX in tempfile commands, fix bad architecture option for PGI compiler in avx512 test by Martin Kroeker 2020-02-07 10:09:25 +0100
  • 71faa1c1a Merge pull request #24 from xianyi/develop by Martin Kroeker 2020-02-07 10:03:02 +0100
  • 3447d04ea (refs/pull/2384/head) Update dgemm_kernel_16x2_skylakex.c by wjc404 2020-02-06 02:14:10 +0000
  • 8b5cdcc64 Update sgemm_kernel_8x4_haswell.c by wjc404 2020-02-06 01:47:46 +0000
  • 4e00d96a7 Update dgemm_kernel_16x2_skylakex.c by wjc404 2020-02-06 01:46:36 +0000
  • ce9ea8f82 (refs/pull/2389/head) Fix another branch by w00421467 2020-02-05 15:07:18 +0800
  • 0b909203c Fix bugs in benchmark of gemv by w00421467 2020-02-05 14:53:37 +0800
  • 096da2f51 Update dgemm_kernel_16x2_skylakex.c by wjc404 2020-02-05 13:36:57 +0800
  • 2f96a2c55 Update trmm_R.c by wjc404 2020-02-05 10:15:02 +0800
  • 833bd0f8f Update trmm_L.c by wjc404 2020-02-05 10:09:41 +0800
  • 77b8f4955 Update level3_thread.c by wjc404 2020-02-04 20:33:08 +0800
  • 1c3e20ce4 Update level3.c by wjc404 2020-02-04 20:30:23 +0800
  • 83b6be797 Update param.h by wjc404 2020-02-04 19:55:26 +0800
  • 081b18852 Update KERNEL.SKYLAKEX by wjc404 2020-02-03 21:38:08 +0800
  • f3f969f68 Update param.h by wjc404 2020-02-03 21:34:12 +0800
  • 8019e7021 AVX512 16x2 DGEMM kernel by wjc404 2020-02-03 21:32:56 +0800
  • 8d2a796f4 Merge pull request #2378 from martin-frbg/issue2377 by Martin Kroeker 2020-01-30 17:07:19 +0100
  • 8dc9fd4df (refs/pull/2378/head) Add -march option for AVX512 by Martin Kroeker 2020-01-30 12:41:18 +0100
  • abc67bdd7 Merge pull request #2375 from ewanglong/master by Martin Kroeker 2020-01-30 10:27:29 +0100
  • 1f62a8278 Merge pull request #2376 from wjc404/develop by Martin Kroeker 2020-01-23 21:50:19 +0100
  • e9fb8f62b (refs/pull/2376/head) Update level3_gemm3m_thread.c by wjc404 2020-01-22 17:40:03 +0000
  • fbf4f48f4 (refs/pull/2375/head) fix a few performance drop in some matrix size per data type by Wang,Long 2020-01-22 15:07:50 +0000
  • b9ad45029 Merge pull request #2373 from Qiyu8/optimize#gemmbeta by Martin Kroeker 2020-01-21 15:05:38 +0100
  • e011ad820 Merge pull request #2372 from martin-frbg/winexit by Martin Kroeker 2020-01-21 14:56:45 +0100
  • ff42e6865 (refs/pull/2373/head) Optimize genenal Gemm Beta by Qiyu8 2020-01-20 11:49:42 +0800
  • 23f322f99 (refs/pull/2372/head) Do not run any cleanup if the program is exiting anyway by Martin Kroeker 2020-01-19 13:28:27 +0100