571eadb88
powerpc: Optimized SGEMM/DGEMM/CGEMM for POWER10 by
2020-06-24 14:48:15 -0500
df4ade070
(refs/pull/2680/head)
Fix for #2671 by
2020-06-24 04:25:47 -0500
e6b927503
(refs/pull/2677/head)
address vs2019 C4293 by
2020-06-24 09:12:23 +0300
53ea5bfec
Merge pull request #66 from xianyi/develop by
2020-06-23 10:13:44 +0200
93592d126
Merge pull request #2675 from wjc404/develop by
2020-06-23 09:29:02 +0200
6eaeb0126
Merge pull request #2658 from RajalakshmiSR/p10 by
2020-06-23 00:02:37 +0200
45d542c9d
Merge pull request #65 from xianyi/develop by
2020-06-21 12:41:01 +0200
086d87a30
(refs/pull/2675/head)
AVX512 dgemm tcopy_16 function by
2020-06-20 00:07:43 +0800
af501eb75
Merge pull request #2669 from mhillenibm/zarch_fix_gcc_detection by
2020-06-17 17:55:25 +0200
0eb6c4dde
Merge pull request #2672 from mhillenibm/test_num_threads by
2020-06-17 17:54:31 +0200
de838c38e
(refs/pull/2672/head)
cpp_thread_test/dgemv: fail early if concurrency is zero by
2020-06-17 16:15:44 +0200
478898b37
cpp_thread_test/dgemv: cap concurrency to number of hw threads on small systems by
2020-06-17 16:08:48 +0200
cde469072
(refs/pull/2670/head)
RFC: Use gcc -dumpfullversion to get minor version with gcc-7.x by
2020-06-16 15:45:59 +0200
238929176
(refs/pull/2669/head)
Makefile.system: remove duplicate variable GCCVERSIONGT5 by
2020-06-16 14:45:09 +0200
a2d13ea61
Fix gcc version detection for zarch by
2020-06-16 14:40:50 +0200
1bd3cd66c
Increment version to 0.3.10.dev by
2020-06-14 22:05:19 +0200
1c53e1366
Increment version to 0.3.10.dev by
2020-06-14 22:04:37 +0200
63b03efc2
(tag: v0.3.10)
Merge pull request #2667 from xianyi/develop by
2020-06-14 22:03:04 +0200
95dbeff66
(refs/pull/2667/head)
Merge branch 'release-0.3.0' into develop by
2020-06-14 22:02:45 +0200
3b673a24b
Increment version to 0.3.10.dev by
2020-06-14 21:57:52 +0200
1eb197905
Increment version to 0.3.10.dev by
2020-06-14 21:57:15 +0200
efc53b6e7
Merge pull request #2665 from martin-frbg/flang-fixes-2a by
2020-06-14 21:56:08 +0200
72888497e
Update with 0.3.10 changes by
2020-06-14 21:55:31 +0200
7e3e006af
Merge pull request #2666 from martin-frbg/blastest by
2020-06-14 18:28:37 +0200
d906d1440
Merge pull request #2664 from ACSimon33/exported_symbols by
2020-06-14 18:27:03 +0200
3785c0e82
Merge pull request #2663 from martin-frbg/issue2654 by
2020-06-14 18:26:43 +0200
f2d8879af
Merge pull request #2661 from martin-frbg/issue2660 by
2020-06-14 18:25:37 +0200
6876221cf
(refs/pull/2665/head)
Remove optimization level limit for flang again and add -fno-unroll-loops for AOCC flang 2.x instead by
2020-06-14 17:40:24 +0200
79cdcde71
Re-enable higher optimization levels for flang while disabling loop unrolling for AOCC flang by
2020-06-14 17:18:16 +0200
18a11137f
(refs/pull/2666/head)
Update BLAS tests to correspond to Reference-LAPACK 3.9.0 by
2020-06-14 10:26:25 +0200
1dd712131
Fix spelling of flang option -Mrecursive and add -Kieee by
2020-06-14 00:09:31 +0200
0ed2adf0b
Fix spelling of flang option -Mrecursive and add -Kieee by
2020-06-14 00:01:20 +0200
abf670757
(refs/pull/2663/head)
Respect predefined defaults for AR, AS, LD and RANLIB by
2020-06-13 23:21:13 +0200
41fc6f3cd
(refs/pull/2664/head)
Added missing exported symbols. by
2020-06-13 22:37:39 +0200
c90c528ee
(refs/pull/2662/head)
Force flang optimization level to -O0 and correct spelling of -Mrecursive by
2020-06-13 19:41:49 +0200
f132b05de
Force flang optimization level to -O0 to work around failures in ctest and lapack-test by
2020-06-13 19:36:01 +0200
f6ccca344
Correct flang option to -Mrecursive by
2020-06-13 19:32:54 +0200
007d9f97d
(refs/pull/2661/head)
Make gotoblas_corename report the name of the selected TARGET rather than its aliases by
2020-06-13 19:25:28 +0200
63d26090f
Merge pull request #64 from xianyi/develop by
2020-06-13 19:14:47 +0200
9fe930f20
(refs/pull/2658/head)
powerpc: Add support for future processor by
2020-06-11 15:47:20 -0500
3a1b58d54
Merge pull request #2653 from craft-zhang/cortex-a53 by
2020-06-10 12:19:33 +0200
f7659be4a
Merge pull request #2652 from martin-frbg/flang-fixes by
2020-06-09 20:31:06 +0200
bc6fd20a4
(refs/pull/2653/head)
fix INIT8x4 by
2020-06-10 01:01:16 +0800
3ce469a34
(refs/pull/2652/head)
Limit optimization level to O1 for flang and add -frecursive by
2020-06-09 16:11:13 +0200
ba2c5b404
When building with flang, use it also for the final link step to get dependencies right by
2020-06-09 16:09:34 +0200
f07a80354
Apply previously AOCC-specific workaround to all versions of flang by
2020-06-09 16:07:03 +0200
fdd1b5026
Merge pull request #63 from xianyi/develop by
2020-06-09 15:54:30 +0200
b98923f33
Test enforce -O1 for flang by
2020-06-09 06:54:42 +0000
4cb1db0e3
Test flang build by
2020-06-09 06:25:45 +0000
430e8b45f
Merge pull request #2648 from martin-frbg/lapack411 by
2020-06-07 19:45:52 +0200
88fe85f4e
Merge pull request #2647 from martin-frbg/aocc-flang by
2020-06-07 19:45:11 +0200
89091e6b6
Merge pull request #2645 from martin-frbg/misc_fixes by
2020-06-07 19:44:50 +0200
522aaf53b
(refs/pull/2648/head)
Break out of potentially infinite rescaling loop in LAPACK xLARGV/xLARTG/xLARTGP by
2020-06-07 14:30:20 +0200
c3574ffe5
Merge pull request #2646 from wjc404/develop by
2020-06-07 13:18:22 +0200
4e28dc635
(refs/pull/2647/head)
Use only -O1 with AMD AOCC version of flang by
2020-06-07 00:05:02 +0200
13c28889a
(refs/pull/2645/head)
Update "cosmetic fixes for non-C99 compilers" by
2020-06-06 15:22:27 +0200
0e3ac4a06
(refs/pull/2646/head)
Add files via upload by
2020-06-06 14:56:57 +0800
28915eed7
Cosmetic fixes for non-C99 compilers by
2020-06-05 10:05:34 +0200
7f60fb6b9
Delete spurious copy of common_param.h by
2020-06-05 10:04:16 +0200
0464e662a
make blas_quickdivide unsigned and guard against miscompilation by
2020-06-05 10:03:36 +0200
0f9a935a5
Merge pull request #62 from xianyi/develop by
2020-06-05 09:51:06 +0200
79cd69fea
Merge pull request #2644 from martin-frbg/cmake-maxstack by
2020-06-05 08:33:48 +0200
bb12c2c85
(refs/pull/2644/head)
Limit MAX_STACK_ALLOC availability to non-Wndows by
2020-06-04 19:07:27 +0200
32c1c1e12
Update azure-pipelines.yml by
2020-06-04 19:03:46 +0200
f1953b8b8
Update azure-pipelines.yml by
2020-06-04 17:58:13 +0200
6e97df7b4
Add CMAKE support for MAX_STACK_ALLOC setting by
2020-06-04 14:45:31 +0200
729303e5e
Merge pull request #2643 from craft-zhang/cortex-a53 by
2020-06-04 07:58:45 +0200
547965530
Merge pull request #2638 from leezu/actions by
2020-06-04 00:02:37 +0200
9b7877ccf
(refs/pull/2643/head)
sgemm copy source init by
2020-06-04 02:09:38 +0800
f82fa802d
Insert prefetch by
2020-06-04 02:08:48 +0800
3eda3d34c
Merge pull request #2641 from martin-frbg/ppcg4 by
2020-06-03 16:43:46 +0200
a8f42ae85
(refs/pull/2638/head)
set cmake build type to Release by
2020-06-03 15:28:59 +0200
e6e2e531b
revert clang pragma by
2020-06-03 15:16:27 +0200
456dc0444
Update sgemm_kernel_16x4_skylakex_3.c by
2020-06-03 15:15:41 +0200
89323458a
preset optimization level for apple clang by
2020-06-03 15:07:25 +0200
e153bdeb7
Update dynamic_arch.yml by
2020-06-03 13:46:43 +0200
c2001f775
Make cmake build verbose to see options in use by
2020-06-03 12:18:15 +0200
c2b3f0b3f
Revert "keep Apple Clang from optimizing this" by
2020-06-03 10:22:15 +0200
f16e39554
(refs/pull/2641/head)
Change PPCG4 CGEMM_M to match kernel change by
2020-06-03 09:15:29 +0200
b1ee81228
Change complex DOT and ROT to generic kernels and switch CGEMM by
2020-06-03 09:13:29 +0200
9f7358d7d
Keep Apple Clang from optimizing this by
2020-06-03 08:52:53 +0200
54fa90fb2
Keep apple clang 11.0.3 from trying to optimize this (and running out of registers) by
2020-06-02 17:31:45 +0200
5a709b834
Print CPU info in output by
2020-06-01 20:51:11 +0000
b31a68b83
Add Github Actions test for DYNAMIC_ARCH builds by
2020-05-31 01:17:05 +0000
86552bf4c
Update f_check by
2020-05-31 15:22:12 +0200
a349d48d8
Merge pull request #2636 from martin-frbg/issue2634 by
2020-05-31 15:16:09 +0200
4db00121d
(refs/pull/2636/head)
Disable EXPRECISION and add -lm on OSX (same as the BSDs and Linux) by
2020-05-31 12:39:36 +0200
909897f13
Document option USE_LOCKING by
2020-05-31 12:37:57 +0200
e79245acd
Merge pull request #2635 from ilayn/patch-1 by
2020-05-30 14:37:12 +0200
76d2612e0
(refs/pull/2635/head)
BUG: Fix the loop range in ZHEEQUB.f by
2020-05-30 14:11:11 +0200
ced49466f
Use the fortran compiler to link LAPACK-related benchmarks by
2020-05-29 13:35:51 +0200
6e270f91e
add support for RETURN_BY_STACK semantics, e.g. clang by
2020-05-29 13:29:10 +0200
200296b0f
remove libomp from link list only for pgfortran by
2020-05-29 13:23:51 +0200
dd7a65079
Merge pull request #59 from xianyi/develop by
2020-05-29 13:06:25 +0200
4a4c50a7c
Merge pull request #2627 from pkubaj/patch-1 by
2020-05-26 08:36:24 +0200
d069780e6
Merge pull request #2626 from docularxu/working-gcc-version-detections by
2020-05-26 08:35:58 +0200
33c879060
(refs/pull/2627/head)
Add powerpc (32-bit) by
2020-05-25 13:14:09 +0200
06387ac0e
(refs/pull/2626/head)
make GCC version detection OS-independent by
2020-05-25 10:40:12 +0000
f1a18d245
Merge pull request #2618 from craft-zhang/cortex-A53 by
2020-05-25 12:14:46 +0200
2a3aa9135
(refs/pull/2618/head)
update CONTRIBUTORS.md, adding myself by
2020-05-20 22:35:26 +0800