Martin Kroeker
c8a32d0a93
Add alternative OpenMP thread safety test from old issue 602
5 years ago
Martin Kroeker
1748f40cbb
Add testcase from issue 602
5 years ago
Martin Kroeker
e607d8de14
Add C version of testcase from issue 602
5 years ago
Martin Kroeker
c1f52d3589
Add original testcase from issue 602
5 years ago
Martin Kroeker
eead529d38
Create test_dgemm_f90.f
5 years ago
Martin Kroeker
4293b4b654
Create test_dgemm_omp.c
5 years ago
Martin Kroeker
c3b0b2d59b
Update .drone.yml
5 years ago
Martin Kroeker
3532fbcad6
Update .drone.yml
5 years ago
Martin Kroeker
9f57b7b8af
Update .drone.yml
5 years ago
Martin Kroeker
d735454a9a
add package for add-apt-repository command
5 years ago
Martin Kroeker
d81513ab7a
update repo address
5 years ago
Martin Kroeker
045a349437
add toolchain-test repo for gcc10
5 years ago
Martin Kroeker
f491291269
try to update the Epyc build to gcc10
5 years ago
Martin Kroeker
de932375c7
use parallel make in the ARM server build&test as well
5 years ago
Martin Kroeker
40be13d5ea
Switch the Epyc build to parallel make to catch build races
5 years ago
Martin Kroeker
9efc3f0815
Merge pull request #109 from xianyi/develop
rebase
5 years ago
Martin Kroeker
aa21cb5217
Merge pull request #2960 from thrasibule/avx2_detection
fix avx2 detection
5 years ago
Guillaume Horel
1f564d729b
fix avx2 detection
reword commits to make it clearer
5 years ago
Martin Kroeker
9349dcd206
Merge pull request #2956 from RajalakshmiSR/caxpy_p10
Optimize caxpy for POWER10
5 years ago
Rajalakshmi Srinivasaraghavan
b435491885
Optimize caxpy for POWER10
This patch makes use of new POWER10 vector pair instructions for
loads and stores.
5 years ago
Martin Kroeker
9a058f2451
Merge pull request #2940 from Qiyu8/optimize-benchmark
Refactor the performance measurement system
5 years ago
Martin Kroeker
074927a7d0
Merge pull request #2954 from Guobing-Chen/BF16_gemv_support
Implementation of BF16 based gemv
5 years ago
Martin Kroeker
60b22e3462
Merge pull request #2955 from Guobing-Chen/Fix_cooperlake_build_issue
Fix cooperlake compile issue
5 years ago
Chen, Guobing
c5e62dad69
Fix cooperlake compile issue
Add a missing macro which is required in Makefile.x86_64 due to recent
cleanup; its absence causes cooperlake platform build failure.
5 years ago
Chen, Guobing
a7b1f9b1bb
Implementation of BF16 based gemv
1. Add a new API -- sbgemv to support bfloat16 based gemv
2. Implement a generic kernel for sbgemv
3. Implement an avx512-bf16 based kernel for sbgemv
Signed-off-by: Chen, Guobing <guobing.chen@intel.com>
5 years ago
Martin Kroeker
67f39ad813
Merge pull request #2939 from thrasibule/Makefile_cleanup
reuse variables defined in Makefile.system
5 years ago
Martin Kroeker
6e13a7e99e
Merge pull request #2951 from martin-frbg/cleanup_make
Minor Makefile cleanup
5 years ago
Martin Kroeker
2207a16235
Merge pull request #2952 from martin-frbg/issue2931
Try to read cpu ID from /sys/devices/.../cpu0 if HWCAP_CPUID fails
5 years ago
Martin Kroeker
5d643929dd
Merge pull request #2948 from martin-frbg/issue2947
Expressly enable neon for use with intrinsics if available
5 years ago
Martin Kroeker
e8cbf0fc50
Output predefined HAVE_ entries to Makefile.conf for ARM with specified TARGET
5 years ago
Martin Kroeker
b937d78a6d
Try to read cpu information from /sys/devices/system/cpu/cpu0 if HWCAP_CPUID fails
5 years ago
Martin Kroeker
e2f9005db8
Merge pull request #2950 from RajalakshmiSR/saxpy
Optimize saxpy for POWER10
5 years ago
Martin Kroeker
6a1f3e40af
Remove debug printout of object list
5 years ago
Martin Kroeker
878b6d1f41
Remove spurious expr in flang version check
5 years ago
Rajalakshmi Srinivasaraghavan
c24ba8b1dd
Optimize saxpy for POWER10
This patch makes use of new POWER10 vector pair instructions for
loads and stores.
5 years ago
Qiyu8
f917c26e83
Refactoring remaining benchmark cases.
5 years ago
Martin Kroeker
76203e2120
Merge pull request #2946 from martin-frbg/issue2945
Move definitions that are neither needed nor supported on Solaris
5 years ago
Martin Kroeker
eec517af0e
Expressly enable neon for use with intrinsics if available
5 years ago
Martin Kroeker
fd7da56965
Move definitions that are neither needed nor supported on SUNOS
5 years ago
Martin Kroeker
2f9fc9be30
Update version to 0.3.12.dev
5 years ago
Martin Kroeker
81fcfd5ed3
Update version to 0.3.12.dev
5 years ago
Martin Kroeker
addf7593ae
Merge pull request #2944 from xianyi/release-0.3.0
Merge back 0.3.12 tag (and Changelog typo fixes) from release
5 years ago
Martin Kroeker
c5f280a7f0
Fix typos
5 years ago
Martin Kroeker
6e3a05f2c9
Merge pull request #2943 from xianyi/develop
Merge from develop for 0.3.12 release
5 years ago
Martin Kroeker
89db73569b
Update Changelog with 0.3.12 changes
5 years ago
Martin Kroeker
e1c18e4eeb
Update version to 0.3.12 for release
5 years ago
Martin Kroeker
26f658c9d2
Update version to 0.3.12 for release
5 years ago
Martin Kroeker
dc35477317
Merge pull request #2942 from martin-frbg/makebuildtypes
Comment out BUILD_SINGLE etc. in Makefile.rule and add a short explanation
5 years ago
Martin Kroeker
365f28787c
Comment out BUILD_SINGLE etc. and add a short explanation
5 years ago
Martin Kroeker
2f2e9ddb65
Merge pull request #2941 from martin-frbg/exportsfix
Fix grouping of sladiv1/dladiv1/ilaenv2stage in gensymbol
5 years ago