l00536773
d45c53ecf1
[OpenBLAS]: benchmark for her/her2 LEVEL2 functions
[description]: benchmark for her/her2
[solution]: added benchmark for her/her2, modified makefile in benchmark
[dts]:
5 years ago
Martin Kroeker
c2840997db
Merge pull request #2508 from liujingjue/develop
[OpenBLAS]:fix the iamax benchmark error
5 years ago
Martin Kroeker
c775458299
Merge pull request #2512 from martin-frbg/lapackh
Move declarations of lapack_complex_custom types outside the extern C
5 years ago
Martin Kroeker
c0649aa694
Merge pull request #2506 from xiaofengF/develop
Add benchmark for SPMV and fix segmentation fault when data size >= 50000
5 years ago
Martin Kroeker
0fe0914437
Merge pull request #2505 from aaawuanjun/develop
[OpenBlas]:Add benchmark tpmv.c and modify benchmark/Makefile
6 years ago
Martin Kroeker
4cfd8a3f57
Merge pull request #2511 from martin-frbg/fixppctest
Prevent attempts to run ctest or test when fortran is not available
6 years ago
Martin Kroeker
ee2e758278
Move declarations of lapack_complex_custom types outside the extern C
fixes #2510
6 years ago
Martin Kroeker
2d8781b0dc
Do not attempt to run test without fortran
6 years ago
Martin Kroeker
c436e8af7b
Do not attempt to run ctest without fortran
The main Makefile takes care of this in the build process, but users or CI jobs may try to run this directly
6 years ago
l00546269
a0a3bf7c81
[OpenBLAS]:fix the iamax benchmark error
[Description]:the result for i?amax is not MFlops, it is MBytes
6 years ago
jayfely@qq.com
ae3f2c2e49
Remove cspmv and zspmv to remove the error occured in travis CI
6 years ago
jayfely@qq.com
83ecf9fea7
Modify Makefile in interface to remove the error occured in travis CI
6 years ago
jayfely@qq.com
649733ff15
Only keep spmv.goto and spmv.atlas
6 years ago
wuanjun 00447568
3e8f1c6cc5
[OpenBlas]:Add benchmark tpmv.c and modify Makefile
[Description]:Solve the problem of missing tpmv.c benchmark file
6 years ago
jayfely@qq.com
2f4c5bb3a9
Update spmv.c: solve segmentation fault when m and n are larger than 50000
6 years ago
Martin Kroeker
4e1c4e67d4
Merge pull request #2503 from martin-frbg/xerbl
Apply fix for LAPACK issue 394 (fixed-form code beyond column 72)
6 years ago
Martin Kroeker
b9a2a3c540
Merge pull request #2502 from martin-frbg/issue2497
Fix INTERFACE64 not propagating to the fortran codes on ARMV8
6 years ago
Martin Kroeker
047dfb216d
Merge pull request #2501 from jijiwawa/Fix_mistakes
Fix pr #2487 error
6 years ago
s00527847
cd8871f1a1
Use the correct unit of measure
6 years ago
Martin Kroeker
b25ae1fc60
Apply fix for Reference-LAPACK issue 394
reference to XERBLA extending beyond column 72, breaking builds with compilers that default to traditional punch card format
6 years ago
Martin Kroeker
3f7f7ab7e2
Restore INTERFACE64 for arm64
6 years ago
Martin Kroeker
9c22170f52
Merge pull request #37 from xianyi/develop
rebase
6 years ago
jayfely@qq.com
08e1d8cbae
Modify Makefile in Benchmark
6 years ago
jayfely@qq.com
ff40a4e726
Add benchmark for SPMV
6 years ago
Zhang Xianyi
51019feae1
Merge pull request #2498 from njutcz/develop
Add benchmark for ?amax, ?max, ?amin, ?min, i?max, i?amin and i?min.
6 years ago
s00548429
bec7923a0d
Fix the functional bugs for zamax.
6 years ago
s00548429
c5bdd21352
Add benchmark for ?amax, ?max, ?amin, ?min, i?max, i?amin and i?min.
6 years ago
njutcz
d2d16d091e
Merge pull request #1 from xianyi/develop
update
6 years ago
Martin Kroeker
b6a6ccbbea
Merge pull request #2495 from ZuoQ3/develop
add benchmark for axpby test
6 years ago
Martin Kroeker
8b720f7365
Merge pull request #2494 from shengyang-3390/develop
add benchmark for csrot and zdrot
6 years ago
Martin Kroeker
14df234edb
Merge pull request #2489 from jijiwawa/brightness
Remove redundant code
6 years ago
s00527847
bbeda55b7b
add trmm.c
6 years ago
s00527847
efcf89aec7
Remove redundant code
6 years ago
Martin Kroeker
37d456f7e0
Merge pull request #2493 from martin-frbg/plainmake
Fix use of make vs $(MAKE) in building lapack-testing
6 years ago
Martin Kroeker
0b9e96922b
Merge pull request #2488 from liujingjue/develop
Modify the main Makefile in OpenBLAS
6 years ago
zq
0c8162eba6
Add benchmark file axpby.c and modify benchmark/Makefile to test s/d/c/zaxpby
6 years ago
zq
9a94a30132
Merge pull request #1 from xianyi/develop
update
6 years ago
shengyang
09c7a191bd
add benchmark for csrot and zdrot
modified: benchmark/Makefile
modified: benchmark/rot.c
6 years ago
l00546269
8a8df530e2
[OpenBLAS]:modifed the Makefile
[Description]: check the compiler version and show the detail info
6 years ago
Martin Kroeker
37f46f2fa0
Fix another spot where make was used instead of $(MAKE)
Broke lapack-testing on BSD as their default "make" does not support GNU Makefile syntax
6 years ago
Martin Kroeker
9afc561be4
Merge pull request #36 from xianyi/develop
rebase
6 years ago
Martin Kroeker
dca3e0cf20
Merge pull request #2491 from chenxuqiang/hbmv_benchmark
benchmark/hpmv&hbmv: add benchmark/hpmv.c and benchmark/hbmv.c
6 years ago
Martin Kroeker
c9f8db979b
Merge pull request #2490 from shengyang-3390/develop
Add benchmark file rotm.c and modify benchmark/Makefile to test s/drotm
6 years ago
Martin Kroeker
18099de976
Merge pull request #2487 from jijiwawa/develop
add benchmark for spr/spr2
6 years ago
Martin Kroeker
97c36ca58c
Merge branch 'develop' into develop
6 years ago
Martin Kroeker
9f5a74f3c7
Merge pull request #2486 from qqqil/develop
add benchmark for trsv
6 years ago
Martin Kroeker
2afb10975d
Merge pull request #2485 from Darkness303/develop
Add syr2 benchmark
6 years ago
Martin Kroeker
dbef479227
Merge pull request #2469 from AGSaidi/acq-rel-2
Use acq/rel semantics to pass flags/pointers in getrf_parallel.
6 years ago
Ali Saidi
208c7e7ca5
Use acq/rel semantics to pass flags/pointers in getrf_parallel.
The current implementation has locks, but the locks each only
have a critical section of one variable so atomic reads/writes
with barriers can be used to achieve the same behavior.
Like the previous patch, pthread_mutex_lock isn't fair, so in a
tight loop the previous thread that has the lock can keep it
starving another thread, even if that thread is about to write
the data that will stop the current thread from spinning.
On a 64c Arm system this improves performance by 20x on sgesv.goto.
6 years ago
chenxuqiang
32c847df45
benchmark/hpmv&hbmv: add benchmark/hpmv.c and benchmark/hbmv.c
Signed-off-by: Xuqiang Chen chenxuqiang3@hisilicon.com
6 years ago