Chris Sidebottom
2c3cdaf74e
Optimized BGEMV for NEOVERSEV1 target
- Adds bgemv T based off of sbgemv T kernel
- Adds bgemv N which is slightly alterated to not use Y as an
accumulator due to the output being bf16 which results in loss of
precision
- Enables BGEMM_GEMV_FORWARD to proxy BGEMM to BGEMV with new kernels
6 months ago
Chris Sidebottom
740efd71c4
Add optimized BGEMM kernel for NEOVERSEV1 target
This also improves the testing and generic kernel by re-using the BF16
conversion functions.
Built on top of https://github.com/OpenMathLib/OpenBLAS/pull/5357 and derived from https://github.com/OpenMathLib/OpenBLAS/pull/5287
Co-authored-by: Ye Tao <ye.tao@arm.com>
6 months ago
Srangrang
9f13b2c6ac
style: modify HALF to BFLOAT16 in benchmark folder
7 months ago
gxw
ffaa5765a4
Bench: Add omatcopy
1 year ago
Sergei Lewis
3ffd6868d7
Merge branch 'develop' into dev/slewis/merge-from-riscv
2 years ago
gxw
3d4dfd0085
Benchmark: Rename the executable file names for {sc/dz}a{min/max}
No interface named {c/z}a{min/max}, keeping it would
cause ambiguity
2 years ago
HellerZheng
943372bdf5
Merge branch 'develop' into develop
3 years ago
Martin Kroeker
f92dd6e303
change line endings from CRLF to LF
3 years ago
Heller Zheng
bef47917bd
Initial version for riscv sifive x280
3 years ago
Martin Kroeker
7ae9e8960e
Change "HALF" and "sh" to "BFLOAT16" and "sb"
5 years ago
Martin Kroeker
ced49466f0
Use the fortran compiler to link LAPACK-related benchmarks
to fix linking problems with (at least) the AMD version of flang that creates dependencies on more than just the fortran runtime.
5 years ago
Rajalakshmi Srinivasaraghavan
ce90e2bd3f
Include shgemm in benchtest
This patch is to enable benchtest for half precision gemm
when BUILD_HALF is set during make.
5 years ago
Martin Kroeker
717c604aeb
Merge pull request #2515 from zelong-1024/develop
[OpenBLAS]: benchmark for her/her2 LEVEL2 functions
5 years ago
Martin Kroeker
ce33da4cab
Merge pull request #2513 from aaawuanjun/develop
[OpenBlas]: Add benchmark tpsv file and modify benchmark/Makefile
5 years ago
l00536773
d45c53ecf1
[OpenBLAS]: benchmark for her/her2 LEVEL2 functions
[description]: benchmark for her/her2
[solution]: added benchmark for her/her2, modified makefile in benchmark
[dts]:
5 years ago
Martin Kroeker
c0649aa694
Merge pull request #2506 from xiaofengF/develop
Add benchmark for SPMV and fix segmentation fault when data size >= 50000
5 years ago
wuanjun 00447568
2428dc9fd3
[OpenBlas]: Add benchmark tpsv file and modify benchmark/Makefile
[Description]: Solve lack of tpsv benchmark.
5 years ago
jayfely@qq.com
ae3f2c2e49
Remove cspmv and zspmv to remove the error occured in travis CI
5 years ago
jayfely@qq.com
649733ff15
Only keep spmv.goto and spmv.atlas
5 years ago
wuanjun 00447568
3e8f1c6cc5
[OpenBlas]:Add benchmark tpmv.c and modify Makefile
[Description]:Solve the problem of missing tpmv.c benchmark file
5 years ago
jayfely@qq.com
08e1d8cbae
Modify Makefile in Benchmark
5 years ago
jayfely@qq.com
ff40a4e726
Add benchmark for SPMV
5 years ago
s00548429
c5bdd21352
Add benchmark for ?amax, ?max, ?amin, ?min, i?max, i?amin and i?min.
5 years ago
Martin Kroeker
b6a6ccbbea
Merge pull request #2495 from ZuoQ3/develop
add benchmark for axpby test
5 years ago
zq
0c8162eba6
Add benchmark file axpby.c and modify benchmark/Makefile to test s/d/c/zaxpby
5 years ago
shengyang
09c7a191bd
add benchmark for csrot and zdrot
modified: benchmark/Makefile
modified: benchmark/rot.c
5 years ago
Martin Kroeker
dca3e0cf20
Merge pull request #2491 from chenxuqiang/hbmv_benchmark
benchmark/hpmv&hbmv: add benchmark/hpmv.c and benchmark/hbmv.c
5 years ago
Martin Kroeker
c9f8db979b
Merge pull request #2490 from shengyang-3390/develop
Add benchmark file rotm.c and modify benchmark/Makefile to test s/drotm
5 years ago
Martin Kroeker
97c36ca58c
Merge branch 'develop' into develop
5 years ago
Martin Kroeker
9f5a74f3c7
Merge pull request #2486 from qqqil/develop
add benchmark for trsv
5 years ago
Martin Kroeker
2afb10975d
Merge pull request #2485 from Darkness303/develop
Add syr2 benchmark
5 years ago
chenxuqiang
32c847df45
benchmark/hpmv&hbmv: add benchmark/hpmv.c and benchmark/hbmv.c
Signed-off-by: Xuqiang Chen chenxuqiang3@hisilicon.com
5 years ago
shengyang
e0df9485d4
Add benchmark file rotm.c and modify benchmark/Makefile to test s/drotm
modified: benchmark/Makefile
new file: benchmark/rotm.c
5 years ago
s00527847
0f1a2b12f9
add benchmark for spr/spr2
5 years ago
q00437336
de74e11641
add benchmark for trsv
5 years ago
Darkness303
114dbec947
1.Add syr2 benchmark
2.Fixed some errors
5 years ago
wuanjun 00447568
f682d19ed4
[OpenBlas]: add benchmark file trmv.c and modify benchmark/Makefile to test s/d/c/ztrmv
6 years ago
j00520245
e1062400c4
New add syr benchmark
6 years ago
Ashwin Sekhar T K
1530e78cfe
Benchmarks: Avoid building lapack benchmarks when NO_LAPACK=1
9 years ago
Ashwin Sekhar T K
925d4e1dc6
Add IAMAX and NRM2 benchmarks
9 years ago
Werner Saar
318cad9c37
added trsm bencharks for POWER8 to benchmark/Makefile
9 years ago
Werner Saar
dd2b897795
added bugfixes for some make files and smallscaling.c
9 years ago
Werner Saar
1ca750471a
added cholesky benchmarks to Makefile for ESSL
9 years ago
Werner Saar
08bddde3f3
updated benchmark Makefile for ESSL
9 years ago
Werner Saar
12540cedb5
added ESSL to Makefile for benchmarks
9 years ago
Werner Saar
7a92c1538e
added benchmark test for srot and drot
9 years ago
Jerome Robert
323c237e7b
Fix smallscaling compilation
Also revert 0bbca5e
10 years ago
Werner Saar
0bbca5e803
removed build of smallscaling, because build on arm, arm64 and power fails
10 years ago
Jerome Robert
73397faf68
Add benchmark/smallscaling.c
* Bench small matrices with multi-threading
* Close #727
10 years ago
Werner Saar
6a13a94e71
added gesv benchmark
10 years ago