Sergei Lewis
a3b0ef6596
Restore riscv64 fixes from develop branch: dot product double precision accumulation, zscal NaN handling
2 years ago
Sergei Lewis
1093def0d1
Merge branch 'risc-v' into develop
2 years ago
Martin Kroeker
8892121130
Merge pull request #4462 from martin-frbg/issue4449
Use +sve in arch declarations of the fallback paths for SVE targets
2 years ago
Martin Kroeker
48a4c4d454
Use +sve in arch declarations of the fallback paths for SVE targets
2 years ago
Martin Kroeker
889c5d026a
Merge pull request #4456 from kseniyazaytseva/riscv-rvv10
Fix BLAS and LAPACK tests for RVV 1.0 target, update to 0.12.0 intrincics
2 years ago
Martin Kroeker
4e2a32ff51
Merge pull request #4454 from kseniyazaytseva/riscv-rvv07
Fix BLAS and LAPACK tests for C910V and RISCV64_ZVL256B targets
2 years ago
Martin Kroeker
a21b2fa5e4
Merge pull request #4452 from kseniyazaytseva/riscv-generic
Fix BLAS, BLAS-like functions and Generic RISC-V kernels
2 years ago
Andrey Sokolov
73530b03fa
remove RISCV64_ZVL256B additional extentions
2 years ago
kseniyazaytseva
86943afa9c
Fix x280 taget include riscv_vector.h
2 years ago
Andrey Sokolov
9c49a81d54
Resolve conflicts
2 years ago
kseniyazaytseva
e1afb23811
Fix BLAS and LAPACK tests for C910V and RISCV64_ZVL256B targets
* Fixed bugs in dgemm, [a]min\max, asum kernels
* Added zero checks for BLAS kernels
* Added dsdot implementation for RVV 0.7.1
* Fixed bugs in _vector files for C910V and RISCV64_ZVL256B targets
* Added additional definitions for RISCV64_ZVL256B target
2 years ago
Martin Kroeker
d6a5174e9c
Merge pull request #4447 from RevySR/update-thead-toolchains
Update T-Head toolchains v2.8.0
2 years ago
Han Gao/Revy/Rabenda
304a9b60af
Update T-Head toolchains v2.8.0
Signed-off-by: Han Gao/Revy/Rabenda <rabenda.cn@gmail.com>
2 years ago
Martin Kroeker
f5de4fad27
Merge pull request #4444 from Mousius/part-mapping
Add dynamic support for Arm(R) Neoverse(TM) V2 processor
2 years ago
Chris Sidebottom
aaf65210cc
Add dynamic support for Arm(R) Neoverse(TM) V2 processor
Whilst I figure out how best to map the L2 parameters without
duplicating all of `ARMV8SVE`, lets just map this to `NEOVERSEV1`.
2 years ago
Martin Kroeker
10c22f4a39
Merge pull request #4355 from imaginationtech/img-riscv64-zvl128b
[RISC-V] Add RISC-V Vector 128-bit target
2 years ago
Octavian Maghiar
ccbc3f875b
[RISC-V] Add RISCV64_ZVL128B target to common_riscv64.h
2 years ago
Octavian Maghiar
deecfb1a39
Merge branch 'risc-v' into img-riscv64-zvl128b
2 years ago
kseniyazaytseva
f89e0034a4
Fix LAPACK usage from BLAS
2 years ago
Martin Kroeker
f7cf637d7a
redo lost edit
2 years ago
Martin Kroeker
85548e66ca
Fix build failures seen with the NO_LAPACK option - cspr/csymv/csyr belong on the LAPACK list
2 years ago
Martin Kroeker
f129161453
restore C/Z SPMV, SPR, SYR,SYMV
2 years ago
kseniyazaytseva
5222b5fc18
Added axpby kernels for GENERIC RISC-V target
2 years ago
Martin Kroeker
1c04df20bd
Re-enable overriding the LAPACK SYMV,SYR,SPMV and SPR implementations
2 years ago
Martin Kroeker
5b4df851d7
fix stray blank on continuation line
2 years ago
kseniyazaytseva
ff41cf5c49
Fix BLAS, BLAS-like functions and Generic RISC-V kernels
* Fixed gemmt, imatcopy, zimatcopy_cnc functions
* Fixed cblas_cscal testing in ctest
* Removed rotmg unreacheble code
* Added zero size checks
2 years ago
Martin Kroeker
500442cf96
Merge pull request #4442 from pbo-linaro/fix-utest-compilation
Fix utest compilation
2 years ago
kseniyazaytseva
b193ea3d7b
Fix BLAS and LAPACK tests for RVV 1.0 target, update to 0.12.0 intrincics
* Update intrincics API to 0.12.0 version (Stride Segment Loads/Stores)
* Fixed nrm2, axpby, ncopy, zgemv and scal kernels
* Added zero size checks
2 years ago
Pierrick Bouvier
a4992e09bc
Fix utest compilation
Introduced recently when adding new test cases for ZSCAL
- include cblas is needed for cblas_zscal
- ASSERT macro does not exist
- missing closing )
2 years ago
Martin Kroeker
6f0e0e4021
Merge pull request #4438 from Dirreke/csky-support
Add CSKY support
2 years ago
Martin Kroeker
43cb266178
Merge pull request #4441 from martin-frbg/gemv-threshold
Increase multithreading threshold for S/DGEMV by a factor of 50
2 years ago
Martin Kroeker
d2fc4f3b4d
Increase multithreading threshold by a factor of 50
2 years ago
Martin Kroeker
88e994116c
Merge pull request #4354 from imaginationtech/img-rvv-kernel-generator
[RISC-V] Improve RVV kernel generator LMUL usage
2 years ago
Martin Kroeker
ec46ca7a43
Support Arm Compiler for Linux as classic flang ( #4436 )
* Support ArmCompilerforLinux as classic flang
2 years ago
Martin Kroeker
e3508d3713
Merge pull request #4439 from sergei-lewis/risc-v
Fix builds with t-head toolchains that use old intrinsics spec
2 years ago
Dirreke
ec89466e14
Add CSKY support
2 years ago
Sergei Lewis
9edb805e64
fix builds with t-head toolchains that use old versions of the intrinsics spec
2 years ago
Martin Kroeker
452741b67f
Merge pull request #4435 from imciner2/im/sapphire
Fix Clang sapphire rapids march flag
2 years ago
Ian McInerney
8f4e325ea8
Fix Clang sapphire rapids march flag
2 years ago
Martin Kroeker
13c764eaaa
Merge pull request #4434 from martin-frbg/issue4433
Only use mtune=native in ARM64 fallback paths when not cross-compiling
2 years ago
Martin Kroeker
025a1b2c7b
Only use mtune=native when not cross-compiling
2 years ago
Martin Kroeker
2527afaaa2
Merge pull request #4429 from martin-frbg/issue4428
Handle NAN and INF in ARM and generic/s390x ZSCAL
2 years ago
Martin Kroeker
0d2e486edf
Handle NAN and INF
2 years ago
Martin Kroeker
a782103b9c
Merge pull request #4425 from martin-frbg/issue2392
Add BLAS extension openblas_set_num_threads_local()
2 years ago
Martin Kroeker
152a6c43b6
Add blas_omp_threads_local
2 years ago
Martin Kroeker
8a9d492af7
Add default for blas_omp_threads_local
2 years ago
Martin Kroeker
b3341527ad
Merge pull request #4426 from martin-frbg/issue4415
Tweak LAPACK tests for SGS/DGS to avoid spurious errors resulting from FMA-induced inaccuracies
2 years ago
Martin Kroeker
9fab60d32f
Remove matrix dimension 6 from SGS to avoid spurious errors from FMA
2 years ago
Martin Kroeker
bf66af3dc0
remove matrix dimension 6 from DGS to avoid spurious errors from FMA
2 years ago
Martin Kroeker
87d31af2ae
Add openblas_set_num_threads_local()
2 years ago