Juliya32 | 668e28adc4 | Delete kernel/arm64/rot.c | 1 year ago
SushilPratap04 | fa880ab1cf | Update KERNEL.ARMV8SVE - updated KERNEL.ARMV8SVE for level 1 sve (swap, rot and scal) kernels. | 1 year ago
SushilPratap04 | 7822ae9617 | Added sve kernels for rot routine. | 1 year ago
SushilPratap04 | b8bc2a752e | Added sve optimized kernels for swap routine | 1 year ago
CDAC-SSDG | 0667cf6c92 | Added optimized scal routine files | 1 year ago
CDAC-SSDG | 2718b37fed | Update CONTRIBUTORS.md | 1 year ago
Martin Kroeker | f66e6d32c2 | Merge pull request #4953 from NickelWenzel/fix_trtrs_return_types - fix: return types of *trtrs routines | 1 year ago
Martin Kroeker | a8bb105ed6 | Merge pull request #4848 from haampie/fix/cmake-min-version - cmake: set `CMP0042` to `NEW` | 1 year ago
Martin Kroeker | 0e6a2cc93c | bump the minimum_required version instead | 1 year ago
Martin Kroeker | ac736820d7 | Merge pull request #4955 from cdaley/optimize_gemv_forwarding - Optimize gemv forwarding on ARM64 systems | 1 year ago
Chris Daley | cb48505251 | optimize gemv forwarding on ARM64 systems | 1 year ago
nickel | 79f4bbd4cd | fix: return types of *trtrs routines | 1 year ago
Martin Kroeker | 72461f1c8c | Merge pull request #4950 from ayappanec/fix-aix-build - Fix AIX build | 1 year ago
Ayappan Perumal | 020cce1068 | Fix build issues with gcc compiler as well | 1 year ago
Ayappan Perumal | b6ec73e77c | Fix AIX build | 1 year ago
Martin Kroeker | 8a0cd5fcef | Merge pull request #4949 from martin-frbg/mingw32-14.2 - work around mingw32-gfortran 14.2 miscompiling CBLAS1 tests | 1 year ago
Martin Kroeker | 4dba6ce6ea | work around mingw32-gfortran 14.2 miscompiling CBLAS1 tests | 1 year ago
Martin Kroeker | a93ec74e95 | Merge pull request #4948 from martin-frbg/fixhavesve - Properly report HAVE_SVE in ARM64 autodetection where applicable | 1 year ago
Martin Kroeker | c4bb4e74fc | NeoverseN2 has SVE too | 1 year ago
Martin Kroeker | 86720778ef | write HAVE_SVE to config where applicable | 1 year ago
Martin Kroeker | 016bdb9b0b | Merge pull request #4946 from XiWeiGu/la64_omatcopy_lasx - LoongArch64: Opt somatcopy with LASX | 1 year ago
gxw | ffaa5765a4 | Bench: Add omatcopy | 1 year ago
Martin Kroeker | a93897276b | Merge pull request #4943 from martin-frbg/update_readme - Update README.md | 1 year ago
Martin Kroeker | 3fc1225dd6 | Merge branch 'OpenMathLib:develop' into update_readme | 1 year ago
Martin Kroeker | 33078d11e4 | stress importance of TARGET setting in DYNAMIC_ARCH builds | 1 year ago
Martin Kroeker | 15a57598f5 | Merge pull request #4944 from ChipKerchner/vectorizeBF16GEMV - [POWER] Vectorize BF16 GEMV | 1 year ago
Chip Kerchner | ab71a1edf2 | Better VSX. | 1 year ago
gxw | bb31bbef52 | LoongArch64: Opt somatcopy_ct with LASX | 1 year ago
gxw | b37129341b | LoongArch64: Opt somatcopy_cn with LASX | 1 year ago
gxw | acf6cab304 | LoongArch64: Opt somatcopy_rn with LASX | 1 year ago
gxw | 15edb441bf | LoongArch64: Opt somatcopy_rt with LASX | 1 year ago
Martin Kroeker | 457d1c6972 | remove unused CI badges, wiki->docs, xianyi->OpenMathLib | 1 year ago
Martin Kroeker | 6a60eb1a02 | Merge pull request #4924 from XiWeiGu/la64_readme - LoongArch64: Update README.md | 1 year ago
Martin Kroeker | 8483a71169 | Merge pull request #4937 from martin-frbg/lapack1064 - Fix leading dimension for B in LAPACK tests for GGEV (Reference-LAPACK PR 1064) | 1 year ago
Martin Kroeker | 22628f1a69 | Fix leading dimension for B (Reference-LAPACK PR 1064) | 1 year ago
Martin Kroeker | 27ed6da331 | Fix leading dimension for B (Reference-LAPACK PR 1064) | 1 year ago
Martin Kroeker | 7018c1b001 | Fix leading dimension for B (Reference-LAPACK PR 1064) | 1 year ago
Martin Kroeker | a659f40fe1 | Fix leading dimension for B (Reference-LAPACK PR 1064) | 1 year ago
Martin Kroeker | c979c1d948 | Merge pull request #4936 from martin-frbg/fixmips64generic - Fix unroll parameter selection for MIPS64_GENERIC | 1 year ago
Martin Kroeker | a47b3c8867 | Fix unroll parameter selection for MIPS64_GENERIC | 1 year ago
Chip Kerchner | 2391dc1c0f | Merge branch 'vectorizeBF16GEMV' of github.ibm.com:PowerAppLibs/OpenBLAS into vectorizeBF16GEMV | 1 year ago
Chip Kerchner | 36bd3eeddf | Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). | 1 year ago
Chip Kerchner | f8e113f27b | Replace types with include file. | 1 year ago
Chip Kerchner | a53a197934 | Merge remote-tracking branch 'origin/develop' into vectorizeBF16GEMV | 1 year ago
Martin Kroeker | 3184b7f209 | Merge pull request #4933 from ChipKerchner/thread_sbgemv - Change multi-threading logic for SBGEMV to be the same as SGEMV. | 1 year ago
Chip Kerchner | 0082240044 | Merge branch 'thread_sbgemv' into vectorizeBF16GEMV | 1 year ago
Chip Kerchner | 1d51ca5798 | Change multi-threading logic for SBGEMV to be the same as SGEMV. | 1 year ago
Chip Kerchner | c8f53b85ce | Merge remote-tracking branch 'origin/develop' into vectorizeBF16GEMV | 1 year ago
Martin Kroeker | 18a23c23f7 | Merge pull request #4929 from martin-frbg/issue4905 - Fix CBLAS_?GEMMT filling in the wrong triangle for Row-Major | 1 year ago
Martin Kroeker | 5a79446bdb | Merge pull request #4918 from HaoZeke/testFixes - TST,BUG: Explicitly allow running tests multiple times | 1 year ago