Evgeni Burovski
|
fa1f173bbd
|
asv config
|
1 year ago |
Evgeni Burovski
|
058e85dc73
|
MAINT: install virualenv for asv
|
1 year ago |
Evgeni Burovski
|
17d9f2c828
|
CI: run asv on CI
|
1 year ago |
Evgeni Burovski
|
efae015273
|
vendor the asv setup
|
1 year ago |
Evgeni Burovski
|
141e422933
|
add asv.conf.json
|
1 year ago |
Evgeni Burovski
|
e05185747b
|
install the nightly wheel
|
1 year ago |
Evgeni Burovski
|
9f98cc9285
|
python3 -m asv
|
1 year ago |
Evgeni Burovski
|
c5ab738833
|
sudo apt manually
|
1 year ago |
Evgeni Burovski
|
4b2aa2e947
|
..
|
1 year ago |
Evgeni Burovski
|
612c29063a
|
.
|
1 year ago |
Evgeni Burovski
|
e23410d128
|
rename
|
1 year ago |
Evgeni Burovski
|
e07e91a9d4
|
REVERT: remove cirun-bench workflow
|
1 year ago |
Evgeni Burovski
|
412d81a9c6
|
WIP: start adding a cirun-asv workflow
|
1 year ago |
Evgeni Burovski
|
7e7edd954e
|
CI: run benchmarks on cirun
|
1 year ago |
Evgeni Burovski
|
f09031bc67
|
REVERT: temp remove other workflows
|
1 year ago |
Martin Kroeker
|
33bb4b98a4
|
Improve error message output from the fork() utest (#4753)
* Add perror to report the reason for a fork failure
* reword the malloc failure message
|
1 year ago |
Martin Kroeker
|
f13403b6b6
|
Merge pull request #4755 from martin-frbg/issue4739
Fix Intel oneAPI compiler support in the CMAKE build
|
1 year ago |
Martin Kroeker
|
8bc37f9384
|
Merge pull request #4754 from martin-frbg/issue4750-2
Add a clobber list to the arm64 SVE DOT kernel
|
1 year ago |
Martin Kroeker
|
d25ee4d0f5
|
Fix detection of Intel ifx and apply -fp-model option to it
|
1 year ago |
Martin Kroeker
|
21c0f769ef
|
ensure that cpu-specific -march options are always applied to icx
|
1 year ago |
Martin Kroeker
|
3d8054fb16
|
add clobber list
|
1 year ago |
Martin Kroeker
|
fdb88e010f
|
Merge pull request #4749 from XiWeiGu/loongarch64-qemu-update
LoongArch64: Update QEMU
|
1 year ago |
Martin Kroeker
|
dd7efcf9ef
|
Avoid exceeding the configured thread count in x86_64 TOBF16 (#4748)
* avoid setting nthreads higher than available
|
1 year ago |
guxiwei
|
ed5db5b122
|
LoongArch64: Update the address for obtaining the Clang cross-toolchain
Improve the stability and speed of testing
|
1 year ago |
guxiwei
|
1ca1bb829d
|
LoongArch64: Update QEMU
Compile the community version of QEMU to support LSX/LASX extension instructions
|
1 year ago |
Martin Kroeker
|
62c33db37d
|
Merge pull request #4746 from martin-frbg/issue4743
Correct CMAKE build definitions for CAXPYC/ZAXPYC
|
1 year ago |
Martin Kroeker
|
2f12a47405
|
fix build options for CAXPYC/ZAXPYC
|
1 year ago |
Martin Kroeker
|
f955616f98
|
Merge pull request #4740 from martin-frbg/fixlapackmod
remove LAPACK .mod files during make clean
|
1 year ago |
Martin Kroeker
|
f96ee86711
|
remove .mod files during make clean
|
1 year ago |
Martin Kroeker
|
442dec28df
|
Merge pull request #4738 from martin-frbg/issue4737
Disable GEMM3M for generic targets (not implemented)
|
1 year ago |
Martin Kroeker
|
0cf8b98f61
|
Merge pull request #4736 from XiWeiGu/loongarch_issue4728
LoongArch: Fixed issue 4728
|
1 year ago |
Martin Kroeker
|
2787c9f8e4
|
Disable GEMM3M for generic targets (not implemented)
|
1 year ago |
gxw
|
af73ae6208
|
LoongArch: Fixed issue 4728
|
1 year ago |
Martin Kroeker
|
4e9144b39f
|
Update .cirrus.yml (#4735)
* Update versions (and paths) of XCode, and update FreeBSD version
|
1 year ago |
Martin Kroeker
|
0c2ac76a63
|
Merge pull request #4734 from XiWeiGu/loongarch64_small_matrix
LoongArch: DGEMM small matrix opt
|
1 year ago |
Martin Kroeker
|
913be34bf0
|
Merge pull request #4733 from martin-frbg/issue4719
Drop the -static Fortran flag from generic RISCV builds as it breaks OpenMP
|
1 year ago |
gxw
|
8ab2e9ec65
|
LoongArch: DGEMM small matrix opt
|
2 years ago |
Martin Kroeker
|
df87aeb5a2
|
Drop the -static Fortran flag from generic builds as it breaks OpenMP
|
1 year ago |
Martin Kroeker
|
3a3ff1ba5e
|
Merge pull request #4732 from martin-frbg/issue4731
fix conflicting types for cblas_sbgemm_batch
|
1 year ago |
Martin Kroeker
|
db9f7bc552
|
fix float array types to include bfloat16
|
1 year ago |
Martin Kroeker
|
a9fae32a33
|
Merge pull request #4730 from jake-arkinstall/develop
Updated CONTRIBUTORS.md
|
1 year ago |
Jake Arkinstall
|
44004178aa
|
Updated CONTRIBUTORS.md
As requested on X (https://x.com/KroekerMartin/status/1755218919290278185)
|
1 year ago |
Martin Kroeker
|
83bc8d5dd8
|
Merge pull request #4712 from RajalakshmiSR/zscalp10
POWER: Fix issues in zscal to address lapack failures
|
1 year ago |
Martin Kroeker
|
56bd57ca99
|
Merge pull request #4720 from martin-frbg/issue3039
Resurrect and complete cblas_?gemm_batch
|
1 year ago |
Martin Kroeker
|
6b564d53fd
|
Merge pull request #4727 from martin-frbg/issue4726
Fix another corner case of infinity handling in x86_64 ZSCAL
|
1 year ago |
Martin Kroeker
|
db070a9223
|
add gemm_batch drivers
|
1 year ago |
Martin Kroeker
|
076766df4e
|
Update CMakeLists.txt
|
1 year ago |
Martin Kroeker
|
8c05765a5a
|
fix other corner cases where x=INF
|
1 year ago |
Martin Kroeker
|
516743f7dc
|
fix other instances of mishandling INF
|
1 year ago |
Martin Kroeker
|
9ff4e9714e
|
additional fixes for handling INF arguments
|
1 year ago |