Martin Kroeker
b3904aeed7
Merge pull request #5323 from imciner2/im/ofast
Switch power to use O3 instead of Ofast
7 months ago
Ian McInerney
721c80644b
Switch power to use O3 instead of Ofast
-Ofast enables potentially unsafe optimizations on top of -O3. The flag
appears to have been added once and then copied into each later
Power architecture, and it was never included in the CMake build system
when that was introduced.
Replace it with -O3 so that the compiler applies the same level of
optimization in both build systems.
7 months ago
Martin Kroeker
4e6da5ed34
Update version to 0.3.30.dev
7 months ago
Martin Kroeker
8dff37827e
Update version to 0.3.30.dev
7 months ago
Martin Kroeker
c055c36b40
Merge pull request #5317 from OpenMathLib/release-0.3.0
merge back from 0.3.0 to copy tag
7 months ago
Martin Kroeker
993fad6aeb
Update version to 0.3.30
7 months ago
Martin Kroeker
3382763df6
Update version to 0.3.30
7 months ago
Martin Kroeker
e81fca06dd
Merge pull request #5316 from OpenMathLib/develop
Update from develop for 0.3.30 release
7 months ago
Martin Kroeker
d339bd5515
Merge pull request #5308 from martin-frbg/changelog0330
Update the Changelog for version 0.3.30
7 months ago
Martin Kroeker
157273fda0
another round of last minute updates for 0.3.30
7 months ago
Martin Kroeker
1546599a13
Merge pull request #5315 from loss-and-quick/arm-exec-stack
Add .note.GNU-stack in ARM epilogue to avoid writable stack
7 months ago
minicx
79b4dd0fb0
fix(arm): add .note.GNU-stack to ARM assembly to prevent writable-stack warnings
Add .section .note.GNU-stack in ARM assembly epilogue on Linux/ELF targets to
avoid warnings about a writable/executable stack and ensure shared objects do
not require an executable stack.
Signed-off-by: minicx <minicx@disroot.org>
7 months ago
Martin Kroeker
c2342fc2d0
Merge pull request #5314 from martin-frbg/dynampere1
Support AmpereOne/OneA as NeoverseN1 in DYNAMIC_ARCH builds
7 months ago
Martin Kroeker
e541bf68f5
support AmpereOne/OneA as NeoverseN1
7 months ago
Martin Kroeker
5ad6435660
Merge pull request #5312 from martin-frbg/x86cdot
Work around X86 POTRS/CDOT bug on old systems and add CI job for 32bit manylinux
7 months ago
Martin Kroeker
e684e36377
Add 32bit manylinux to match what python wheel build tests use
7 months ago
Martin Kroeker
3318a2b904
override CDOT and ZDOT with the generic C kernel
7 months ago
Martin Kroeker
85337c5160
Merge pull request #5310 from nakagawa-fj/bugfix/identify_cpu_part_for_arm64
Bug Fix: Problem with identifying some ARM64 processors
8 months ago
Martin Kroeker
53cd6e7ff7
Update Changelog.txt
8 months ago
Masato Nakagawa
1dd396033a
Fix: Problem with identifying some ARM64 processors.
8 months ago
Martin Kroeker
f1097d1cba
Merge pull request #5306 from martin-frbg/lapack1131
Fix missing initialization leading to bypassing corner cases in C/ZGEQP3RK (Reference-LAPACK PR #1131)
8 months ago
Martin Kroeker
3fe7f196e6
Update the Changelog for version 0.3.30
8 months ago
Martin Kroeker
bad47bd024
Fix too strict leading dimensions check in LAPACKE_?gesdd_work (Reference-LAPACK PR #1126) (#5307)
* relax leading dimensions check (Reference-LAPACK PR #1126)
8 months ago
Martin Kroeker
7f3093a0ad
Merge pull request #5305 from martin-frbg/lapack1135
Fix 2nd dimension used by LAPACKE_c/zunmlq in NaN check and transposition (Reference-LAPACK PR #1135)
8 months ago
Martin Kroeker
1804ff58d7
fix missing initialization
8 months ago
Martin Kroeker
906b9df316
fix missing initialization
8 months ago
Martin Kroeker
f4e5177050
fix dimension used in nancheck (Reference-LAPACK PR 1135)
8 months ago
Martin Kroeker
2a6beac88f
fix dimension used in transposition (Reference-LAPACK PR 1135)
8 months ago
Martin Kroeker
d8a2324699
fix dimension used in nancheck (Reference-LAPACK PR 1135)
8 months ago
Martin Kroeker
874744976c
fix dimension used in nancheck (Reference-LAPACK PR 1135)
8 months ago
Martin Kroeker
0ea173ec8c
Merge pull request #5304 from martin-frbg/fixgemmtr_if
fix source file used for sbgemmt/sbgemmtr in CMake builds
8 months ago
Martin Kroeker
5e393f207c
fix source file used for sbgemmt/sbgemmtr
8 months ago
Martin Kroeker
dbd5643d37
Merge pull request #5302 from martin-frbg/zscal_mips_3
mips64 SICORTEX: temporarily change default C/ZSCAL to the non-asm implementation
8 months ago
Martin Kroeker
e338d34ce1
fix path
8 months ago
Martin Kroeker
d36093d084
temporarily change default C/ZSCAL to the non-asm implementation
8 months ago
Martin Kroeker
cc4b04a684
Merge pull request #5301 from martin-frbg/zscal_mips_2
kernel/mips(64): Fix cscal and zscal
8 months ago
Martin Kroeker
b3c90564d7
resync with the generic arm version for inf/nan handling
8 months ago
Martin Kroeker
6bdc7f9eb7
Merge pull request #5300 from martin-frbg/fixup5296
kernel/riscv64: Fix cscal/zscal for riscv64_generic
8 months ago
Martin Kroeker
63272b6c82
Merge pull request #5299 from martin-frbg/x86_64-ssezscal
Disable the default SSE kernels for x86_64 CSCAL/ZSCAL for now
8 months ago
Martin Kroeker
73af02b89f
use dummy2 as Inf/NAN handling flag
8 months ago
Martin Kroeker
549a9f1dbb
Disable the default SSE kernels for CSCAL/ZSCAL for now
8 months ago
Martin Kroeker
ca1ce84ee5
Merge pull request #5298 from martin-frbg/fixup5281
Fix PR5281 "kernel/arm64: fix cscal/zscal"
8 months ago
Martin Kroeker
58eeb9041c
fix handling of dummy2
8 months ago
Martin Kroeker
7c77537b25
Merge pull request #5297 from martin-frbg/zscal_x86_sparc
kernel/(x86|sparc): Fix cscal and zscal by reverting to the generic C kernels
8 months ago
Martin Kroeker
63287e1855
Merge pull request #5296 from martin-frbg/zscal_riscv
kernel/riscv64: Fix cscal and zscal
8 months ago
Martin Kroeker
d2855d3dab
Merge pull request #5285 from martin-frbg/zscal_zarch
kernel/zarch: Fix cscal and zscal
8 months ago
Martin Kroeker
1408be5fe0
Merge pull request #5282 from martin-frbg/zscal_power
kernel/power: Fixed cscal and zscal
8 months ago
Martin Kroeker
1589d0b21e
Merge pull request #5281 from martin-frbg/zscal_arm64
kernel/arm64: fixed cscal and zscal
8 months ago
Martin Kroeker
a86419fb66
Merge pull request #5280 from martin-frbg/zscal_x86_64
kernel/x86_64: fixed cscal and zscal
8 months ago
Martin Kroeker
11ff18bb0f
Merge pull request #5081 from XiWeiGu/kernel_generic_fixed_cscal_zscal
kernel/generic: Fixed cscal and zscal
8 months ago