Martin Kroeker
4609732e69
Relax version number requirement for AppleClang
5 months ago
Martin Kroeker
bf98e448eb
Add VORTEXM4 to DYNAMIC_ARCH list
5 months ago
Martin Kroeker
426b5f23ed
Add compiler options for VORTEXM4
5 months ago
Martin Kroeker
4328c91e27
relax requirements in compiler SME capability check
5 months ago
Martin Kroeker
c794d0a4ce
Add VORTEXM4
5 months ago
Martin Kroeker
a4f5fec46e
Add compiler options for VORTEXM4
5 months ago
Martin Kroeker
c504aedca1
Merge pull request #5400 from Mousius/neoversev2-target
Add NEOVERSEV2 target support
6 months ago
Martin Kroeker
2f89a5970e
fix NeoverseV2 typo
6 months ago
Chris Sidebottom
87247daadc
Add NEOVERSEV2 target support
Did a quick run around to make `TARGET=NEVOERSEV2` build successfully.
Fixes #5385
6 months ago
Martin Kroeker
a5b55f6fe3
remove CBLAS restriction on GEMM_GEMV forwarding
6 months ago
Martin Kroeker
82954ba4ca
Update ?GEMM-to-?GEMV forwarding settings
6 months ago
Martin Kroeker
b24212f5df
fix numbers
6 months ago
Martin Kroeker
6ff06f5483
Add cross-compilation data for RISCV64 targets
6 months ago
Chris Sidebottom
947d7af4c9
Fix CMake references to bscal and bgemv
6 months ago
Chris Sidebottom
72d2ebb4dd
Re-add GEMV fallback for Level3
6 months ago
Chris Sidebottom
e105411460
Add infrastructure for bgemv/bscal
- Sets up all the various entrypoints for `bgemv`
- Adds `bscal` for use in the `bgemv` interface
- Adds test cases for comparing `sgemv` and `bgemv`
- Adds generic kernels for `bgemv_n` and `bgemv_t` which are accurate
enough to pass above tests
6 months ago
Chris Sidebottom
66d9185ebe
Fix CMake support
6 months ago
Chris Sidebottom
f95e7b0e32
Add infrastructure for BGEMM
Setting up all the infrastructure for BGEMM support in OpenBLAS, hopefully I found all the right places.
Derived mostly from the previous work done in https://github.com/OpenMathLib/OpenBLAS/pull/5287
Co-authored-by: Ye Tao <ye.tao@arm.com>
7 months ago
Chris Sidebottom
552e1c7a7a
Correct compiler flags for NEOVERSEV1 target
6 months ago
Usui, Tetsuzo
14107e37d9
Add parallel laed3
7 months ago
Martin Kroeker
560fa88c96
Add cross-build parameters for Ampere One
7 months ago
Martin Kroeker
55bb5ef867
Add compiler options for Ampere One
7 months ago
Srangrang
0a967797a1
Add FP16 support for RISCV
8 months ago
Martin Kroeker
f2022c23ac
Remove sve capability from NeoverseN1 and specify CortexX2/A?10 as arm8.4a
8 months ago
Martin Kroeker
d9369bda1e
Update and amend parameters for Neoverse cpus
9 months ago
Ruiyang Wu
1b0c0f00e9
CMake: Avoid mixed OpenMP linkage
10 months ago
Ruiyang Wu
02fd1df10b
CMake: Pass `OpenMP` compiler and linker flags through CMake targets
Using `OpenMP::OpenMP_LANG` targets for CMake is less error-prone than
passing the compiler and linker flags manually. Furthermore, it allows
the user to customize those flags by setting `OpenMP_LANG_FLAGS`,
`OpenMP_LANG_LIB_NAMES`, and `OpenMP_omp_LIBRARY`.
10 months ago
Martin Kroeker
b34235ca66
Fix inclusion of deprecated interfaces and cgesvdq/strsyl3
10 months ago
Martin Kroeker
f1fa370579
fix missing endif
11 months ago
Martin Kroeker
6d1444be3a
Add ARM64 options for NVIDIA HPC
11 months ago
Vaisakh K V
f66ca05b31
Merge branch 'develop' into topic/sgemm_direct_sme1
11 months ago
Vaisakh K V
d23eb3b93e
Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API
* Added ARMV9SME target
* Added SGEMM_DIRECT kernel based on SME1
1 year ago
Martin Kroeker
877d5a5be6
Add -O2 to flang flags when building on WoA in Release mode
11 months ago
Martin Kroeker
262018f14c
Merge pull request #5092 from XiWeiGu/la64_fixed_cmake
LoongArch64: Fixed cmake
1 year ago
Martin Kroeker
180ba5e7d0
Merge pull request #5069 from tingboliao/dev_rotm_20250107
Further rearranged the rotm kernel for the different architectures.
1 year ago
gxw
1ebcbdbab3
LoongArch64: Fixed the issue of using the old-style TARGET in cmake builds
1 year ago
Martin Kroeker
111c9b0733
Add translations for C_COMPILER and OSNAME
1 year ago
tingbo.liao
3c8df6358f
Further rearranged the rotm kernel for the different architectures.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
1 year ago
Martin Kroeker
fbf594b62f
Guard against empty CMAKE_Fortran_COMPILER_ID
1 year ago
Martin Kroeker
d78fbe425c
Assume no underline suffixes on symbols when compiling with ifx on Windows
1 year ago
Martin Kroeker
30188a55d1
Don't assume underlined symbols for ifx; make cpuid.S inclusion conditional
1 year ago
Martin Kroeker
32319a33ac
Add options for Intel oneAPI 2025.0 ifx on Windows
1 year ago
Matthew Thompson
c4e8bac5a5
Fix indent
1 year ago
Matthew Thompson
be19966d3b
Fixes for NAG CMake
1 year ago
Matthew Thompson
2eaf285de5
Use F_COMPILER name
1 year ago
Matthew Thompson
a8b1705dbd
CMake build has wrong PIC flag for NAG
1 year ago
Martin Kroeker
57a51d74c9
translate CMAKE_SYSTEM_NAME in compilations on or for IOS
1 year ago
Martin Kroeker
cea9df3643
Update Cray compiler options and calling convention
1 year ago
Chip Kerchner
36bd3eeddf
Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power).
1 year ago
Martin Kroeker
b0346e72f4
update names of loongarch64 targets for cross-compilation
1 year ago