Martin Kroeker
c504aedca1
Merge pull request #5400 from Mousius/neoversev2-target
Add NEOVERSEV2 target support
9 months ago
Martin Kroeker
2f89a5970e
fix NeoverseV2 typo
9 months ago
Chris Sidebottom
87247daadc
Add NEOVERSEV2 target support
Did a quick run around to make `TARGET=NEVOERSEV2` build successfully.
Fixes #5385
9 months ago
Martin Kroeker
a5b55f6fe3
remove CBLAS restriction on GEMM_GEMV forwarding
9 months ago
Martin Kroeker
82954ba4ca
Update ?GEMM-to-?GEMV forwarding settings
9 months ago
Martin Kroeker
b24212f5df
fix numbers
9 months ago
Martin Kroeker
6ff06f5483
Add cross-compilation data for RISCV64 targets
9 months ago
Chris Sidebottom
947d7af4c9
Fix CMake references to bscal and bgemv
9 months ago
Chris Sidebottom
72d2ebb4dd
Re-add GEMV fallback for Level3
9 months ago
Chris Sidebottom
e105411460
Add infrastructure for bgemv/bscal
- Sets up all the various entrypoints for `bgemv`
- Adds `bscal` for use in the `bgemv` interface
- Adds test cases for comparing `sgemv` and `bgemv`
- Adds generic kernels for `bgemv_n` and `bgemv_t` which are accurate
enough to pass above tests
9 months ago
Chris Sidebottom
66d9185ebe
Fix CMake support
10 months ago
Chris Sidebottom
f95e7b0e32
Add infrastructure for BGEMM
Setting up all the infrastructure for BGEMM support in OpenBLAS, hopefully I found all the right places.
Derived mostly from the previous work done in https://github.com/OpenMathLib/OpenBLAS/pull/5287
Co-authored-by: Ye Tao <ye.tao@arm.com>
10 months ago
Chris Sidebottom
552e1c7a7a
Correct compiler flags for NEOVERSEV1 target
10 months ago
Usui, Tetsuzo
14107e37d9
Add parallel laed3
10 months ago
Martin Kroeker
560fa88c96
Add cross-build parameters for Ampere One
10 months ago
Martin Kroeker
55bb5ef867
Add compiler options for Ampere One
10 months ago
Srangrang
0a967797a1
Add FP16 support for RISCV
11 months ago
Martin Kroeker
f2022c23ac
Remove sve capability from NeoverseN1 and specify CortexX2/A?10 as arm8.4a
11 months ago
Martin Kroeker
d9369bda1e
Update and amend parameters for Neoverse cpus
1 year ago
Ruiyang Wu
1b0c0f00e9
CMake: Avoid mixed OpenMP linkage
1 year ago
Ruiyang Wu
02fd1df10b
CMake: Pass `OpenMP` compiler and linker flags through CMake targets
Using `OpenMP::OpenMP_LANG` targets for CMake is less error-prone than
passing the compiler and linker flags manually. Furthermore, it allows
the user to customize those flags by setting `OpenMP_LANG_FLAGS`,
`OpenMP_LANG_LIB_NAMES`, and `OpenMP_omp_LIBRARY`.
1 year ago
Martin Kroeker
b34235ca66
Fix inclusion of deprecated interfaces and cgesvdq/strsyl3
1 year ago
Martin Kroeker
f1fa370579
fix missing endif
1 year ago
Martin Kroeker
6d1444be3a
Add ARM64 options for NVIDIA HPC
1 year ago
Vaisakh K V
f66ca05b31
Merge branch 'develop' into topic/sgemm_direct_sme1
1 year ago
Vaisakh K V
d23eb3b93e
Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API
* Added ARMV9SME target
* Added SGEMM_DIRECT kernel based on SME1
1 year ago
Martin Kroeker
877d5a5be6
Add -O2 to flang flags when building on WoA in Release mode
1 year ago
Martin Kroeker
262018f14c
Merge pull request #5092 from XiWeiGu/la64_fixed_cmake
LoongArch64: Fixed cmake
1 year ago
Martin Kroeker
180ba5e7d0
Merge pull request #5069 from tingboliao/dev_rotm_20250107
Further rearranged the rotm kernel for the different architectures.
1 year ago
gxw
1ebcbdbab3
LoongArch64: Fixed the issue of using the old-style TARGET in cmake builds
1 year ago
Martin Kroeker
111c9b0733
Add translations for C_COMPILER and OSNAME
1 year ago
tingbo.liao
3c8df6358f
Further rearranged the rotm kernel for the different architectures.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
1 year ago
Martin Kroeker
fbf594b62f
Guard against empty CMAKE_Fortran_COMPILER_ID
1 year ago
Martin Kroeker
d78fbe425c
Assume no underline suffixes on symbols when compiling with ifx on Windows
1 year ago
Martin Kroeker
30188a55d1
Don't assume underlined symbols for ifx; make cpuid.S inclusion conditional
1 year ago
Martin Kroeker
32319a33ac
Add options for Intel oneAPI 2025.0 ifx on Windows
1 year ago
Matthew Thompson
c4e8bac5a5
Fix indent
1 year ago
Matthew Thompson
be19966d3b
Fixes for NAG CMake
1 year ago
Matthew Thompson
2eaf285de5
Use F_COMPILER name
1 year ago
Matthew Thompson
a8b1705dbd
CMake build has wrong PIC flag for NAG
1 year ago
Martin Kroeker
57a51d74c9
translate CMAKE_SYSTEM_NAME in compilations on or for IOS
1 year ago
Martin Kroeker
cea9df3643
Update Cray compiler options and calling convention
1 year ago
Chip Kerchner
36bd3eeddf
Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power).
1 year ago
Martin Kroeker
b0346e72f4
update names of loongarch64 targets for cross-compilation
1 year ago
Martin Kroeker
9c707dc6b9
Update dynamic arch list to new target scheme
1 year ago
Martin Kroeker
b4495a8fb8
Merge branch 'develop' into arm64_cmake_small_matrix_opt
1 year ago
Martin Kroeker
4f00f02567
Do not add -mabi flags for Loongson when the compiler is flang
1 year ago
Martin Kroeker
de421b7764
Merge pull request #4904 from XiWeiGu/la64_cross_cmake
LoongArch64: Enable cmake cross-compilation
1 year ago
Martin Kroeker
0228d36211
move -fopenmp to CFLAGS
1 year ago
gxw
7087b0a7d0
ARM64: Enable SMALL_MATRIX_OPT when compiling with CMake
1 year ago