475 Commits (develop)

Author SHA1 Message Date
  Martin Kroeker 7c1839899e Increase assumed L2 sizes for RISCV X280 / ZVL256B and for SVE-capable ARM64 8 months ago
  Martin Kroeker c504aedca1 Merge pull request #5400 from Mousius/neoversev2-target 9 months ago
  Martin Kroeker 2f89a5970e fix NeoverseV2 typo 9 months ago
  Chris Sidebottom 87247daadc Add NEOVERSEV2 target support 9 months ago
  Martin Kroeker a5b55f6fe3 remove CBLAS restriction on GEMM_GEMV forwarding 9 months ago
  Martin Kroeker 82954ba4ca Update ?GEMM-to-?GEMV forwarding settings 9 months ago
  Martin Kroeker b24212f5df fix numbers 9 months ago
  Martin Kroeker 6ff06f5483 Add cross-compilation data for RISCV64 targets 9 months ago
  Chris Sidebottom 947d7af4c9 Fix CMake references to bscal and bgemv 9 months ago
  Chris Sidebottom 72d2ebb4dd Re-add GEMV fallback for Level3 9 months ago
  Chris Sidebottom e105411460 Add infrastructure for bgemv/bscal 9 months ago
  Chris Sidebottom 66d9185ebe Fix CMake support 10 months ago
  Chris Sidebottom f95e7b0e32 Add infrastructure for BGEMM 10 months ago
  Chris Sidebottom 552e1c7a7a Correct compiler flags for NEOVERSEV1 target 10 months ago
  Usui, Tetsuzo 14107e37d9 Add parallel laed3 10 months ago
  Martin Kroeker 560fa88c96 Add cross-build parameters for Ampere One 10 months ago
  Martin Kroeker 55bb5ef867 Add compiler options for Ampere One 10 months ago
  Srangrang 0a967797a1 Add FP16 support for RISCV 11 months ago
  Martin Kroeker f2022c23ac Remove sve capability from NeoverseN1 and specify CortexX2/A?10 as arm8.4a 11 months ago
  Martin Kroeker d9369bda1e Update and amend parameters for Neoverse cpus 1 year ago
  Ruiyang Wu 1b0c0f00e9 CMake: Avoid mixed OpenMP linkage 1 year ago
  Ruiyang Wu 02fd1df10b CMake: Pass `OpenMP` compiler and linker flags through CMake targets 1 year ago
  Martin Kroeker b34235ca66 Fix inclusion of deprecated interfaces and cgesvdq/strsyl3 1 year ago
  Martin Kroeker f1fa370579 fix missing endif 1 year ago
  Martin Kroeker 6d1444be3a Add ARM64 options for NVIDIA HPC 1 year ago
  Vaisakh K V f66ca05b31 Merge branch 'develop' into topic/sgemm_direct_sme1 1 year ago
  Vaisakh K V d23eb3b93e Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API 1 year ago
  Martin Kroeker 877d5a5be6 Add -O2 to flang flags when building on WoA in Release mode 1 year ago
  Martin Kroeker 262018f14c Merge pull request #5092 from XiWeiGu/la64_fixed_cmake 1 year ago
  Martin Kroeker 180ba5e7d0 Merge pull request #5069 from tingboliao/dev_rotm_20250107 1 year ago
  gxw 1ebcbdbab3 LoongArch64: Fixed the issue of using the old-style TARGET in cmake builds 1 year ago
  Martin Kroeker 111c9b0733 Add translations for C_COMPILER and OSNAME 1 year ago
  tingbo.liao 3c8df6358f Further rearranged the rotm kernel for the different architectures. 1 year ago
  Martin Kroeker fbf594b62f Guard against empty CMAKE_Fortran_COMPILER_ID 1 year ago
  Martin Kroeker d78fbe425c Assume no underline suffixes on symbols when compiling with ifx on Windows 1 year ago
  Martin Kroeker 30188a55d1 Don't assume underlined symbols for ifx; make cpuid.S inclusion conditional 1 year ago
  Martin Kroeker 32319a33ac Add options for Intel oneAPI 2025.0 ifx on Windows 1 year ago
  Matthew Thompson c4e8bac5a5 Fix indent 1 year ago
  Matthew Thompson be19966d3b Fixes for NAG CMake 1 year ago
  Matthew Thompson 2eaf285de5 Use F_COMPILER name 1 year ago
  Matthew Thompson a8b1705dbd CMake build has wrong PIC flag for NAG 1 year ago
  Martin Kroeker 57a51d74c9 translate CMAKE_SYSTEM_NAME in compilations on or for IOS 1 year ago
  Martin Kroeker cea9df3643 Update Cray compiler options and calling convention 1 year ago
  Chip Kerchner 36bd3eeddf Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). 1 year ago
  Martin Kroeker b0346e72f4 update names of loongarch64 targets for cross-compilation 1 year ago
  Martin Kroeker 9c707dc6b9 Update dynamic arch list to new target scheme 1 year ago
  Martin Kroeker b4495a8fb8 Merge branch 'develop' into arm64_cmake_small_matrix_opt 1 year ago
  Martin Kroeker 4f00f02567 Do not add -mabi flags for Loongson when the compiler is flang 1 year ago
  Martin Kroeker de421b7764 Merge pull request #4904 from XiWeiGu/la64_cross_cmake 1 year ago
  Martin Kroeker 0228d36211 move -fopenmp to CFLAGS 1 year ago