9317 Commits (1804ff58d7967c528f6006fe2f10ea3f3c2d3d06)
 

Author SHA1 Message Date
  Martin Kroeker 1804ff58d7 fix missing initialization 10 months ago
  Martin Kroeker 906b9df316 fix missing initialization 10 months ago
  Martin Kroeker 0ea173ec8c Merge pull request #5304 from martin-frbg/fixgemmtr_if 10 months ago
  Martin Kroeker 5e393f207c fix source file used for sbgemmt/sbgemmtr 10 months ago
  Martin Kroeker dbd5643d37 Merge pull request #5302 from martin-frbg/zscal_mips_3 10 months ago
  Martin Kroeker e338d34ce1 fix path 10 months ago
  Martin Kroeker d36093d084 temporarily change default C/ZSCAL to the non-asm implementation 10 months ago
  Martin Kroeker cc4b04a684 Merge pull request #5301 from martin-frbg/zscal_mips_2 10 months ago
  Martin Kroeker b3c90564d7 resync with the generic arm version for inf/nan handling 10 months ago
  Martin Kroeker 6bdc7f9eb7 Merge pull request #5300 from martin-frbg/fixup5296 10 months ago
  Martin Kroeker 63272b6c82 Merge pull request #5299 from martin-frbg/x86_64-ssezscal 10 months ago
  Martin Kroeker 73af02b89f use dummy2 as Inf/NAN handling flag 10 months ago
  Martin Kroeker 549a9f1dbb Disable the default SSE kernels for CSCAL/ZSCAL for now 10 months ago
  Martin Kroeker ca1ce84ee5 Merge pull request #5298 from martin-frbg/fixup5281 10 months ago
  Martin Kroeker 58eeb9041c fix handling of dummy2 10 months ago
  Martin Kroeker 7c77537b25 Merge pull request #5297 from martin-frbg/zscal_x86_sparc 10 months ago
  Martin Kroeker 63287e1855 Merge pull request #5296 from martin-frbg/zscal_riscv 10 months ago
  Martin Kroeker d2855d3dab Merge pull request #5285 from martin-frbg/zscal_zarch 10 months ago
  Martin Kroeker 1408be5fe0 Merge pull request #5282 from martin-frbg/zscal_power 10 months ago
  Martin Kroeker 1589d0b21e Merge pull request #5281 from martin-frbg/zscal_arm64 10 months ago
  Martin Kroeker a86419fb66 Merge pull request #5280 from martin-frbg/zscal_x86_64 10 months ago
  Martin Kroeker 11ff18bb0f Merge pull request #5081 from XiWeiGu/kernel_generic_fixed_cscal_zscal 10 months ago
  Martin Kroeker 2e2691b34b Merge pull request #5078 from XiWeiGu/la64_fixed_cscal_zscal 10 months ago
  Martin Kroeker f4194fc65f Merge branch 'develop' into la64_fixed_cscal_zscal 11 months ago
  Martin Kroeker e12132abd4 Use generic C/ZSCAL kernels to address inf/nan handling for now 11 months ago
  Martin Kroeker 1cefbea7ea Use generic SCAL kernels to address inf/nan handling for now 11 months ago
  Martin Kroeker f18b7a46bf add dummy2 flag handling for inf/nan agnostic zeroing 11 months ago
  Martin Kroeker fe220a0d7d Merge pull request #5291 from guoyuanplct/develop 11 months ago
  Martin Kroeker bbdc265798 Merge pull request #5294 from arnej27959/arnej/fix-arm64-register 11 months ago
  Arne Juul 5442aff218 Accumulate results in output register explicitly 11 months ago
  guoyuanplct 83fcab7578 Merge branch 'develop' of https://github.com/guoyuanplct/OpenBLAS into develop 11 months ago
  guoyuanplct 2ae019161a fixed the performance problem in RISCV64_ZVL256 when OPENBLAS_K is small 11 months ago
  Martin Kroeker 02267d86f5 Merge pull request #5288 from guoyuanplct/develop 11 months ago
  guoyuanplct d2003dc886 del lines 11 months ago
  guoyuanplct 45fd2d9b07 Optimized the axpby function. 11 months ago
  Martin Kroeker fb8dc8ff5c Add dummy2 flag handling 11 months ago
  Martin Kroeker cf06250d36 add handling of dummy2 flag 11 months ago
  Martin Kroeker 28f8fdaf0f support flag for NaN/Inf handling and fix scaling of NaN/Inf values 11 months ago
  Martin Kroeker 669c847ceb support extra flag for NaN handling 11 months ago
  Martin Kroeker 0163143fdd Merge pull request #5278 from martin-frbg/fixup5276 11 months ago
  Martin Kroeker 20f2ba0141 Move declaration of i for pre-C99 compilers 11 months ago
  Martin Kroeker e2e6a4d90a Merge pull request #5276 from nakagawa-fj/gemm_2d_thread_partitioning 11 months ago
  Martin Kroeker 9ef5995c22 Merge pull request #5277 from martin-frbg/fixmingw32 11 months ago
  Martin Kroeker 42b7d1f897 Fix addressing of alpha in CBLAS 11 months ago
  Martin Kroeker bd573a9d38 Expand mingw32 gfortran workaround to all versions after 14.1 11 months ago
  Masato Nakagawa 2351a98005 Update 2D thread-partitioned GEMM for M << N case. 11 months ago
  Martin Kroeker a5f701c4ab Merge pull request #5274 from martin-frbg/issue5247 11 months ago
  Martin Kroeker 4ca76d9de4 Expressly provide a shared libs option 11 months ago
  Martin Kroeker 846a5436e7 Merge pull request #5273 from martin-frbg/issue5259 11 months ago
  Martin Kroeker 8779eac3b8 Do not add a 64 suffix to the library name if the user-provided suffix already contains it 11 months ago