766 Commits (586fc577d001e6fe387dd08ac99d9db4fca09cf4)

Author SHA1 Message Date
  Mateusz Sokół e37ec24a96 BLD: Finish porting `ctest` 1 year ago
  Rohit Goswami 33c48a70ce MAINT: Cleanup some flags and add GFORT condition 1 year ago
  Mateusz Sokół 3bc55f15f0 BLD: `test` port 1 year ago
  Mateusz Sokół a5ac9e7f1a BLD: Generate config.h file 1 year ago
  Mateusz Sokół cfd11eba92 BUG: Fix `except` dict feature 1 year ago
  Rohit Goswami 6a7e571075 MAINT: Fix minor missed file 1 year ago
  Rohit Goswami 17b164feee MAINT: Cleanup and lint a bit 1 year ago
  Rohit Goswami 5cadc67801 MAINT: Add more symbols for the test 1 year ago
  Rohit Goswami 86aa6b3a87 MAINT: Quick and dirty working set of symbols 2 years ago
  Rohit Goswami 571d2f3be3 ENH: Add TRMM_KERNEL bindings 2 years ago
  Rohit Goswami 854aecce82 BLD: Add L3 driver symbols 2 years ago
  Rohit Goswami e5564ec450 MAINT: Add syrk 2 years ago
  Rohit Goswami fe38ef70ed MAINT: Rework to use the ext_l3 mapping 2 years ago
  Rohit Goswami 26b98f6a10 MAINT: Use syrk as an exception 2 years ago
  Rohit Goswami 2f17fd08b7 ENH: Start adding L3 driver symbols 2 years ago
  Rohit Goswami ca8e18eda2 MAINT: Start adding L3 2 years ago
  Rohit Goswami 5267a1ec63 ENH: Compile all L2 drivers 2 years ago
  Rohit Goswami 86d32c7a14 ENH: Add in the rest of the level2 symbols 2 years ago
  Rohit Goswami 2fe1f31161 MAINT: Start working on kernels and driver L2 2 years ago
  yamazaki-mitsufumi 821ef34635 Add A64FX to the list of CPUs supported by DYNAMIC_ARCH 1 year ago
  Martin Kroeker a815594fd1 Merge pull request #4801 from markdryan/markdryan/riscv-dynamic-arch 1 year ago
  Martin Kroeker a373d0f107 Improve the error message for thread creation failure 1 year ago
  Mark Ryan 3b715e6162 Add autodetection for riscv64 1 year ago
  Martin Kroeker d0b9948b23 Guard against invalid thread_status.queue 1 year ago
  Martin Kroeker 7e9a4ba427 Merge pull request #4741 from shivammonaka/Pthread_Scalability_Improvement 1 year ago
  Martin Kroeker 9b2a0c79cb Add Zhaoxin KX7000 1 year ago
  shivammonaka 9e22d70957 Dynamic locking in Pthread Backend to allow multiple BLAS calls to be executed parallelly 1 year ago
  Martin Kroeker db070a9223 add gemm_batch drivers 1 year ago
  Martin Kroeker d0794f88dc add gemm_batch driver 1 year ago
  Martin Kroeker 0073affe63 Merge pull request #4693 from goplanid/locks-improvement 2 years ago
  Martin Kroeker 6ca9ffa7f5 Merge pull request #4655 from yamazakimitsufumi/update_2d_thread_distribution 2 years ago
  Deeksha Goplani 0dc80a5c8d locks improvement 2 years ago
  Martin Kroeker 8da6f7e5f2 Merge pull request #4686 from XiWeiGu/loongarch64_dgemm_kernel_16x6 2 years ago
  gxw 637c650f4f loongarch64: Add buffer offset for target LOONGSON3R5 2 years ago
  Martin Kroeker 5500b4ab26 Merge pull request #4680 from theAeon/develop 2 years ago
  Martin Kroeker f0f1ff7820 fix HUGETLB allocation for TLS mode as well 2 years ago
  Andrew Robbins edfe1aa471 Expose whether locking is enabled in get_config 2 years ago
  Martin Kroeker dc99b61380 sort unwanted interdependencies of alloc_shm and alloc_hugetlb 2 years ago
  Martin Kroeker ddcd7d6fa8 Merge branch 'develop' into Threading_Callback 2 years ago
  yamazaki-mitsufumi 51ab1903e7 Expanding the scop of 2D thread distribution 2 years ago
  gxw d8c4ea8793 loongarch: Optimizing the performance of the GEMM on servers 2 years ago
  shivammonaka 7102367fde Introduced callback to Pthread, Win32 and OpenMP backend 2 years ago
  Mark Seminatore b0ad8a78ff code to fix lost work in case of re-entrant calls to exec_blas_async() 2 years ago
  Martin Kroeker 88b5330ae7 Restore outer loop of blas_buffer_inuse setup 2 years ago
  shivammonaka d49ebc54e1 Merge branch 'shivam-develop' into shivam-Locks 2 years ago
  shivammonaka bc191015e3 Using OpenMP locks with NUM_PARALLEL 2 years ago
  Mark Seminatore b29fd48998 Merge branch 'develop' into win_tidy 2 years ago
  Mark Seminatore 98c56a7314 more cleanup 2 years ago
  Chip Kerchner d408ecedba Add environment variable to display coretype for dynamic arch. 2 years ago
  Chip Kerchner ac6b4b7aa4 Make sure CPU ID works for all POWER_10 conditions 2 years ago