4833 Commits (small_matrices)
 

Author SHA1 Message Date
  Zhang Xianyi 77460ac255 Fix gemm_batch bug for SMALL_MATRIX_OPT=1. 5 years ago
  Zhang Xianyi 88e6806e3f Init cblas_?gemm_batch implementation. 5 years ago
  Xianyi Zhang 4130d1732e Refs #2587 fix small matrix c/zgemm bug. 5 years ago
  Xianyi Zhang 255b6dd0fa Merge branch 'develop' into small_matrices 5 years ago
  Xianyi Zhang 741d6c5cb8 Refs #2587 Add small matrix optimization reference kernel for c/zgemm. 5 years ago
  Martin Kroeker 514a3d7d63
Merge pull request #2798 from kadler/aix-cpuid 5 years ago
  Kevin Adler 085aae8bdb
Fix compile error on AIX cpuid detection 5 years ago
  Xianyi Zhang 712ca43069 Change a1b0 gemm to b0 gemm. 5 years ago
  Martin Kroeker 5c6c2cd4f6
Merge pull request #2775 from Guobing-Chen/Fix_OMP_threads_specify 5 years ago
  Martin Kroeker e54be4ba1c
Merge pull request #2792 from pkubaj/patch-1 5 years ago
  pkubaj 48a1364e10
Add aliases for armv6, armv7 5 years ago
  Chen, Guobing 0c1c903f1e Fix OMP num specify issue 5 years ago
  Martin Kroeker a073fa870e
Merge pull request #2791 from martin-frbg/issue2787 5 years ago
  Martin Kroeker b2053239fc
Fix mssing dummy parameter (imag part of alpha) of zdot_thread_function 5 years ago
  Martin Kroeker b11bb6e728
Merge pull request #2790 from martin-frbg/issue2789 5 years ago
  Martin Kroeker 1840bc5b52
Add OpenMP dependency to pkgconfig file if needed 5 years ago
  Martin Kroeker 7c0977c267
Add OpenMP dependency to pkgconfig file if needed 5 years ago
  Martin Kroeker fb3d80c42a
Merge pull request #78 from xianyi/develop 5 years ago
  Martin Kroeker 9ee21a0a39
Merge pull request #2780 from Guobing-Chen/CPL_build_support 5 years ago
  Martin Kroeker bd3207b4b4
Update system.cmake 5 years ago
  Martin Kroeker b8ebfc9335
Update system.cmake 5 years ago
  Martin Kroeker 7c1986640b
fallback from cooperlake to skylake if gcc<10 5 years ago
  Martin Kroeker 71d33c952d
Typo fix 5 years ago
  Martin Kroeker 6a3c074786
-march=cooperlake requires gcc10 5 years ago
  Martin Kroeker 430f741b30
-march=cooperlake requires gcc10 5 years ago
  Martin Kroeker 6f4dc7445d
Fix typo 5 years ago
  Martin Kroeker 81fbe8d088
-march=cooperlake only available in gcc >= 10 5 years ago
  Martin Kroeker bb9cf766f5
make march=cooperlake option conditional on gcc >= 10.1 5 years ago
  Martin Kroeker 75eeb265d7
[WIP] Refactor the driver code for direct SGEMM (#2782) 5 years ago
  Martin Kroeker 2c72972570
Merge pull request #2785 from albertziegenhagel/always-generate-pkg-config 5 years ago
  Albert Ziegenhagel 6b731d917f Do not require pkg-config to generate the *.pc file 5 years ago
  Martin Kroeker 5dcf47cd97
Merge pull request #2784 from martin-frbg/issue2783 5 years ago
  Martin Kroeker aa286e301b
Add typedef for bfloat16 if needed 5 years ago
  Martin Kroeker 9f0ef9cdfc
Merge pull request #77 from xianyi/develop 5 years ago
  Martin Kroeker 6bfc66663c
revert 5 years ago
  Martin Kroeker a8c6fb9e1c
revert 5 years ago
  Martin Kroeker 5ec8f716cf
revert 5 years ago
  Martin Kroeker 82f8a0aeba
Update .drone.yml 5 years ago
  Martin Kroeker d57d503c15
Update Makefile 5 years ago
  Martin Kroeker 37ac23e8a3
Add simple MT sgemm precision test and INTERFACE64 build 5 years ago
  Martin Kroeker 6a93e3b2ba
Add simple sgemm preicsion test 5 years ago
  Martin Kroeker 47ce1dd08f
Update gemm64.cpp 5 years ago
  Martin Kroeker f5fcc5baec
Add trivial gemm test for multithread consistency 5 years ago
  Chen, Guobing e740c4873d Enable COOPERLAKE build target 5 years ago
  Martin Kroeker efdd237a91
Add a dedicated POWER9 build to the Travis CI (#2774) 5 years ago
  Martin Kroeker 4573cb2f43
Merge pull request #2765 from martin-frbg/issue2760 5 years ago
  Martin Kroeker 2a4bb797db
Merge pull request #2773 from martin-frbg/issue2770 5 years ago
  Martin Kroeker cbbe38bb88
Merge pull request #2772 from mhillenibm/s390x_gemm_tuning 5 years ago
  Martin Kroeker 619343278d
Fix mishandling of NO_CBLAS=0 and NO_LAPACKE=0 5 years ago
  Martin Kroeker fee361ae64
fix another source of NO_CBLAS=0 surprise 5 years ago