Commit Graph

  • *
  • *
  • |\
  • * \
  • |\ \
  • | | *
  • | * |
  • | * |
  • |/ /
  • | *
  • |/
  • *
  • |\
  • * \
  • |\ \
  • * \ \
  • |\ \ \
  • | | | *
  • | |_|/
  • |/| |
  • | | *
  • | |/
  • |/|
  • | *
  • | *
  • | *
  • | *
  • | *
  • | *
  • | *
  • |/
  • *
  • |\
  • * \
  • |\ \
  • | | *
  • | | *
  • | | *
  • | | *
  • | | *
  • | |/
  • |/|
  • | *
  • |/
  • | *
  • | *
  • | *
  • | *
  • | *
  • | *
  • |/
  • *
  • |\
  • | *
  • |/
  • *
  • |\
  • | *
  • |/
  • *
  • |\
  • * \
  • |\ \
  • | * |
  • | * |
  • | * |
  • | * |
  • | * |
  • | * |
  • | * |
  • | * |
  • | * |
  • | * |
  • | * |
  • | * |
  • | * |
  • | * |
  • | * |
  • | | *
  • | | *
  • | |/
  • |/|
  • * |
  • |\ \
  • | * |
  • |/ /
  • | | *
  • | |/
  • |/|
  • | | *
  • | | *
  • * | |
  • | | *
  • | | | *
  • | | | *
  • | | |/
  • | |/|
  • | | *
  • | | *
  • * | |
  • |/ /
  • | | *
  • | | *
  • | | *
  • | |/
  • |/|
  • | *
  • | *
  • | *
  • | *
  • | *
  • | *
  • | *
  • | *
  • | *
  • | *
  • |/
  • *
  • |\
  • | *
  • |/
  • *
  • |\
  • * \
  • |\ \
  • | * |
  • | * |
  • * | |
  • |\ \ \
  • | | | *
  • | | | *
  • | * | |
  • | * | |
  • | * | |
  • | * | |
  • | * | |
  • |/ / /
  • * | |
  • |\ \ \
  • | | | *
  • | | | *
  • | | | *
  • | | | *
  • 7d4a22157 Remove unused TEMP2 and reshuffle to leave x18 unused (reserved on OSX) by Martin Kroeker 2021-09-17 09:18:25 +0200
  • d3a9c7ef7 Merge pull request #3382 from rafaelcfsousa/rafael/cwarnings by Martin Kroeker 2021-09-17 09:15:16 +0200
  • 72c26f4f7 Merge pull request #3381 from martin-frbg/issue3371 by Martin Kroeker 2021-09-16 07:14:49 +0200
  • 0e8b4adf2 (refs/pull/3382/head) Remove unused commented code (#if directive) by Rafael Cardoso Fernandes Sousa 2021-09-15 22:18:48 +0000
  • 8dfa61a61 (refs/pull/3381/head) Initialize abs_mask1 with itself to silence a gcc warning by Martin Kroeker 2021-09-15 22:11:35 +0200
  • 99aa10b3f Initialize abs_mask1 with itself to silence a gcc warning by Martin Kroeker 2021-09-15 22:10:43 +0200
  • b751edf62 Fix unused variable warnings on Power by Rafael Cardoso Fernandes Sousa 2021-09-15 13:36:07 -0500
  • fa8bf5776 Merge pull request #3380 from martin-frbg/structwarn by Martin Kroeker 2021-09-15 07:19:09 +0200
  • 80346b881 Merge pull request #3379 from martin-frbg/issue3369-2 by Martin Kroeker 2021-09-15 07:18:57 +0200
  • 13182b280 Merge pull request #3378 from martin-frbg/issue3368-2 by Martin Kroeker 2021-09-15 07:18:38 +0200
  • dd09f0173 (refs/pull/3380/head) Remove extraneous qualifiers from struct definition by Martin Kroeker 2021-09-14 21:52:26 +0200
  • ce036a2fc (refs/pull/3379/head) Add casts by Martin Kroeker 2021-09-14 21:41:53 +0200
  • ddf106f76 (refs/pull/3378/head) Add dedicated entries for BFLOAT16 kernels by Martin Kroeker 2021-09-14 16:17:18 +0200
  • c35739db5 Add separate entries for BFLOAT16 functions and fix missing cblas_xerbla by Martin Kroeker 2021-09-14 16:15:57 +0200
  • 2f8220d75 Add sbgemm by Martin Kroeker 2021-09-14 16:14:43 +0200
  • 5f6a60925 Add sbgemv by Martin Kroeker 2021-09-14 16:13:57 +0200
  • e02df9fc5 Propagate BUILD_BFLOAT16 to CFLAGS by Martin Kroeker 2021-09-14 16:12:27 +0200
  • 1c0a8a714 Add defaults for SBGEMV kernels by Martin Kroeker 2021-09-14 16:10:58 +0200
  • 5e4f1e367 Remove BFLOAT16 from the task list of GenerateNamedObject by Martin Kroeker 2021-09-14 16:09:46 +0200
  • af8843875 Merge pull request #3376 from martin-frbg/issue3370 by Martin Kroeker 2021-09-12 00:01:31 +0200
  • d1ee2e9c7 Merge pull request #3375 from martin-frbg/issue3369 by Martin Kroeker 2021-09-12 00:01:20 +0200
  • 0925dfe2c (refs/pull/3376/head) One instance of kernel_4x1 is used even on SKX by Martin Kroeker 2021-09-11 15:30:19 +0200
  • 1085775bc really remove the unused variable by Martin Kroeker 2021-09-11 15:05:55 +0200
  • 7d873a329 Add ifdefs around conditionally used functions by Martin Kroeker 2021-09-11 14:38:47 +0200
  • ef2471203 Move a conditionally used variable by Martin Kroeker 2021-09-11 14:37:44 +0200
  • 20581bf30 Remove unused variable by Martin Kroeker 2021-09-11 14:36:27 +0200
  • d17238599 (refs/pull/3375/head) Add casts by Martin Kroeker 2021-09-11 13:38:28 +0200
  • 3742611aa (refs/pull/3374/head) remove BFLOAT16 from FLOAT_TYPES to avoid creating bogus objects for unimplemented functions by Martin Kroeker 2021-09-10 13:38:16 +0200
  • acfa01312 Add separate handling for sbgemm by Martin Kroeker 2021-09-10 13:35:39 +0200
  • 5bfdbd1a3 Fix generation of sbgemv drivers by Martin Kroeker 2021-09-10 11:32:43 +0200
  • 0a4a07ff5 fix cmake generation of bfloat16 interfaces by Martin Kroeker 2021-09-10 11:18:18 +0200
  • 1f4fe48be add defaults for sbgemv kernels by Martin Kroeker 2021-09-10 11:17:16 +0200
  • f8d57df56 pass BUILD_BFLOAT16 to the compiler by Martin Kroeker 2021-09-10 11:15:24 +0200
  • 3e8c44869 Merge pull request #3367 from RajalakshmiSR/makesyntax by Martin Kroeker 2021-09-08 20:19:39 +0200
  • 7f4aa106f (refs/pull/3367/head) Fixing syntax error in makefile by Rajalakshmi Srinivasaraghavan 2021-09-08 07:04:13 -0500
  • a6ed4f0d3 Merge pull request #3366 from martin-frbg/azure-ubuntu by Martin Kroeker 2021-09-08 13:57:35 +0200
  • b858e6547 (refs/pull/3366/head) migrate from deprecated ubuntu-16.04 vmImage by Martin Kroeker 2021-09-08 10:51:59 +0200
  • d3d660172 Merge pull request #3365 from martin-frbg/travis-lx by Martin Kroeker 2021-09-07 16:24:33 +0200
  • da5bd8b5e Merge pull request #3364 from guowangy/bf16-cooperlake by Martin Kroeker 2021-09-07 13:57:40 +0200
  • 045ed5c91 (refs/pull/3364/head) sbgemm: fix build error in BFLOAT16 disabled by Wangyang Guo 2021-09-07 23:37:08 +0800
  • 4289cf048 sbgemm: avoid falling into SGEMM_KERNEL_DIRECT by Wangyang Guo 2021-09-07 18:34:26 +0800
  • 59a1114d0 sbgemm: cooperlake: tuning for small matrix by Wangyang Guo 2021-09-07 18:12:40 +0800
  • 682d66555 sbgemm: cooperlake: implement ncopy_16 by Wangyang Guo 2021-08-20 22:01:00 +0800
  • beccb83b1 sbgemm: cooperlake: add n24 kernel for tcopy_4 by Wangyang Guo 2021-08-19 19:46:08 +0800
  • 5fcacad32 sbgemm: cooperlake: implement tcopy_4 by Wangyang Guo 2021-08-19 00:08:06 +0800
  • bb1c4fa5b sbgemm: cooperlake: prefetch A & B by Wangyang Guo 2021-08-18 21:17:08 +0800
  • 7a2d1601e sbgemm: cooperlake: unroll core loop by 2 by Wangyang Guo 2021-08-17 23:21:19 +0800
  • 45fdf951b sbgemm: cooperlake: reorder ptr increase for performance by Wangyang Guo 2021-08-17 22:08:24 +0800
  • cece3541a sbgemm: cooperlake: fix bug in m64n12 by Wangyang Guo 2021-08-17 21:13:29 +0800
  • 8356a604f sbgemm: cooperlake: tuning for block params by Wangyang Guo 2021-08-17 19:35:40 +0800
  • 9df0953cd sbgemm: cooperlake: kernel works for NN by Wangyang Guo 2021-08-16 19:39:24 +0800
  • 2ec9f3a8a sbgemm: cooperlake: change kernel size to 16x4 by Wangyang Guo 2021-08-12 01:46:49 +0000
  • ef8f5fecc sbgemm: cooperlake: implement sbgemm_tcopy_32 by Wangyang Guo 2021-08-10 06:14:45 +0000
  • 4c294336e sbgemm: cooperlake: add dummy source files by Wangyang Guo 2021-08-10 03:23:45 +0000
  • 8c68b6f26 (refs/pull/3365/head) Update .travis.yml by Martin Kroeker 2021-09-07 11:40:40 +0200
  • 349fb4910 Disable the remaining x86_64 job on Travis by Martin Kroeker 2021-09-07 11:19:51 +0200
  • 7c72c45be Merge pull request #3363 from martin-frbg/fixpr3360 by Martin Kroeker 2021-09-07 08:02:53 +0200
  • 32fee8603 (refs/pull/3363/head) Correct misplaced ifdef lines by Martin Kroeker 2021-09-06 23:44:20 +0200
  • 272acc3f1 (refs/pull/3362/head) mention TARGET=GENERIC by Isuru Fernando 2021-09-06 11:58:05 -0500
  • ffb0793ff Update azure-pipelines.yml by Martin Kroeker 2021-09-05 23:04:46 +0200
  • b89b82d7c Update azure-pipelines.yml by Martin Kroeker 2021-09-05 20:56:13 +0200
  • 72f3ce5f0 Add NO_AVX=1 fallbacks to newer generation x86_64 for completeness (#3360) by Martin Kroeker 2021-09-05 20:35:48 +0200
  • 80e67e4b9 Update azure-pipelines.yml by Martin Kroeker 2021-09-05 19:06:48 +0200
  • 5e7eaff3f (refs/pull/3360/head) Update .travis.yml by Martin Kroeker 2021-09-05 17:13:41 +0200
  • a55515338 Add NO_AVX=1 fallbacks to newer generation x86_64 for completeness by Martin Kroeker 2021-09-05 13:35:34 +0200
  • 311a2b69c Update azure-pipelines.yml by Martin Kroeker 2021-09-04 23:47:38 +0200
  • 735faf881 Update azure-pipelines.yml by Martin Kroeker 2021-09-04 23:31:39 +0200
  • af19cda65 Add "recursive" option for IBM xlf compiler (#3359) by Martin Kroeker 2021-09-04 18:26:59 +0200
  • 3590039ef (refs/pull/3359/head) fix misedit by Martin Kroeker 2021-09-04 14:46:53 +0200
  • 726a185f4 Add "recursive" option for IBM xlf by Martin Kroeker 2021-09-04 13:22:15 +0200
  • cd1a307ed Add correct "recursive" option for xlf (from reference-lapack issue 606) by Martin Kroeker 2021-09-04 13:18:04 +0200
  • fdc031729 Update azure-pipelines.yml by Martin Kroeker 2021-09-04 13:00:12 +0200
  • e7b970f72 Update azure-pipelines.yml by Martin Kroeker 2021-09-04 12:42:56 +0200
  • 948964301 Update azure-pipelines.yml by Martin Kroeker 2021-09-04 09:30:35 +0200
  • 41ee2b3d8 Update azure-pipelines.yml by Martin Kroeker 2021-09-03 22:49:14 +0200
  • 215310f4f Update azure-pipelines.yml by Martin Kroeker 2021-09-03 22:21:26 +0200
  • 7406eabee Update azure-pipelines.yml by Martin Kroeker 2021-09-03 16:27:50 +0200
  • b0d3c6dbc Update azure-pipelines.yml by Martin Kroeker 2021-09-03 14:39:52 +0200
  • c5122dd77 Add DYNAMIC_LIST as the Windows build is surprisingly slow by Martin Kroeker 2021-09-02 18:40:24 +0200
  • 184d362e4 Update azure-pipelines.yml by Martin Kroeker 2021-09-02 17:26:11 +0200
  • 279077313 Add a Windows build with mingw gcc and make by Martin Kroeker 2021-09-02 14:37:34 +0200
  • a3e80069f Merge pull request #3355 from martin-frbg/smallgemmcr by Martin Kroeker 2021-09-02 00:27:23 +0200
  • f1e330597 (refs/pull/3355/head) Add workaround for Windows10 macro name clash by Martin Kroeker 2021-09-01 21:36:50 +0200
  • 3cdfe3361 Merge pull request #3352 from martin-frbg/3321-2n by Martin Kroeker 2021-09-01 13:52:40 +0200
  • 47171e4b9 Merge pull request #3354 from nsait-linaro/fix_gmemm_align_win_arm by Martin Kroeker 2021-08-31 21:47:21 +0200
  • 7cddbf99b (refs/pull/3354/head) Make explicit conversion condition on _WIN64 flag by Niyas Sait 2021-08-31 14:36:44 +0100
  • d1ed72fa8 [win/arm64]: Explicit casting for GMEMM_DEFAULT_ALIGN to create 64-bit value by Niyas Sait 2021-08-24 06:09:29 +0100
  • 806221440 Merge pull request #3353 from guowangy/bf16-small-matrix-cooperlake by Martin Kroeker 2021-08-30 20:39:51 +0200
  • cd10d1c03 (refs/pull/3352/head) Fix typo by Martin Kroeker 2021-08-30 14:38:28 +0200
  • 2db1a99ac Clean up debug messages by Martin Kroeker 2021-08-30 14:21:25 +0200
  • 619588fba (refs/pull/3353/head) sbgemm: remove unnecessary b0 files by Wangyang Guo 2021-08-30 17:48:11 +0800
  • f39301935 sbgemm: cooperlake: make sure hot buffer aligned to 64 by Wangyang Guo 2021-08-13 18:43:41 +0800
  • 2e44ca013 sbgemm: add missing cblas_sbgemm definition by Wangyang Guo 2021-08-13 00:51:24 +0800
  • 7d27b182f sbgemm: cooperlake: enable SBGEMM by small matrix path by Wangyang Guo 2021-08-12 06:10:51 +0000
  • 1d83ca4bc Small Matrix: support BFLOAT16 data type by Wangyang Guo 2021-08-12 03:14:18 +0000
  • bec9d9f63 Merge pull request #3335 from guowangy/small-matrix-latest by Martin Kroeker 2021-08-29 22:33:33 +0200
  • 89fc5b8f4 Fix unmap logic by Martin Kroeker 2021-08-29 19:50:24 +0200
  • 7fd12a5e6 Add likely() hints for gcc by Martin Kroeker 2021-08-29 13:54:51 +0200
  • 2ba9a567a Fix typo by Martin Kroeker 2021-08-28 17:14:59 +0200
  • b4b952eec Add auxiliary tracking space for thread buffer frees too by Martin Kroeker 2021-08-28 17:03:53 +0200