7d4a22157
Remove unused TEMP2 and reshuffle to leave x18 unused (reserved on OSX) by
2021-09-17 09:18:25 +0200
d3a9c7ef7
Merge pull request #3382 from rafaelcfsousa/rafael/cwarnings by
2021-09-17 09:15:16 +0200
72c26f4f7
Merge pull request #3381 from martin-frbg/issue3371 by
2021-09-16 07:14:49 +0200
0e8b4adf2
(refs/pull/3382/head)
Remove unused commented code (#if directive) by
2021-09-15 22:18:48 +0000
8dfa61a61
(refs/pull/3381/head)
Initialize abs_mask1 with itself to silence a gcc warning by
2021-09-15 22:11:35 +0200
99aa10b3f
Initialize abs_mask1 with itself to silence a gcc warning by
2021-09-15 22:10:43 +0200
b751edf62
Fix unused variable warnings on Power by
2021-09-15 13:36:07 -0500
fa8bf5776
Merge pull request #3380 from martin-frbg/structwarn by
2021-09-15 07:19:09 +0200
80346b881
Merge pull request #3379 from martin-frbg/issue3369-2 by
2021-09-15 07:18:57 +0200
13182b280
Merge pull request #3378 from martin-frbg/issue3368-2 by
2021-09-15 07:18:38 +0200
dd09f0173
(refs/pull/3380/head)
Remove extraneous qualifiers from struct definition by
2021-09-14 21:52:26 +0200
ce036a2fc
(refs/pull/3379/head)
Add casts by
2021-09-14 21:41:53 +0200
ddf106f76
(refs/pull/3378/head)
Add dedicated entries for BFLOAT16 kernels by
2021-09-14 16:17:18 +0200
c35739db5
Add separate entries for BFLOAT16 functions and fix missing cblas_xerbla by
2021-09-14 16:15:57 +0200
2f8220d75
Add sbgemm by
2021-09-14 16:14:43 +0200
5f6a60925
Add sbgemv by
2021-09-14 16:13:57 +0200
e02df9fc5
Propagate BUILD_BFLOAT16 to CFLAGS by
2021-09-14 16:12:27 +0200
1c0a8a714
Add defaults for SBGEMV kernels by
2021-09-14 16:10:58 +0200
5e4f1e367
Remove BFLOAT16 from the task list of GenerateNamedObject by
2021-09-14 16:09:46 +0200
af8843875
Merge pull request #3376 from martin-frbg/issue3370 by
2021-09-12 00:01:31 +0200
d1ee2e9c7
Merge pull request #3375 from martin-frbg/issue3369 by
2021-09-12 00:01:20 +0200
0925dfe2c
(refs/pull/3376/head)
One instance of kernel_4x1 is used even on SKX by
2021-09-11 15:30:19 +0200
1085775bc
really remove the unused variable by
2021-09-11 15:05:55 +0200
7d873a329
Add ifdefs around conditionally used functions by
2021-09-11 14:38:47 +0200
ef2471203
Move a conditionally used variable by
2021-09-11 14:37:44 +0200
20581bf30
Remove unused variable by
2021-09-11 14:36:27 +0200
d17238599
(refs/pull/3375/head)
Add casts by
2021-09-11 13:38:28 +0200
3742611aa
(refs/pull/3374/head)
remove BFLOAT16 from FLOAT_TYPES to avoid creating bogus objects for unimplemented functions by
2021-09-10 13:38:16 +0200
acfa01312
Add separate handling for sbgemm by
2021-09-10 13:35:39 +0200
5bfdbd1a3
Fix generation of sbgemv drivers by
2021-09-10 11:32:43 +0200
0a4a07ff5
fix cmake generation of bfloat16 interfaces by
2021-09-10 11:18:18 +0200
1f4fe48be
add defaults for sbgemv kernels by
2021-09-10 11:17:16 +0200
f8d57df56
pass BUILD_BFLOAT16 to the compiler by
2021-09-10 11:15:24 +0200
3e8c44869
Merge pull request #3367 from RajalakshmiSR/makesyntax by
2021-09-08 20:19:39 +0200
7f4aa106f
(refs/pull/3367/head)
Fixing syntax error in makefile by
2021-09-08 07:04:13 -0500
a6ed4f0d3
Merge pull request #3366 from martin-frbg/azure-ubuntu by
2021-09-08 13:57:35 +0200
b858e6547
(refs/pull/3366/head)
migrate from deprecated ubuntu-16.04 vmImage by
2021-09-08 10:51:59 +0200
d3d660172
Merge pull request #3365 from martin-frbg/travis-lx by
2021-09-07 16:24:33 +0200
da5bd8b5e
Merge pull request #3364 from guowangy/bf16-cooperlake by
2021-09-07 13:57:40 +0200
045ed5c91
(refs/pull/3364/head)
sbgemm: fix build error in BFLOAT16 disabled by
2021-09-07 23:37:08 +0800
4289cf048
sbgemm: avoid falling into SGEMM_KERNEL_DIRECT by
2021-09-07 18:34:26 +0800
59a1114d0
sbgemm: cooperlake: tuning for small matrix by
2021-09-07 18:12:40 +0800
682d66555
sbgemm: cooperlake: implement ncopy_16 by
2021-08-20 22:01:00 +0800
beccb83b1
sbgemm: cooperlake: add n24 kernel for tcopy_4 by
2021-08-19 19:46:08 +0800
5fcacad32
sbgemm: cooperlake: implement tcopy_4 by
2021-08-19 00:08:06 +0800
bb1c4fa5b
sbgemm: cooperlake: prefetch A & B by
2021-08-18 21:17:08 +0800
7a2d1601e
sbgemm: cooperlake: unroll core loop by 2 by
2021-08-17 23:21:19 +0800
45fdf951b
sbgemm: cooperlake: reorder ptr increase for performance by
2021-08-17 22:08:24 +0800
cece3541a
sbgemm: cooperlake: fix bug in m64n12 by
2021-08-17 21:13:29 +0800
8356a604f
sbgemm: cooperlake: tuning for block params by
2021-08-17 19:35:40 +0800
9df0953cd
sbgemm: cooperlake: kernel works for NN by
2021-08-16 19:39:24 +0800
2ec9f3a8a
sbgemm: cooperlake: change kernel size to 16x4 by
2021-08-12 01:46:49 +0000
ef8f5fecc
sbgemm: cooperlake: implement sbgemm_tcopy_32 by
2021-08-10 06:14:45 +0000
4c294336e
sbgemm: cooperlake: add dummy source files by
2021-08-10 03:23:45 +0000
8c68b6f26
(refs/pull/3365/head)
Update .travis.yml by
2021-09-07 11:40:40 +0200
349fb4910
Disable the remaining x86_64 job on Travis by
2021-09-07 11:19:51 +0200
7c72c45be
Merge pull request #3363 from martin-frbg/fixpr3360 by
2021-09-07 08:02:53 +0200
32fee8603
(refs/pull/3363/head)
Correct misplaced ifdef lines by
2021-09-06 23:44:20 +0200
272acc3f1
(refs/pull/3362/head)
mention TARGET=GENERIC by
2021-09-06 11:58:05 -0500
ffb0793ff
Update azure-pipelines.yml by
2021-09-05 23:04:46 +0200
b89b82d7c
Update azure-pipelines.yml by
2021-09-05 20:56:13 +0200
72f3ce5f0
Add NO_AVX=1 fallbacks to newer generation x86_64 for completeness (#3360) by
2021-09-05 20:35:48 +0200
80e67e4b9
Update azure-pipelines.yml by
2021-09-05 19:06:48 +0200
5e7eaff3f
(refs/pull/3360/head)
Update .travis.yml by
2021-09-05 17:13:41 +0200
a55515338
Add NO_AVX=1 fallbacks to newer generation x86_64 for completeness by
2021-09-05 13:35:34 +0200
311a2b69c
Update azure-pipelines.yml by
2021-09-04 23:47:38 +0200
735faf881
Update azure-pipelines.yml by
2021-09-04 23:31:39 +0200
af19cda65
Add "recursive" option for IBM xlf compiler (#3359) by
2021-09-04 18:26:59 +0200
3590039ef
(refs/pull/3359/head)
fix misedit by
2021-09-04 14:46:53 +0200
726a185f4
Add "recursive" option for IBM xlf by
2021-09-04 13:22:15 +0200
cd1a307ed
Add correct "recursive" option for xlf (from reference-lapack issue 606) by
2021-09-04 13:18:04 +0200
fdc031729
Update azure-pipelines.yml by
2021-09-04 13:00:12 +0200
e7b970f72
Update azure-pipelines.yml by
2021-09-04 12:42:56 +0200
948964301
Update azure-pipelines.yml by
2021-09-04 09:30:35 +0200
41ee2b3d8
Update azure-pipelines.yml by
2021-09-03 22:49:14 +0200
215310f4f
Update azure-pipelines.yml by
2021-09-03 22:21:26 +0200
7406eabee
Update azure-pipelines.yml by
2021-09-03 16:27:50 +0200
b0d3c6dbc
Update azure-pipelines.yml by
2021-09-03 14:39:52 +0200
c5122dd77
Add DYNAMIC_LIST as the Windows build is surprisingly slow by
2021-09-02 18:40:24 +0200
184d362e4
Update azure-pipelines.yml by
2021-09-02 17:26:11 +0200
279077313
Add a Windows build with mingw gcc and make by
2021-09-02 14:37:34 +0200
a3e80069f
Merge pull request #3355 from martin-frbg/smallgemmcr by
2021-09-02 00:27:23 +0200
f1e330597
(refs/pull/3355/head)
Add workaround for Windows10 macro name clash by
2021-09-01 21:36:50 +0200
3cdfe3361
Merge pull request #3352 from martin-frbg/3321-2n by
2021-09-01 13:52:40 +0200
47171e4b9
Merge pull request #3354 from nsait-linaro/fix_gmemm_align_win_arm by
2021-08-31 21:47:21 +0200
7cddbf99b
(refs/pull/3354/head)
Make explicit conversion condition on _WIN64 flag by
2021-08-31 14:36:44 +0100
d1ed72fa8
[win/arm64]: Explicit casting for GMEMM_DEFAULT_ALIGN to create 64-bit value by
2021-08-24 06:09:29 +0100
806221440
Merge pull request #3353 from guowangy/bf16-small-matrix-cooperlake by
2021-08-30 20:39:51 +0200
cd10d1c03
(refs/pull/3352/head)
Fix typo by
2021-08-30 14:38:28 +0200
2db1a99ac
Clean up debug messages by
2021-08-30 14:21:25 +0200
619588fba
(refs/pull/3353/head)
sbgemm: remove unnecessary b0 files by
2021-08-30 17:48:11 +0800
f39301935
sbgemm: cooperlake: make sure hot buffer aligned to 64 by
2021-08-13 18:43:41 +0800
2e44ca013
sbgemm: add missing cblas_sbgemm definition by
2021-08-13 00:51:24 +0800
7d27b182f
sbgemm: cooperlake: enable SBGEMM by small matrix path by
2021-08-12 06:10:51 +0000
1d83ca4bc
Small Matrix: support BFLOAT16 data type by
2021-08-12 03:14:18 +0000
bec9d9f63
Merge pull request #3335 from guowangy/small-matrix-latest by
2021-08-29 22:33:33 +0200
89fc5b8f4
Fix unmap logic by
2021-08-29 19:50:24 +0200
7fd12a5e6
Add likely() hints for gcc by
2021-08-29 13:54:51 +0200
2ba9a567a
Fix typo by
2021-08-28 17:14:59 +0200
b4b952eec
Add auxiliary tracking space for thread buffer frees too by
2021-08-28 17:03:53 +0200