Martin Kroeker
|
d2df5bd72c
|
Update param.h
|
4 years ago |
Martin Kroeker
|
af4d4e55d1
|
Update param.h
|
4 years ago |
Martin Kroeker
|
f5e7fe0ec4
|
Update param.h
|
4 years ago |
Martin Kroeker
|
93cec29c8f
|
Update param.h
|
4 years ago |
Martin Kroeker
|
fca8259062
|
Update param.h
|
4 years ago |
Martin Kroeker
|
656b17b4bf
|
Update param.h
|
4 years ago |
Martin Kroeker
|
c684cae97c
|
Update param.h
|
4 years ago |
Martin Kroeker
|
a7a05b78fe
|
Update param.h
|
4 years ago |
Martin Kroeker
|
49878cad51
|
Update param.h
|
4 years ago |
Martin Kroeker
|
bb05550b13
|
Update .travis.yml
|
4 years ago |
Martin Kroeker
|
699c0a0365
|
Update param.h
|
4 years ago |
Martin Kroeker
|
3ce413d1db
|
Update param.h
|
4 years ago |
Martin Kroeker
|
1049dfefa1
|
Update param.h
|
4 years ago |
Martin Kroeker
|
3e409b156d
|
Update param.h
|
4 years ago |
Martin Kroeker
|
4217096c92
|
Update param.h
|
4 years ago |
Martin Kroeker
|
ceb535c1ea
|
Update param.h
|
4 years ago |
Martin Kroeker
|
2b3d2ef789
|
Update param.h
|
4 years ago |
Martin Kroeker
|
17376df24f
|
Update param.h
|
4 years ago |
Martin Kroeker
|
2cc76cc843
|
Update param.h
|
4 years ago |
Martin Kroeker
|
1489e977bf
|
Update param.h
|
4 years ago |
Martin Kroeker
|
0a92a783b1
|
Update param.h
|
4 years ago |
Martin Kroeker
|
4224f7ee5d
|
Update param.h
|
4 years ago |
Martin Kroeker
|
98548457e8
|
Update param.h
|
4 years ago |
Martin Kroeker
|
eda222a144
|
Update .travis.yml
|
4 years ago |
Martin Kroeker
|
fa7e4d86fc
|
try 512/512 for neoverse dgemm
|
4 years ago |
Martin Kroeker
|
4b5c24b45f
|
double neoverse dgemm p&q again
|
4 years ago |
Martin Kroeker
|
8a1e00bba8
|
increase dgemm pq for neoverse
|
4 years ago |
Martin Kroeker
|
7eca8d1a77
|
run dgemm benchmark on neoverse
|
4 years ago |
Martin Kroeker
|
e76ff6a44e
|
Merge pull request #3385 from martin-frbg/update_readme
Update README.md
|
4 years ago |
Martin Kroeker
|
5c537a5de0
|
Update README.md
|
4 years ago |
Martin Kroeker
|
5edd88c919
|
Merge pull request #3384 from martin-frbg/issue3383
Modify ARMV8 kernels to leave x18 unused as it is reserved on OSX
|
4 years ago |
Martin Kroeker
|
90cc944625
|
Move alphaI to x22 to leave x18 unused (reserved on OSX)
|
4 years ago |
Martin Kroeker
|
590fbff06e
|
move alpha to x19/x20 to leave x18 unused for OSX
|
4 years ago |
Martin Kroeker
|
380940271b
|
Move temp to x21 to leave x18 unused (reserved on OSX)
|
4 years ago |
Martin Kroeker
|
7d75177446
|
Move temp to x21 to leave x18 unused (reserved on OSX)
|
4 years ago |
Martin Kroeker
|
0a4ac4b585
|
Use x21 for I to leave x18 unused (reserved on OSX)
|
4 years ago |
Martin Kroeker
|
7d4a221579
|
Remove unused TEMP2 and reshuffle to leave x18 unused (reserved on OSX)
|
4 years ago |
Martin Kroeker
|
d3a9c7ef7f
|
Merge pull request #3382 from rafaelcfsousa/rafael/cwarnings
[POWER] Remove unused variable warnings.
|
4 years ago |
Martin Kroeker
|
72c26f4f7f
|
Merge pull request #3381 from martin-frbg/issue3371
Silence compiler warnings about uninitialized variables
|
4 years ago |
Rafael Cardoso Fernandes Sousa
|
0e8b4adf22
|
Remove unused commented code (#if directive)
|
4 years ago |
Martin Kroeker
|
8dfa61a61c
|
Initialize abs_mask1 with itself to silence a gcc warning
|
4 years ago |
Martin Kroeker
|
99aa10b3ff
|
Initialize abs_mask1 with itself to silence a gcc warning
actual initialization is via the _mm_cmpeq_ep18, which I've seen claimed to be the fastest way to set an xmm register to all 1s
|
4 years ago |
Rafael Cardoso Fernandes Sousa
|
b751edf624
|
Fix unused variable warnings on Power
|
4 years ago |
Martin Kroeker
|
fa8bf57768
|
Merge pull request #3380 from martin-frbg/structwarn
Remove extraneous qualifiers from struct definition
|
4 years ago |
Martin Kroeker
|
80346b8813
|
Merge pull request #3379 from martin-frbg/issue3369-2
Add casts to fix compiler warnings for SkylakeX sasum/dasum
|
4 years ago |
Martin Kroeker
|
13182b2801
|
Merge pull request #3378 from martin-frbg/issue3368-2
Rework generation of BFLOAT16 objects in CMAKE builds and fix missing CBLAS_XERBLA
|
4 years ago |
Martin Kroeker
|
dd09f0173e
|
Remove extraneous qualifiers from struct definition
|
4 years ago |
Martin Kroeker
|
ce036a2fc0
|
Add casts
|
4 years ago |
Martin Kroeker
|
ddf106f769
|
Add dedicated entries for BFLOAT16 kernels
|
4 years ago |
Martin Kroeker
|
c35739db5e
|
Add separate entries for BFLOAT16 functions and fix missing cblas_xerbla
|
4 years ago |