Martin Kroeker
92557c6feb
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
56591ee298
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
c05bcae293
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
2167e983f6
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
157a6e019b
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
1755fa561a
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
5640d59049
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
139945e587
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
1a52cbf9ea
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
35f8ccb87d
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
feaf7fa165
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
e6f7a017ba
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
79be0d36c1
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
e0a1724410
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
442e53b7df
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
b137520315
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
3004e37511
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
8c70d80293
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
5436ba12f6
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
d5318d7e6f
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
2204003435
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
3371bd85a3
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
edd50d54cf
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
acec942bea
Set visibility of internal symbols to hidden
3 years ago
Martin Kroeker
142f13fffd
Update trsv_U.c
3 years ago
Martin Kroeker
0223e30118
Update trsv_L.c
3 years ago
Martin Kroeker
0c4ce5e721
Update ztrsv_L.c
3 years ago
Martin Kroeker
13dbc94329
Update ztrsv_U.c
3 years ago
Martin Kroeker
111c88b911
Update ztrsv_U.c
3 years ago
Martin Kroeker
9ddfe82741
Update ztrsv_L.c
3 years ago
Martin Kroeker
e10802018d
Update trsv_U.c
3 years ago
Martin Kroeker
85657fc3a0
Update trsv_L.c
3 years ago
Martin Kroeker
451993e8a9
Set visibility attribute on internal symbols to hidden
3 years ago
Martin Kroeker
7f0b11fbc1
Exclude some complex drivers when NO_LAPACK is set
4 years ago
Martin Kroeker
5f6a609253
Add sbgemv
4 years ago
Chen, Guobing
a7b1f9b1bb
Implementation of BF16 based gemv
1. Add a new API -- sbgemv to support bfloat16 based gemv
2. Implement a generic kernel for sbgemv
3. Implement an avx512-bf16 based kernel for sbgemv
Signed-off-by: Chen, Guobing <guobing.chen@intel.com>
5 years ago
Martin Kroeker
887e00fd7f
Adapt for supporting only a subset of variable types
5 years ago
Martin Kroeker
3287848c8f
Support building only seleced types
5 years ago
Martin Kroeker
806f89166e
Make ARMV7 compile with xcode and add a CI job for it ( #2537 )
* Add an ARMV7 iOS build on Travis
* thread_local appears to be unavailable on ARMV7 iOS
* Add no-thumb option for ARMV7 IOS build to get it to accept DMB ISH
* Make local labels in macros of nrm2_vfpv3.S compatible with the xcode assembler
6 years ago
Martin Kroeker
8617d75548
Revert "Avoid taking root of negative number in symv_thread.c"
6 years ago
Sebastian Berg
6355c25dde
Avoid taking root of negative number in symv_thread.c
This is similar to fixes in gh-1929, but there was one remaining
occurance of this type of pattern in the driver/level2/*_thread.c
files.
6 years ago
Martin Kroeker
45333d5793
Fix error introduced during cleanup
7 years ago
Martin Kroeker
78d9910236
Correct range_n limiting
same bug as seen in #1388 , somehow missed in corresponding PR #1389
7 years ago
Martin Kroeker
5a720cf9ca
Re-enable loop unrolling in trmv and remove the scary warning
fixes #1748 as that half of the fix for #1332 appears to have been an overreaction on my part.
7 years ago
Martin Kroeker
368d14f8c8
Fix harmless typo
fixes #1872
7 years ago
Martin Kroeker
0427277cef
Allow optimization for small m, large n only if it can be made threadsafe
otherwise the introduction of a static array in 8e5a108 to improve #532 breaks concurrent calls from multiple threads as seen in #1844
7 years ago
Martin Kroeker
cc9500db41
Merge pull request #1403 from brada4/develop
Address few more warnings
8 years ago
Andrew
bfc2a88594
remove unused buffer
8 years ago
Martin Kroeker
177b78c8b4
Issue1388 ( #1389 )
* Calculation of chunk range limits was ignoring num_cpu
bug introduced by me in #1262 - should fix #1388
* Calculation of range limits was ignoring num_cpu
bug introduced by me in #1262
* Calculation of chunk range limits was ignoring num_cpu
bug introduced by me in #1262
* Calculation of chunk range limits was ignoring num_cpu
bug introduced by me in #1262
* Calculation of chunk range limits was ignoring num_cpu
bug introduced by me in #1262
* Calculation of chunk range limits was ignoring num_cpu
bug introduced by me in #1262
8 years ago
Andrew
281a2b952f
warning cleanup ( #1380 )
* dead increments in driver/level2
* dead increments in kernel/generic
* part dead increments in kernel/x86_64
8 years ago