Ashwin Sekhar T K
bc4e96311b
Optimized zgemm kernel for CORTEXA57
10 years ago
Ashwin Sekhar T K
82b791bf1a
Optimized cgemm kernel for CORTEXA57
Also, add a generic ztrmm 4x4 kernel
10 years ago
Ashwin Sekhar T K
262c1479a7
Optimized dgemm kernel for CORTEXA57
10 years ago
Ashwin Sekhar T K
3d8be7b6ad
Improve the sgemm kernel for CORTEXA57
10 years ago
Ashwin Sekhar T K
8055add702
Optimized gemv kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
f801abd58b
Optimized swap kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
5ee916d1fd
Optimized scal kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
f4cadff039
Optimized rot kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
c29ea30dcd
Optimized nrm2 kernels for CORTEXA57
10 years ago
Ashwin Sekhar T K
d1b2ec5eba
Optimized dot kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
183b2e6cdc
Optimized copy kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
b5143de005
Optimized axpy kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
9af52f1af8
Optimized asum kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
25a9a1da48
Optimized iamax kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
64df449fe7
Optimized amax kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
c425f99e36
Adding arm64 target CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Zhang Xianyi
e5b96e55a7
Fix build bug for ARM64.
11 years ago
Benedikt Huber
58c90d5937
# The first commit's message is:
Optimizations for APM's xgene-1 (aarch64).
1) general system updates to support armv8 better. Make all did not work, one needed to supply TARGET=ARMV8.
2) sgem 4x4 kernel in assembler using SIMD, and configuration changes to use it.
3) strmm 4x4 kernel in C. Since the sgem kernel does 4x4, the trmm kernel must also do 4xN.
Added Dave Nuechterlein to the contributors list.
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
12 years ago
wernsaar
fe5f46c330
added experimental support for ARMV8
12 years ago