Zhang Xianyi
05bb391c3a
Refs #330 . Fixed the compatible issue with clang on Mac OSX.
12 years ago
Zhang Xianyi
9b5be29886
Refs #310 . Fixed Segfault bug on nehalem when Julia calling dgeqrt3 on OSX.
Please also check JuliaLang/julia#4099
Julia test script:
A=rand(256, 256)
qrfact(A)
I found this was a bug in kernel/x86_64/dgemm_ncopy_8.S.
However, I cannot use gdb with julia. Thus, this is a walkaround fix.
12 years ago
wernsaar
53eaf41901
added support for HASWELL
12 years ago
wernsaar
9423f980f6
modified trsm kernel
12 years ago
wernsaar
c6156b2ef2
added trsm kernels from origin
12 years ago
wernsaar
034a5b2083
modified zsymv
12 years ago
wernsaar
27d4234d4d
merged symv
12 years ago
wernsaar
402d6e91db
Merge remote branch 'origin/develop' into armv7
12 years ago
wernsaar
b3254eecaf
Merge remote branch 'origin/haswell' into develop
12 years ago
wernsaar
d910404f00
Merge remote branch 'origin/piledriver' into develop
12 years ago
wernsaar
ffe70b1fdc
modified Makefile.L3
12 years ago
wernsaar
0b6e13b689
Merge remote branch 'origin/develop' into haswell
12 years ago
wernsaar
e09dc279a2
Merge remote branch 'origin/develop' into piledriver
12 years ago
wernsaar
4be4db590c
Merge remote branch 'origin/develop' into armv7
12 years ago
wernsaar
5c648a8984
Merge remote branch 'origin/develop' into haswell
12 years ago
wernsaar
c44dc4dd3c
Merge remote branch 'origin/develop' into piledriver
12 years ago
wernsaar
9d3fae15a8
Merge branch 'develop' into armv7
12 years ago
wernsaar
2d3c884294
added complex gemv kernels for ARMV6 and ARMV7
12 years ago
wernsaar
d54a061713
optimized gemv_n_vfp.S
12 years ago
wernsaar
86afb47e83
added optimized ctrmm kernel for ARMV6
12 years ago
wernsaar
42a4dff056
added optimized ztrmm kernel for ARMV6
12 years ago
wernsaar
5bc322a66c
optimized strmm kernel for ARMV6
12 years ago
wernsaar
dec7ad0dfd
optimized dtrmm kernel for ARMV7
12 years ago
wernsaar
274304bd03
add optimized cgemm kernel for ARMV6
12 years ago
wernsaar
5007a534c4
optimized zgemm kernel for ARMV6
12 years ago
wernsaar
a537d7d8d7
optimized zgemm_kernel_2x2_vfp.S
12 years ago
wernsaar
b42145834f
optimized sgemm kernel for ARMV6
12 years ago
wernsaar
3d5e792c72
optimized sgemm kernel for ARMV6
12 years ago
wernsaar
a9bd12da2c
optimized dgemm kernel for ARMV6
12 years ago
wernsaar
697e198e8a
added zgemm_kernel for ARMV6
12 years ago
wernsaar
36b0f7fe1d
added optimized gemv_t kernel for ARMV6
12 years ago
wernsaar
d2b20c5c51
add optimized axpy kernel
12 years ago
wernsaar
fe5f46c330
added experimental support for ARMV8
12 years ago
wernsaar
25c6050593
add single and double precision gemv_n kernel for ARMV6
12 years ago
wernsaar
12e02a00e0
added ncopy kernels for ARMV6
12 years ago
wernsaar
29a3196f56
added optimized sgemm and strmm kernel for ARMV6
12 years ago
wernsaar
8776a73773
added optimized dgemm and dtrmm kernel for ARMV6
12 years ago
wernsaar
7e84acd3e8
fixed bug in SAVE macros, that are not found by any test routine
12 years ago
wernsaar
33d3ab6e09
small optimizations for zgemv kernels
12 years ago
wernsaar
9a0f978929
added nrm2 kernel for ARMV6
12 years ago
wernsaar
7f210587f0
renamed some ncopy and tcopy files
12 years ago
wernsaar
9f0a3a35b3
removed obsolete file sdot_vfpv3.S
12 years ago
wernsaar
dbae93110b
added sdot_vfp.S
12 years ago
wernsaar
19cd5c64a2
renamed swap_vfpv3.S to swap_vfp.S
12 years ago
wernsaar
9adf87495e
renamed some dot kernels
12 years ago
wernsaar
440db4cdda
delete rot_vfpv3.S
12 years ago
wernsaar
cd93cae5a7
renamed rot_vfpv3.S to rot_vfp.S
12 years ago
wernsaar
8565afb3c2
renamed asum_vfpv3.S to asum_vfp.S
12 years ago
wernsaar
5bf7cf8d67
renamed scal_vfpv3.S to scal_vfp.S
12 years ago
wernsaar
29a005c635
renamed iamax assembler kernel
12 years ago