Shivraj Patil
57df7956ee
Added CGEMM, ZGEMM, STRMM, DTRMM, CTRMM, ZTRMM. Updated macros in SGEMM, DGEMM, STRMM.
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
9 years ago
Zhang Xianyi
4a30a2584a
Merge pull request #897 from ksraste/develop
STRSM optimized for MSA
9 years ago
Werner Saar
f04af36ad0
Merge pull request #898 from wernsaar/develop
added experimental support for optimized lapack fortran functions
9 years ago
Kaustubh Raste
011431b9d7
STRSM optimized for MSA
Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com>
9 years ago
Kaustubh Raste
c8a7860eb3
STRSM optimized
Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com>
9 years ago
Zhang Xianyi
2daad2bcb5
Merge pull request #893 from biddisco/develop
Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PRO…
9 years ago
John Biddiscombe
053044ae4d
Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PROJECT_BINARY_DIR
If OpenBLAS is built using add_subdirectory(OpenBlas) as part of another project
then the paths set by CMAKE_XXX_DIR are relative to the parent project
and not the OpenBLAS project.
9 years ago
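The difference this commit relies on can be sketched in a few lines of CMake. This is a minimal illustration, not the actual OpenBLAS build file: the project name, minimum version, and include path are assumptions made for the example. `CMAKE_SOURCE_DIR` and `CMAKE_BINARY_DIR` always refer to the top-level build, while `PROJECT_SOURCE_DIR` and `PROJECT_BINARY_DIR` refer to the directory of the most recent `project()` call.

```cmake
# OpenBlas/CMakeLists.txt (illustrative excerpt, assumed layout)
cmake_minimum_required(VERSION 2.8.12)
project(OpenBLAS C)

# When this directory is pulled in by a parent project through
# add_subdirectory(OpenBlas), the two pairs of variables diverge:
message(STATUS "CMAKE_SOURCE_DIR   = ${CMAKE_SOURCE_DIR}")    # parent project's source root
message(STATUS "PROJECT_SOURCE_DIR = ${PROJECT_SOURCE_DIR}")  # directory holding this file
message(STATUS "CMAKE_BINARY_DIR   = ${CMAKE_BINARY_DIR}")    # parent project's build root
message(STATUS "PROJECT_BINARY_DIR = ${PROJECT_BINARY_DIR}")  # build directory for this subproject

# Hypothetical include path: resolves the same way in a standalone
# build and when embedded in another project.
include_directories("${PROJECT_SOURCE_DIR}")
```

In a standalone build both pairs hold the same values, so the replacement should not change behavior there. It only matters once the library is embedded in another project.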
Aleksey Kuleshov
fca66262c4
mips64/axpy: fix error when INCY == 0
9 years ago
Werner Saar
412bcd187a
optimized dtrsm_logic_LT_16x4_power8.S and dtrsm_macros_LT_16x4_power8.S
9 years ago
Werner Saar
bd06b246cc
Merge pull request #890 from wernsaar/develop
optimized dtrsm_kernel_LT for POWER8
9 years ago
Werner Saar
8b140220c8
optimized dtrsm_kernel_LT for POWER8
9 years ago
Werner Saar
8fb5a1aaff
added optimized dtrsm_LT kernel for POWER8
9 years ago
Kaustubh Raste
ad9f317870
STRSM optimization for MIPS P5600 and I6400 using MSA
Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com>
9 years ago
Shivraj Patil
c4ba40e308
SGEMM optimization for MIPS P5600 and I6400 using MSA. Unrolled k loop in DGEMM kernel function
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
9 years ago
Zhang Xianyi
7a19065369
Merge pull request #878 from ksraste/develop
DTRSM bug fix for MIPS P5600 and I6400
9 years ago
Werner Saar
6a2bde7a2d
optimized dgemm and dgetrf for POWER8
9 years ago
Kaustubh Raste
d7cbc7ac13
DTRSM bug fix for MIPS P5600 and I6400
Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com>
9 years ago
Werner Saar
88011f625d
Merge pull request #876 from wernsaar/develop
optimized dgemm on power8 for 20 threads
9 years ago
Werner Saar
8310d4d3f7
optimized dgemm for 20 threads
9 years ago
Kaustubh Raste
edb5980c13
DTRSM optimization for MIPS P5600 and I6400 using MSA
Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com>
9 years ago
Shivraj Patil
085cf236c2
conflict resolved by syncing with 'xianyi:develop'
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
9 years ago
Shivraj Patil
b7b3d8ec8e
DGEMM optimization for MIPS P5600 and I6400 using MSA
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
9 years ago
Zhang Xianyi
cd7af5260a
Merge pull request #847 from sva-img/develop
MIPS P5600(32 bit) and I6400(64 bit) cores support added.
9 years ago
Werner Saar
56948dbf0f
optimized dgemm for POWER8
9 years ago
Werner Saar
0d0c6f7d7d
optimized dgemm for POWER8
9 years ago
Werner Saar
298b13bba4
updated some kernel files for EXCAVATOR
9 years ago
Werner Saar
78b05f6476
bugfix for EXCAVATOR and DYNAMIC_ARCH
9 years ago
Werner Saar
a3da10662f
added sgemm_tcopy_8_power8.S
9 years ago
Werner Saar
d46f07bb4e
added cgemm_tcopy_8_power8.S
9 years ago
Werner Saar
879a51165f
Optimized zgemm and tested zgemm again
9 years ago
Shivraj Patil
2c3dfe2bf3
MIPS P5600(32 bit) and I6400(64 bit) cores support added.
Separated mips and mips64 files.
Configurations support for mips 32 bit.
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
9 years ago
Werner Saar
9276c9012f
Optimized sgemm and dgemm and tested again.
9 years ago
wernsaar
6fbca2a4a1
Merge pull request #845 from wernsaar/develop
optimized sgemm for power8
9 years ago
Werner Saar
0001260f4b
optimized sgemm
9 years ago
Werner Saar
3c6294ca3d
added optimized sgemm_tcopy for power8
9 years ago
Zhang Xianyi
f24d5307cf
Refs #834 . Fix zgemv config bug on Steamroller.
9 years ago
Werner Saar
8037d78eed
bugfix for arm scal.c and zscal.c
9 years ago
wernsaar
0a4276bc2f
Merge pull request #837 from wernsaar/develop
updated zgemm- and ztrmm-kernel for POWER8
9 years ago
Werner Saar
e173c51c04
updated zgemm- and ztrmm-kernel for POWER8
9 years ago
Werner Saar
9c42f0374a
Updated cgemm- and sgemm-kernel for POWER8 SMP
9 years ago
Zhang Xianyi
d4380c1fe4
Refs xianyi/OpenBLAS-CI#10 , Fix sdot for scipy test_iterative.test_convergence test failure on AMD bulldozer and piledriver.
9 years ago
Werner Saar
a51102e9b7
bugfixes for sgemm- and cgemm-kernel
9 years ago
Werner Saar
c5b1fbcb2e
updated optimized cgemm- and ctrmm-kernel for POWER8
9 years ago
Werner Saar
d4c0330967
updated cgemm- and ctrmm-kernel for POWER8
9 years ago
Werner Saar
6a9bbfc227
updated sgemm- and strmm-kernel for POWER8
9 years ago
Werner Saar
68a69c5b50
added optimized dgemv_n kernel for POWER8
9 years ago
Werner Saar
c2464a7c4a
added optimized casum kernel for POWER8
9 years ago
Werner Saar
294f933869
added optimized zasum kernel for POWER8
9 years ago
Werner Saar
f59c9bd6ef
added optimized sasum kernel for POWER8
9 years ago
Werner Saar
c53be46d78
added optimized dasum kernel for POWER8
9 years ago