185 Commits (6da558d2abe339328718ccce7ca7b1b16a8fcae7)

Author SHA1 Message Date
  wernsaar 6da558d2ab changes for compatibility with Pathscale compiler 12 years ago
  wernsaar 5118a7f4d1 small optimizations on dgemm_kernel for Piledriver 12 years ago
  wernsaar e172b70ea2 added cgemm_kernel for Piledriver 12 years ago
  wernsaar 1cf4b974b2 added zgemm_kernel for Piledriver 12 years ago
  wernsaar 7bccff1512 added sgemm_kernel for PILEDRIVER 12 years ago
  wernsaar 2840d56aeb added dgemm_kernel for Piledriver 12 years ago
  Zhang Xianyi 6c4a7d0828 Import AMD Piledriver DGEMM kernel generated by AUGEM. 12 years ago
  wernsaar 067e8417fd removed unnessesary instructions from zgemm_kernel_2x2_bulldozer.S 12 years ago
  wernsaar a82da3d069 removed unnessesary instructions 12 years ago
  Zhang Xianyi 1569bf14f8 Refs #282. Fixed zgemv_n typo bug on Win64. 12 years ago
  Zhang Xianyi c0159d44a3 Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop 12 years ago
  wernsaar c17a850c1c modified KERNEL.BULLDOZER 12 years ago
  wernsaar 099853fff6 added dtrsm_kernel_RN_8x2_bulldozer.S 12 years ago
  wernsaar 44d23881b5 dtrsm_kernel_LT_8x2_bulldozer.S performance optimization 12 years ago
  Zhang Xianyi 32fb6b9bb2 Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop 12 years ago
  wernsaar aaeb8eaecd modified dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  wernsaar 8aeec32ea0 modified dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  wernsaar 87fc9de572 added dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  wernsaar 564aa60fec removed dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  wernsaar f645665dd6 fixed bug in dgemv_t_bulldozer.S 12 years ago
  wernsaar e45a347cd2 repaired trmm bug in sgemm_kernel_16x2_bulldozer.S 12 years ago
  wernsaar 99727ac013 repaired trmm bug in cgemm_kernel_4x2_bulldozer.S 12 years ago
  wernsaar 6e0a2fbc0c repaired trmm bug in zgemm_kernel_2x2_bulldozer.S 12 years ago
  wernsaar 0a22f99c58 repaired trmm bug in dgemm_kernel_8x2_bulldozer.S 12 years ago
  wernsaar cff70a666d added generic trmm kernels and modified Makefile.L3 12 years ago
  wernsaar 84bd0aabaa added dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  Zhang Xianyi 72b1edaf1b Merge branch 'develop' into bulldozer 12 years ago
  wangqian 1b3b9e841d Fixed a computational error in zgemm_kernel_4x4_sandy.S file. 12 years ago
  Zhang Xianyi 2ed0f6ab60 Fixed the typo. 12 years ago
  Zhang Xianyi 886cbaf4e4 Support AMD Piledriver by bulldozer kernels. 12 years ago
  Zhang Xianyi 57944538b6 Use ALIGN_5 instead of .algin 32 in assembly kernel. Added ALIGN_5 for 32-bit OSX. 12 years ago
  Zhang Xianyi fa916a0fac Fixed #238 bug in lsame on x86. 12 years ago
  Zhang Xianyi fb298b34ae Merge pull request #235 from wernsaar/develop 12 years ago
  wernsaar 16012767f4 added dcopy_bulldozer.S 12 years ago
  wernsaar bcbac31b47 added ddot_bulldozer.S 12 years ago
  wernsaar 8dc0c72583 added daxpy_bulldozer.S 12 years ago
  wernsaar 89405a1a0b cleanup of dgemm_ncopy_8_bulldozer.S 12 years ago
  wernsaar 4f2b12b8a8 added dgemv_t_bulldozer.S 12 years ago
  Zhang Xianyi 646e168d26 Merge pull request #233 from wernsaar/develop 12 years ago
  wernsaar 93dbbe1fb8 added dgemm_ncopy_8_bulldozer.S 12 years ago
  wernsaar a135f5d9ed added gemm_tcopy_2_bulldozer.S 12 years ago
  wernsaar d0b6299b13 added dgemm_tcopy_8_bulldozer.S 12 years ago
  wernsaar 9e58dd509e added gemm_ncopy_2_bulldozer.S 12 years ago
  wernsaar 7c8227101b cleanup of dgemv_n_bulldozer.S and optimization of inner loop 12 years ago
  wernsaar f67fa62851 added dgemv_n_bulldozer.S 12 years ago
  Zhang Xianyi cd1d473ba0 Merge pull request #230 from wernsaar/develop 12 years ago
  wernsaar 0ded1fcc1c performance optimizations in sgemm_kernel_16x2_bulldozer.S 12 years ago
  wernsaar a789b588cd added cgemm_kernel_4x2_bulldozer.S 12 years ago
  wernsaar 8eaa04acbb added zgemm_kernel_2x2_bulldozer.S 12 years ago
  wernsaar d854b30ae6 Added UNROLL values for 3M to getarch_2nd.c, Makefile.system and Makefile.L3 12 years ago