2032 Commits (925d4e1dc69a94de5733c03b022b7c2c7521935e)
Author SHA1 Message Date
  Zhang Xianyi acb11905d5 Fixed #199. Saved USE_THREAD switch for make install. 12 years ago
  Zhang Xianyi 109500178c Refs #220. Support Power7 by old Power6 kernels. 12 years ago
  Zhang Xianyi e50a664865 Refs #215. Fixed the compatible between <complex.h> and <complex> in C++. 12 years ago
  Zhang Xianyi 357078b93e Refs #216. Revert the default value of GEMM_MULTITHREAD_THRESHOLD to 4. 12 years ago
  wernsaar 731220f870 changed DGEMM_DEFAULT_P and DGEMM_DEFAULT_Q to 248 for BULLDOZER 64bit 12 years ago
  wernsaar 69aa6c8fb1 bad performance with some data 12 years ago
  wernsaar 60b263f3d2 removed trsm_kernel_RT_4x4_bulldozer.S. wrong results 12 years ago
  wernsaar 7ac306e0da added trsm_kernel_RT_4x4_bulldozer.S 12 years ago
  wernsaar 4cb454cdf2 added trsm_kernel_LT_4x4_bulldozer.S 12 years ago
  wernsaar 19ad2fb128 prefetch improved. Defined 2 different kernels for inner loop 12 years ago
  Zhang Xianyi 5d96e4f224 Refs #210. Disable checking /lib/libpthread.so*. 12 years ago
  wernsaar 6821677489 minor improvements and code cleanup 12 years ago
  Xianyi Zhang dbbda55e67 Updated the mailing list for OpenBLAS. 12 years ago
  Xianyi Zhang 6c34a7f43c Updated the mailing list for OpenBLAS. 12 years ago
  Zhang Xianyi 3326f3152c Merge pull request #213 from wernsaar/develop 12 years ago
  wernsaar 7641f6e253 Merged some improvements into dgemm_kernel_4x4_bulldozer.S. 12 years ago
  Zhang Xianyi 48bdc1ad3b Added NO_PARALLEL_MAKE flag to disable parallel make. 12 years ago
  Zhang Xianyi 3ad29452d1 Merge pull request #211 from wernsaar/develop 12 years ago
  wernsaar 6e3f6f25a5 New version of dgemm_kernel_4x4_bulldozer.S 12 years ago
  Zhang Xianyi 990efcab6e Merge branch 'loongson3b' into loongson3a 12 years ago
  Zhang Xianyi 75a5dc3975 Added the configure for the host loongcc compiling on Loongson3. 12 years ago
  Xianyi Zhang 986d542acb Merge branch 'loongson3a' into loongson3b 12 years ago
  Xianyi Zhang 6958c1a1aa Fixed the SEGFAULT bug with Loongcc and Loongson3. 12 years ago
  Zhang Xianyi a068d54981 Refs #209. Export the missing cblas_cdotc_sub functions. 13 years ago
  Xianyi Zhang d692ee07f7 Merge branch 'loongson3a' into loongson3b 13 years ago
  Xianyi Zhang 1a57717b1a Added the configuration of Loongcc compiler for Loongson 3 CPU. 13 years ago
  Xianyi Zhang 6b01d58712 Disable the optimization of muli-threading gemm on the Loongson3A. 13 years ago
  Xianyi Zhang 35b943f17f Merge branch 'develop' into loongson3a 13 years ago
  Zhang Xianyi e029242870 Merge pull request #206 from wlbksy/patch-1 13 years ago
  wlbksy 7a9b94b519 Fix #204 13 years ago
  Kenneth Hoste 66b919d99f adjusted Makefile to allow for provided required LAPACK source files rather than downloading them 13 years ago
  Zhang Xianyi f4846afbad Merge pull request #201 from Explorer09/develop 13 years ago
  Explorer09 53588bc786 getarch.c: Minor re-ordering of architecture list 13 years ago
  Explorer09 b47f13ee4c getarch.c: Minor re-ordering of architecture list 13 years ago
  Explorer09 309f90e563 TargetList.txt: minor re-ordering 13 years ago
  Explorer09 773c01f496 Typo correction in README.md 13 years ago
  Zhang Xianyi d831b2ff8b Override CFLAGS in LAPACK make.in. 13 years ago
  Zhang Xianyi 724ae159ce Fixed the Windows x86_64 ABI bug in s/daxpy kernels. 13 years ago
  Zhang Xianyi 2c9a203bd1 Merge pull request #198 from wernsaar/develop 13 years ago
  wernsaar f300ce3df5 new optimization of dgemm kernel for bulldozer: 10% performance increase 13 years ago
  Zhang Xianyi e2c7c75715 Merge pull request #197 from wernsaar/develop 13 years ago
  wernsaar 66e64131ed optimized again bulldozer dgemm kernel 13 years ago
  Zhang Xianyi 5900b1462e Merge pull request #195 from wernsaar/develop 13 years ago
  wernsaar 9405f26f4b new dgemm_kernel for bulldozer 13 years ago
  Zhang Xianyi 54e7b37630 Merge branch 'develop' 13 years ago
  Zhang Xianyi 529f1b5006 Refs#194. Export the missing LAPACK s/dlamc3 functions. 13 years ago
  Zhang Xianyi e5ac3007e0 Merge branch 'develop' 13 years ago
  Zhang Xianyi 0d0405b434 Updated the doc for 0.2.6 version. 13 years ago
  Zhang Xianyi f1ce74ffdd Improved the print when OS don't support AVX. 13 years ago
  Zhang Xianyi d744c9590a In OpenMP threading, preallocate the thread buffer instead of allocating the buffer every time. This patch improved the performance slightly. 13 years ago