wernsaar
|
409b52255c
|
changed default optimization flag from O3 to O2 for ARM
|
11 years ago |
wernsaar
|
a35a1a9ae7
|
changed makefiles for lapack development
|
11 years ago |
Zhang Xianyi
|
75acf96d94
|
Refs #329 #287. Only disable -fopenmp for LAPACK Fortran codes on Windows.
|
12 years ago |
wernsaar
|
2594728eb7
|
Merge remote branch 'origin/develop' into haswell
|
12 years ago |
wernsaar
|
65ebab0688
|
modified Makefile.system
|
12 years ago |
wernsaar
|
0b6e13b689
|
Merge remote branch 'origin/develop' into haswell
|
12 years ago |
wernsaar
|
5c648a8984
|
Merge remote branch 'origin/develop' into haswell
|
12 years ago |
Zhang Xianyi
|
5048a80032
|
Refs #283. Fixed the incorrect usage of long data type for Windows 64.
|
12 years ago |
Zhang Xianyi
|
dfd1064d7b
|
refs #287. Don't enable OpenMP for netlib LAPACK sequential Fortran codes.
|
12 years ago |
Zhang Xianyi
|
c937090121
|
Added gfortran dependency for LSB/lsbcc.
|
12 years ago |
Zhang Xianyi
|
c92ae012a6
|
Refs #279. Provide ONLY_CBLAS flag. If you only need CBLAS without
a Fortran compiler, please try make ONLY_CBLAS=1.
This mode compiles only CBLAS, without the BLAS Fortran interface and LAPACK.
|
12 years ago |
Zhang Xianyi
|
2638370844
|
Init code base for Intel Haswell.
|
12 years ago |
Zhang Xianyi
|
673e453b3f
|
Enable bulldozer kernels.
|
12 years ago |
Zhang Xianyi
|
a07cc39571
|
Refs #266. Fixed the compiling bug with Open64 5.0.
|
12 years ago |
Zhang Xianyi
|
5b504d6c23
|
Refs #263. Rollback bulldozer and piledriver kernels to barcelona kernels.
|
12 years ago |
Zhang Xianyi
|
77b572fa0b
|
Merge branch 'loongson3a' into develop
Conflicts:
Makefile.system
|
12 years ago |
Zhang Xianyi
|
b67252c2e4
|
Ensure the correct stack alignment on Win32.
|
12 years ago |
Zhang Xianyi
|
e80e285928
|
Update build matrix for Travis CI.
|
12 years ago |
Zhang Xianyi
|
6df39ad9e7
|
Refs #248. Support LAPACK and LAPACKE with lsbcc.
For LAPACKE, use LAPACK_COMPLEX_STRUCTURE.
The reason is that lsbcc didn't define complex I in complex.h.
|
12 years ago |
Zhang Xianyi
|
3eb5af1955
|
Refs #247. Included LAPACK source code. Avoid downloading tar.gz from netlib.org
Based on 3.4.2 version, apply patch.for_lapack-3.4.2.
|
12 years ago |
Zhang Xianyi
|
f54f5bac9e
|
Refs #248. Fixed the LSB compatibility issue for BLAS only.
For example, make CC=lsbcc NO_LAPACK=1.
|
12 years ago |
Zhang Xianyi
|
886cbaf4e4
|
Support AMD Piledriver by bulldozer kernels.
|
12 years ago |
Zhang Xianyi
|
cc522aa21d
|
Use quiet make for Travis CI.
|
12 years ago |
Zhang Xianyi
|
cd1d473ba0
|
Merge pull request #230 from wernsaar/develop
Refs #230. New dgemm and sgemm Kernel for BULLDOZER
|
12 years ago |
Zhang Xianyi
|
56f160134d
|
Refs #231. Change the default C compiler to clang on Mac OSX.
|
12 years ago |
wernsaar
|
d854b30ae6
|
Added UNROLL values for 3M to getarch_2nd.c, Makefile.system and Makefile.L3
|
12 years ago |
Zhang Xianyi
|
960b0c88a7
|
Refs #227. Detected LLVM/Clang compiler.
|
12 years ago |
Zhang Xianyi
|
f2fb8c7035
|
Change LIBSUFFIX from .lib to .a on windows.
|
12 years ago |
Zhang Xianyi
|
357078b93e
|
Refs #216. Revert the default value of GEMM_MULTITHREAD_THRESHOLD to 4.
|
12 years ago |
Zhang Xianyi
|
48bdc1ad3b
|
Added NO_PARALLEL_MAKE flag to disable parallel make.
|
12 years ago |
Zhang Xianyi
|
990efcab6e
|
Merge branch 'loongson3b' into loongson3a
|
12 years ago |
Zhang Xianyi
|
75a5dc3975
|
Added the configuration for compiling with the host loongcc compiler on Loongson3.
|
12 years ago |
Xianyi Zhang
|
6958c1a1aa
|
Fixed the SEGFAULT bug with Loongcc and Loongson3.
|
12 years ago |
Xianyi Zhang
|
1a57717b1a
|
Added the configuration of Loongcc compiler for Loongson 3 CPU.
|
13 years ago |
Zhang Xianyi
|
5c8bf6ae0e
|
Merge branch 'bulldozer' into develop
|
13 years ago |
Zaheer Chothia
|
4db6660de4
|
Refs #185. Add missing 'const' to declarations in <cblas.h>. Thanks to Dan Povey!
The 'const' modifications were done automatically using these scripts:
https://kaldi.svn.sourceforge.net/svnroot/kaldi/sandbox/dan/tools/for_openblas
|
13 years ago |
Zhang Xianyi
|
b7c0fa6bd2
|
Init AMD Bulldozer codebase.
|
13 years ago |
Alexander Nasonov
|
e85549ee11
|
Fix NetBSD build.
|
13 years ago |
Zhang Xianyi
|
08c177ca36
|
Refs #145. Update LAPACK to 3.4.2 version.
|
13 years ago |
Zhang Xianyi
|
2573311308
|
refs #140. Fixed zdot ABI incompatibility issue with GCC 4.7 on Win 32.
GCC 4.7 uses MSVC ABI on Win 32. This means the caller pops the hidden pointer for returning
aggregate structures larger than 8 bytes.
|
13 years ago |
Zhang Xianyi
|
758e34efbb
|
Fixed the detection bug on Loongson 3A server.
|
13 years ago |
Zhang Xianyi
|
f76a384841
|
Refs #139. Added NO_AVX flag to use old Nehalem kernels on Sandy Bridge.
For example, make NO_AVX=1 or make DYNAMIC_ARCH=1 NO_AVX=1
|
13 years ago |
Jameson Nash
|
d0e731e8b8
|
provide support for passing CFLAGS, FFLAGS, PFLAGS, FPFLAGS to make on the command line
|
13 years ago |
Zhang Xianyi
|
068861a927
|
Refs #133. Users can set COMMON_OPT flag to control CFLAGS and FFLAGS.
|
13 years ago |
Zaheer Chothia
|
e8306f623a
|
Refs #127. Generate DLL without a version suffix on Windows.
|
13 years ago |
Xianyi Zhang
|
25f1a573fd
|
Fixed the build bug when DYNAMIC_ARCH=0.
|
13 years ago |
Xianyi Zhang
|
34fd3b85a8
|
Refs #113. Fixed BOBCATE typo in dynamic arch building.
|
13 years ago |
Zhang Xianyi
|
d6cab3f37e
|
Refs #113. Support AMD Bobcat using Barcelona kernel codes. Replace 3DNow! with MMX.
|
13 years ago |
Xianyi Zhang
|
a53c6e2440
|
Merge branch 'develop' into sandybridge
|
13 years ago |
Zaheer Chothia
|
14c3511e92
|
Respect C compiler set on the command line or inherited from the environment
|
13 years ago |