wernsaar
|
5fa6158731
|
removed flag -no-integrated-as, because it does not work on Mac OS X
|
11 years ago |
wernsaar
|
84badf8086
|
EXPERIMENTAL: added the flag -no-integrated-as for clang compiler in Makefile.system
|
11 years ago |
wernsaar
|
793175be3a
|
added experimental support for big numa machines
|
11 years ago |
Zhang Xianyi
|
134fa320e6
|
Refs #415. Fixed the x86/i386 compiling bug with DYNAMIC_ARCH=1.
|
11 years ago |
Zhang Xianyi
|
c94762bb56
|
Refs #401. Added NO_AVX2 flag for old binutils (e.g. RHEL6)
|
11 years ago |
Timothy Gu
|
6c2ead30f0
|
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
|
11 years ago |
wernsaar
|
88b6bf251a
|
force fallback for x86 32bit
|
11 years ago |
wernsaar
|
4a2ab7460b
|
Ref #391: force fallback for x86 32bit
|
11 years ago |
wernsaar
|
316df0e821
|
fixed bug for INTERFACE64
|
11 years ago |
wernsaar
|
438002204d
|
Ref #393: fix for INTERFACE64=0 and ARCH_X86 in divtable
|
11 years ago |
wernsaar
|
409b52255c
|
changed default optimization flag from O3 to O2 for ARM
|
11 years ago |
wernsaar
|
a35a1a9ae7
|
changed makefiles for lapack development
|
11 years ago |
Zhang Xianyi
|
75acf96d94
|
Refs #329 #287. Only disable -fopenmp for LAPACK Fortran codes on Windows.
|
12 years ago |
wernsaar
|
2594728eb7
|
Merge remote branch 'origin/develop' into haswell
|
12 years ago |
wernsaar
|
65ebab0688
|
modified Makefile.system
|
12 years ago |
wernsaar
|
0b6e13b689
|
Merge remote branch 'origin/develop' into haswell
|
12 years ago |
wernsaar
|
5c648a8984
|
Merge remote branch 'origin/develop' into haswell
|
12 years ago |
Zhang Xianyi
|
5048a80032
|
Refs #283. Fixed the incorrect usage of long data type for Windows 64.
|
12 years ago |
Zhang Xianyi
|
dfd1064d7b
|
refs #287. Don't enable OpenMP for netlib LAPACK sequential Fortran codes.
|
12 years ago |
Zhang Xianyi
|
c937090121
|
Added gfortran dependency for LSB/lsbcc.
|
12 years ago |
Zhang Xianyi
|
c92ae012a6
|
Refs #279. Provide ONLY_CBLAS flag. If you only need CBLAS without
a fortran compiler, please try make ONLY_CBLAS=1.
This mode only compiles CBLAS, without the BLAS Fortran interface and LAPACK.
|
12 years ago |
Zhang Xianyi
|
2638370844
|
Init code base for Intel Haswell.
|
12 years ago |
Zhang Xianyi
|
673e453b3f
|
Enable bulldozer kernels.
|
12 years ago |
Zhang Xianyi
|
a07cc39571
|
Refs #266. Fixed the compiling bug with Open64 5.0.
|
12 years ago |
Zhang Xianyi
|
5b504d6c23
|
Refs #263. Rollback bulldozer and piledriver kernels to barcelona kernels.
|
12 years ago |
Zhang Xianyi
|
77b572fa0b
|
Merge branch 'loongson3a' into develop
Conflicts:
Makefile.system
|
12 years ago |
Zhang Xianyi
|
b67252c2e4
|
Ensure the correct stack alignment on Win32.
|
12 years ago |
Zhang Xianyi
|
e80e285928
|
Update build matrix for Travis CI.
|
12 years ago |
Zhang Xianyi
|
6df39ad9e7
|
Refs #248. Support LAPACK and LAPACKE with lsbcc.
For LAPACKE, use LAPACK_COMPLEX_STRUCTURE.
The reason is that lsbcc didn't define complex I in complex.h.
|
12 years ago |
Zhang Xianyi
|
3eb5af1955
|
Refs #247. Included lapack source codes. Avoid downloading tar.gz from netlib.org
Based on 3.4.2 version, apply patch.for_lapack-3.4.2.
|
12 years ago |
Zhang Xianyi
|
f54f5bac9e
|
Refs #248. Fixed the LSB compatibility issue for BLAS only.
For example, make CC=lsbcc NO_LAPACK=1.
|
12 years ago |
Zhang Xianyi
|
886cbaf4e4
|
Support AMD Piledriver by bulldozer kernels.
|
12 years ago |
Zhang Xianyi
|
cc522aa21d
|
Use quiet make for Travis CI.
|
12 years ago |
Zhang Xianyi
|
cd1d473ba0
|
Merge pull request #230 from wernsaar/develop
Refs #230. New dgemm and sgemm Kernel for BULLDOZER
|
12 years ago |
Zhang Xianyi
|
56f160134d
|
Refs #231. Change the default C compiler to clang on Mac OSX.
|
12 years ago |
wernsaar
|
d854b30ae6
|
Added UNROLL values for 3M to getarch_2nd.c, Makefile.system and Makefile.L3
|
12 years ago |
Zhang Xianyi
|
960b0c88a7
|
Refs #227. Detected LLVM/Clang compiler.
|
12 years ago |
Zhang Xianyi
|
f2fb8c7035
|
Change LIBSUFFIX from .lib to .a on windows.
|
12 years ago |
Zhang Xianyi
|
357078b93e
|
Refs #216. Revert the default value of GEMM_MULTITHREAD_THRESHOLD to 4.
|
12 years ago |
Zhang Xianyi
|
48bdc1ad3b
|
Added NO_PARALLEL_MAKE flag to disable parallel make.
|
12 years ago |
Zhang Xianyi
|
990efcab6e
|
Merge branch 'loongson3b' into loongson3a
|
12 years ago |
Zhang Xianyi
|
75a5dc3975
|
Added the configuration for host loongcc compilation on Loongson3.
|
12 years ago |
Xianyi Zhang
|
6958c1a1aa
|
Fixed the SEGFAULT bug with Loongcc and Loongson3.
|
12 years ago |
Xianyi Zhang
|
1a57717b1a
|
Added the configuration of Loongcc compiler for Loongson 3 CPU.
|
13 years ago |
Zhang Xianyi
|
5c8bf6ae0e
|
Merge branch 'bulldozer' into develop
|
13 years ago |
Zaheer Chothia
|
4db6660de4
|
Refs #185. Add missing 'const' to declarations in <cblas.h>. Thanks to Dan Povey!
The 'const' modifications were done automatically using these scripts:
https://kaldi.svn.sourceforge.net/svnroot/kaldi/sandbox/dan/tools/for_openblas
|
13 years ago |
Zhang Xianyi
|
b7c0fa6bd2
|
Init AMD Bulldozer codebase.
|
13 years ago |
Alexander Nasonov
|
e85549ee11
|
Fix NetBSD build.
|
13 years ago |
Zhang Xianyi
|
08c177ca36
|
Refs #145. Update LAPACK to 3.4.2 version.
|
13 years ago |
Zhang Xianyi
|
2573311308
|
refs #140. Fixed zdot ABI incompatibility issue with GCC 4.7 on Win 32.
GCC 4.7 uses MSVC ABI on Win 32. This means the caller pops the hidden pointer for returning
aggregate structures larger than 8 bytes.
|
13 years ago |