Zhang Xianyi
36adfe8d64
Merge branch 'hotfix-v0.2.8' into develop
12 years ago
Zhang Xianyi
a07cc39571
Refs #266 . Fixed the compiling bug with Open64 5.0.
12 years ago
Zhang Xianyi
b5c2ac4fd6
Fixed #264 the memory leak bug in dtrtri_U.
12 years ago
Zhang Xianyi
749f45ffc8
Fixed the FMA3 detection bug.
12 years ago
Zhang Xianyi
534c5ec919
Fixed #261 . Use strncmp instead of a comparing trick.
12 years ago
Zhang Xianyi
bd2da90e13
Fixed typo in getarch_2nd.c.
12 years ago
Zhang Xianyi
5b504d6c23
Refs #263 . Rollback bulldozer and piledriver kernels to barcelona kernels.
12 years ago
Zhang Xianyi
a2930664f4
Refs #262 . Added executable stack markings.
12 years ago
Zhang Xianyi
6e0db36373
Merge branch 'sfabbro-ldflags' into develop
12 years ago
Zhang Xianyi
1e1250b703
Fixed #260 . Fixed generating 32-bit shared library on previous commit.
12 years ago
Zhang Xianyi
23186d9f21
Fixed the FMA3 detection bug.
12 years ago
Zhang Xianyi
e6ebbfd314
Merge branch 'ldflags' of https://github.com/sfabbro/OpenBLAS into sfabbro-ldflags
12 years ago
Zhang Xianyi
4471c77905
Fixed #261 . Use strncmp instead of a comparing trick.
12 years ago
Sebastien Fabbro
9f0fb6e662
Respect user's LDFLAGS
12 years ago
Zhang Xianyi
f26b7a08aa
Merge branch 'develop'
12 years ago
Zhang Xianyi
63f14189e3
Refs #259 . Fixed missing LAPACK functions in shared library.
12 years ago
Zhang Xianyi
e39384432b
Merge branch 'develop'
12 years ago
Zhang Xianyi
c5437149c0
Merge pull request #257 from staticfloat/develop
Add in return value for `interface/trtri.c`
12 years ago
Elliot Saba
6f5b395009
Fix xianyi/OpenBLAS#256
12 years ago
Zhang Xianyi
d4f9571818
Refs #255 . Didn't use f77 compiler.
12 years ago
Zhang Xianyi
937d838619
Update CONTRIBUTORS.md
12 years ago
Zhang Xianyi
a8f9b6a665
Merge branch 'develop'
12 years ago
Zhang Xianyi
6209c8fc44
Fixed #253 . Update doc for v0.2.7 version.
12 years ago
Zhang Xianyi
238ceb4ac0
Merge branch 'loongson3b' into develop
12 years ago
Zhang Xianyi
77b572fa0b
Merge branch 'loongson3a' into develop
Conflicts:
Makefile.system
12 years ago
Zhang Xianyi
f69f89b846
Fixed #254 . Added the date of changes in contributors file.
12 years ago
Zhang Xianyi
c77032b0cc
create contributor file.
12 years ago
wangqian
1b3b9e841d
Fixed a computational error in zgemm_kernel_4x4_sandy.S file.
12 years ago
Zhang Xianyi
b67252c2e4
Ensure the correct stack alignment on Win32.
12 years ago
Zhang Xianyi
c69e73b868
Fixed typo in generating shared library on x86_64.
12 years ago
Zhang Xianyi
b51e2ba1ee
Modified Makefile to avoid redundant echo.
12 years ago
Zhang Xianyi
9c0a834f98
Modified Makefile.install
12 years ago
Zhang Xianyi
2a7503e563
Refs #225 . Fixed a bug in GEMM OpenMP threading.
12 years ago
Zhang Xianyi
fd0c388681
Refs #191 . A workaround for the dtrtri_U single-thread bug.
This function caused the ERKALE serial test to fail.
I replaced it with the LAPACK source code.
12 years ago
Zhang Xianyi
61a9582987
Changed makefile for lapack.
12 years ago
Zhang Xianyi
b681064c6c
Updated travis.
12 years ago
Zhang Xianyi
e80e285928
Update build matrix for Travis CI.
12 years ago
Zhang Xianyi
2ed0f6ab60
Fixed the typo.
12 years ago
Zhang Xianyi
5448643557
Fixed generating dll bug in last commit.
12 years ago
Zhang Xianyi
824c3c4df3
Fixed #251 . Merge branch 'grisuthedragon-develop' into develop
12 years ago
grisuthedragon
c19a488af2
Create openblas_get_parallel to report which
parallelization model is used by OpenBLAS.
12 years ago
Zhang Xianyi
32d2ca3035
Refs #214 , #221 , #246 . Fixed the getrf overflow bug on Windows.
I used a smaller threshold since the stack size is 1MB on Windows.
12 years ago
Zhang Xianyi
6df39ad9e7
Refs #248 . Support LAPACK and LAPACKE with lsbcc.
For LAPACKE, use LAPACK_COMPLEX_STRUCTURE.
The reason is that lsbcc didn't define complex I in complex.h.
12 years ago
Zhang Xianyi
3a96e4cbcb
Merge pull request #249 from wernsaar/develop
replaced defined(DOUBLE) by !defined(XDOUBLE)
12 years ago
wernsaar
6f008abcef
replaced defined(DOUBLE) by !defined(XDOUBLE)
12 years ago
Zhang Xianyi
3eb5af1955
Refs #247 . Included the LAPACK source code. Avoid downloading tar.gz from netlib.org
Based on 3.4.2 version, apply patch.for_lapack-3.4.2.
12 years ago
Zhang Xianyi
fbb75e58b1
Fixed the typo in getarch.c
12 years ago
Zhang Xianyi
f54f5bac9e
Refs #248 . Fixed the LSB compatibility issue for BLAS only.
For example, make CC=lsbcc NO_LAPACK=1.
12 years ago
Zhang Xianyi
5d3312142a
Refs #221 #246 . Fixed the stack overflow bug in multithreading BLAS3.
It occurs when NUM_THREADS (MAX_CPU_NUMBER) is very large, e.g. 256.
typedef struct {
volatile BLASLONG working[MAX_CPU_NUMBER][CACHE_LINE_SIZE * DIVIDE_RATE];
} job_t;
job_t job[MAX_CPU_NUMBER];
The job array takes 8MB.
Thus, we use malloc instead of stack allocation.
12 years ago
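The 8MB figure in the commit message above can be reproduced with a small sketch. The constants here are assumptions chosen to match that figure (MAX_CPU_NUMBER = 256, CACHE_LINE_SIZE = 8, DIVIDE_RATE = 2, and an 8-byte BLASLONG); the helpers job_array_bytes and alloc_jobs are hypothetical names for illustration, not functions from OpenBLAS.

```c
#include <stddef.h>
#include <stdlib.h>

/* Assumed values, picked so the arithmetic matches the commit message. */
#define MAX_CPU_NUMBER  256
#define CACHE_LINE_SIZE 8
#define DIVIDE_RATE     2

typedef long long BLASLONG; /* assumption: 64-bit integer type */

typedef struct {
  volatile BLASLONG working[MAX_CPU_NUMBER][CACHE_LINE_SIZE * DIVIDE_RATE];
} job_t;

/* Hypothetical helper: total size of job_t job[MAX_CPU_NUMBER].
 * 256 * 16 * 8 bytes = 32KB per job_t, times 256 entries = 8MB. */
static size_t job_array_bytes(void) {
  return sizeof(job_t) * MAX_CPU_NUMBER;
}

/* Hypothetical helper: allocate the job array on the heap instead of
 * declaring it as a local variable, so it cannot overflow a small
 * thread stack. Returns NULL on failure; the caller must free() it. */
static job_t *alloc_jobs(void) {
  return (job_t *)calloc(MAX_CPU_NUMBER, sizeof(job_t));
}
```

A caller would check the result of alloc_jobs() for NULL, index it exactly like the former stack array (job[i].working[j][k]), and call free(job) when the threaded routine finishes.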
Zhang Xianyi
886cbaf4e4
Support AMD Piledriver by bulldozer kernels.
12 years ago