Martin Koehler
76c6e33e54
Enable EXCAVATOR kernels for A12-9800
9 years ago
Ashwin Sekhar T K
2757b49767
THUNDERX2T99: Add Optimized CGEMM Implementation
9 years ago
Ashwin Sekhar T K
f279ff4789
THUNDERX2T99: Add Optimized SGEMM Implementation
9 years ago
Zhang Xianyi
0863a0d4b4
Merge pull request #1061 from ashwinyes/develop_aarch64_vulcan_thunderx_patch
Add new targets for ARM64
9 years ago
Werner Saar
c1c5a63d3c
prepared parameter.c for UNROLL values that are not a power of two
9 years ago
Ashwin Sekhar T K
4b55fae337
ARM64: Add Cavium THUNDERX2T99 Target
9 years ago
Ashwin Sekhar T K
0b8e876d89
VULCAN: Add optimized DGEMM implementation
9 years ago
Ashwin Sekhar T K
4713e7c47f
ARM64: Add the VULCAN Target
9 years ago
jiahaipeng
1aa1e6cb54
Modify blas_l1_thread.c to support multi-threading for L1 functions with a return value
9 years ago
Werner Saar
b9bb009236
Merge pull request #1053 from wernsaar/develop
prepared driver/level3 functions for UNROLL values, that are not a po…
9 years ago
Werner Saar
a2672d5589
prepared driver/level3 functions for UNROLL values that are not a power of two
9 years ago
Martin Kroeker
51aa157e64
Relocate declaration of alloc_lock outside ifdef block
9 years ago
Martin Kroeker
87c7d10b34
Fix thread data races detected by helgrind 3.12
Ref. #995, may help solve issues seen in #660, #883
9 years ago
Martin Kroeker
0ef7841473
Update xerbla.c
9 years ago
Martin Kroeker
104ad066af
Use appropriate int32/int64 format for error number in message string
9 years ago
Alex Arslan
a16ace68f5
Include system headers on FreeBSD
9 years ago
Martin Kroeker
596ead0f8d
Add files via upload
9 years ago
Zhang Xianyi
66c9a9b33d
Merge pull request #981 from howard0su/develop
Use NPROCESSOR_CONF instead of NPROCESSOR_ONLN
9 years ago
Martin Kroeker
8a8f3932eb
Update dynamic.c
Add Bay Trail "Pentium N3520" atom
9 years ago
Howard Su
ff1da01476
Use NPROCESSOR_CONF instead of NPROCESSOR_ONLN
to determine the number of CPUs. On ARM platforms,
the number of online CPUs increases when there is more workload,
while the number of configured CPUs is the maximum number of CPUs.
9 years ago
Zhang Xianyi
ef52a9266b
Fixed #979 . Patch for NetBSD.
9 years ago
Martin Kroeker
7de829f713
Update dynamic.c
Add Braswell (extended model 4, model 12) N3150 as Nehalem
9 years ago
John Biddiscombe
053044ae4d
Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PROJECT_BINARY_DIR
If OpenBLAS is built using add_subdirectory(OpenBlas) as part of another project
then the paths set by CMAKE_XXX_DIR are relative to the parent project
and not the OpenBLAS project.
9 years ago
Ashwin Sekhar T K
0fb380c966
Update NUMA CPU binding
When all processes can be
accommodated within the current node,
use cores from the current node only.
9 years ago
Werner Saar
78b05f6476
bugfix for EXCAVATOR and DYNAMIC_ARCH
9 years ago
Werner Saar
2b967590a0
bugfix in dynamic.c
9 years ago
Theoractice
aa744dfa59
Update memory.c
10 years ago
theoractice
61cf8f74d9
Fix access violation on Windows while static linking
10 years ago
Zhang Xianyi
68eb4fa329
Add missing openblas_env makefile.
10 years ago
Zhang Xianyi
05196a8497
Refs #716 . Only call getenv at init function.
10 years ago
Jerome Robert
53ba1a77c8
ztrmv_L.c: no longer need a 4kB buffer
Fix #786
10 years ago
Zhang Xianyi
1edf30b790
Change Opteron(SSE3) to Opteron_SSE3 in the dynamic core name.
10 years ago
Zhang Xianyi
6b85dbb6dc
Refs #696 . Turn off stack limit setting on Linux.
I cannot reproduce SEGFAULT of lapack-test with default stack size
on ARM Linux.
10 years ago
Zhang Xianyi
d06b92906a
Add gemm3m building for CMake.
10 years ago
Jerome Robert
78dcf5c3d5
Improve performance of ztrmv on small matrices
* Use stack allocation
* Disable multi-threading
* Ref #727
10 years ago
Martin Kroeker
935356c34f
Update dynamic.c and cpuid_x86.c for Intel Avoton.
Second part of "support Intel Avoton via Nehalem kernel"
10 years ago
Zhang Xianyi
f5df444ceb
Merge pull request #762 from jeromerobert/bug760
Let openblas_get_num_threads return the number of active threads
10 years ago
Zhang Xianyi
aaa8551c57
Merge pull request #749 from lotheac/illumos_fixes
illumos fixes
10 years ago
Jerome Robert
0d87c1ffb6
Let openblas_get_num_threads return the number of active threads
... not the number of allocated threads.
Close #760
10 years ago
Lauri Tirkkonen
e737e32fd1
RLIMIT_NPROC doesn't exist on illumos
10 years ago
Lauri Tirkkonen
97cd4b8aee
illumos fixes to memory.c
10 years ago
Werner Saar
b07d733a71
added updates for syrk and syr2k
10 years ago
Zhang Xianyi
055b481386
Fixed CMake bug for single core.
10 years ago
Werner Saar
0d22551a6b
increase the stack size limit in the constructor
10 years ago
Ralph Campbell
fbc21266e6
Minor C code fixes in driver/
10 years ago
Zhang Xianyi
839395fc25
Detect AMD Trinity and Richland.
10 years ago
j-bo
6040858b22
Fix #673
Add missing header declarations when compiling for Android ARM7
10 years ago
Zhang Xianyi
70642fe4ed
Refs #668 . Raise the signal when pthread_create fails.
Thanks to James K. Lowden for the patch.
10 years ago
Zhang Xianyi
2feef49fa8
Merge branch 'develop' into cmake
Conflicts:
driver/others/memory.c
10 years ago
Zhang Xianyi
1ce054fcb3
Refs #669 . Fixed the build bug with gcc on Mac OS X.
10 years ago