Zhang Xianyi
c2fdeb6c22
Merge pull request #429 from idunham/numprocs
Fix link error on Linux/musl.
11 years ago
Isaac Dunham
f7eb81a846
Fix link error on Linux/musl.
get_nprocs() is a GNU convenience function equivalent to POSIX2008
sysconf(_SC_NPROCESSORS_ONLN); the latter should be available in unistd.h
on any current *nix. (OS X supports this call since 10.5, and FreeBSD
currently supports it. But this commit does not change FreeBSD or OS X
versions.)
11 years ago
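The substitution described in the commit message above can be sketched as follows. This is a minimal illustration, not the actual OpenBLAS patch: the helper name `online_cpu_count` is hypothetical, and the fallback to 1 when the query fails is an assumption made here for safety.

```c
#include <unistd.h>

/*
 * Hypothetical helper (not taken from OpenBLAS): returns the number of
 * processors currently online using sysconf() instead of the GNU-specific
 * get_nprocs(). Falls back to 1 if sysconf() reports an error (-1) or an
 * otherwise non-positive value; that fallback is an assumption of this sketch.
 */
int online_cpu_count(void)
{
    long n = sysconf(_SC_NPROCESSORS_ONLN);
    return (n > 0) ? (int)n : 1;
}
```

Because only `unistd.h` is needed, a helper like this links against musl as well as glibc, which is the portability point the commit message makes.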
Zhang Xianyi
edc329883c
Merge pull request #427 from wernsaar/develop
added experimental support for big numa machines
11 years ago
wernsaar
793175be3a
added experimental support for big numa machines
11 years ago
Zhang Xianyi
83c4ba8d32
Merge pull request #426 from wernsaar/develop
added benchmark program for lapack ?getri functions
11 years ago
wernsaar
271af406f3
bugfix for linux affinity code
11 years ago
wernsaar
f5f50b3563
added benchmarks for lapack potrf, potrs and potri functions
11 years ago
wernsaar
651dd22d7d
added benchmark program for lapack ?getri functions
11 years ago
Zhang Xianyi
f329f77bd0
Merge pull request #425 from wernsaar/develop
added benchmark for lapack ?geev routines
11 years ago
wernsaar
7c611a2f95
bugfix for zgeev
11 years ago
wernsaar
296564e369
added lapack geev benchmark
11 years ago
Zhang Xianyi
27af6e35d3
Merge pull request #424 from ihnorton/fix_arm_cpuid
cpuid_arm: fix detection when cpuinfo uses "Processor"
11 years ago
Isaiah Norton
a183ad1df4
cpuid_arm: fix detection when cpuinfo uses "Processor"
instead of "model name"
11 years ago
wernsaar
799a0eabbd
bugfix in cholesky.c
11 years ago
wernsaar
ca63503e61
extended plot-filter.sh for linpack and cholesky benchmarks
11 years ago
Zhang Xianyi
4f83217df6
Merge pull request #422 from wernsaar/develop
optimization of sandybridge cgemm-kernel
11 years ago
wernsaar
5087096711
optimization of sandybridge cgemm-kernel
11 years ago
Zhang Xianyi
21f7768b26
Merge pull request #421 from wernsaar/develop
optimized sgemm- and cgemm-kernel for haswell
11 years ago
wernsaar
46bc4fd50c
optimized cgemm kernel for haswell
11 years ago
wernsaar
1cc02b4337
optimized sgemm kernel for haswell
11 years ago
Zhang Xianyi
6e223db7fc
Merge pull request #420 from wernsaar/develop
Optimizations for HASWELL
11 years ago
wernsaar
1d33547222
optimized zgemm kernel for haswell
11 years ago
wernsaar
3ea4dadd30
optimizations for trsm
11 years ago
wernsaar
1b10ff129a
optimizations for trmm
11 years ago
wernsaar
125610d23b
allow setting a custom value for ?GEMM_DEFAULT_UNROLL_MN, optimizations for syrk
11 years ago
wernsaar
e213a42cde
added sample plot-filter scripts and a header file for gnuplot
11 years ago
wernsaar
e4663be46a
added symv benchmark
11 years ago
wernsaar
11637b6926
add benchmark for ger
11 years ago
Zhang Xianyi
80bf3e6a35
Merge pull request #419 from wernsaar/develop
added optimized sgemv kernels for Sandy Bridge, Haswell, Bulldozer, and Piledriver.
11 years ago
wernsaar
6acbafe45b
added sgemv_n microkernel for haswell
11 years ago
wernsaar
5392d11b04
optimized sgemv_n_microk_sandy.c
11 years ago
wernsaar
c0fe95fb72
added sgemv_n microkernel for sandybridge
11 years ago
wernsaar
d9d4077c93
added sgemv_t microkernel for haswell
11 years ago
wernsaar
02eb72ac42
bugfix in sgemv_t_microk_sandy.c
11 years ago
wernsaar
c06f9986d4
added sgemv_t microkernel for sandybridge
11 years ago
wernsaar
2cce125c79
added optimized sgemv_t for bulldozer and piledriver
11 years ago
wernsaar
b3938fe371
don't use this sgemv_n on Windows
11 years ago
Zhang Xianyi
e6668dd83b
Merge pull request #414 from staticfloat/sf/symlinkfix
Don't create an absolute symlink when installing on Darwin
11 years ago
wernsaar
c8a4a56177
performance optimizations for sgemv_n
11 years ago
wernsaar
3c5732615d
added blocked sgemv_n and microkernel for bulldozer and piledriver
12 years ago
Zhang Xianyi
f20c0f9819
Merge branch 'develop'
12 years ago
Zhang Xianyi
134fa320e6
Refs #415. Fixed the x86/i386 compiling bug with DYNAMIC_ARCH=1.
12 years ago
Elliot Saba
a79df1ff49
Don't create an absolute symlink when installing on Darwin
12 years ago
wernsaar
7ceb25d7b3
changed string GFORTRAN to lowercase
12 years ago
Zhang Xianyi
21b5347fbe
Merge branch 'develop'
12 years ago
Zhang Xianyi
f2eb480738
OpenBLAS 0.2.10 version.
12 years ago
Zhang Xianyi
c94762bb56
Refs #401. Added NO_AVX2 flag for old binutils (e.g. RHEL6)
12 years ago
wernsaar
51413925bd
adjust number of threads for small size in cgemv and zgemv
12 years ago
wernsaar
b985cea65d
adjust number of threads for sgemv and dgemv
12 years ago
wernsaar
d286daa2ba
adjusted number of threads for small size
12 years ago