OpenBLAS

Commit Graph

Author	SHA1	Message	Date
Arda Aytekin	e3666931d8	Update .travis.yml Updated `.travis.yml` file to add emulated tests for `ARMV6` and `ARMV8` architectures with `gcc` and `clang`. Created prebuilt images with required dependencies. Squashed layers into one.	7 years ago
Martin Kroeker	667fed579d	Merge pull request #1856 from rengolin/armv8-a57 [Arm64) Revert A53 detection as A57	7 years ago
Martin Kroeker	96d2f2c9b2	Merge pull request #1831 from brada4/hemv disable threading in C/ZSWAP copying from S/DSWAP	7 years ago
Martin Kroeker	653e657a58	Merge pull request #1857 from brada4/fc-1847 Add gfortran -frecursive option from upstream and #1847	7 years ago
Martin Kroeker	5f8f0583d4	Merge branch 'develop' into fc-1847	7 years ago
Martin Kroeker	974a6a30f2	Merge pull request #1858 from brada4/buff-1847 Add minimum threshold for number of buffers	7 years ago
Andrew	9531d0e175	lets fit it in one 4k page	7 years ago
Andrew	40cce0e353	handle cmake too	7 years ago
Andrew	3fd41313fc	add low bound for number of buffers	7 years ago
Andrew	a931afe269	init	7 years ago
Andrew	7d3502b500	Add -frecursive gfortran option by default	7 years ago
Andrew	066f8065d1	init	7 years ago
Renato Golin	fb5b2177ca	[Arm64) Revert A53 detection as A57 This patch reverts the decision of treating A53 like A57, which was based on an analysis done on server class hardware and is not representative of all A53s out there. Fixes #1855.	7 years ago
Martin Kroeker	f1c02273cb	Merge pull request #1846 from fenrus75/threadsize gemm/dgemm: add a way for an arch kernel to specify preferred sizes	7 years ago
Martin Kroeker	661035477c	Merge pull request #1850 from martin-frbg/issue1811 Restore Android/ARMv7 build fix from #778	7 years ago
Martin Kroeker	aa7e47aa0a	Merge pull request #1849 from martin-frbg/aix_install2 Use installbsd on AIX	7 years ago
Martin Kroeker	9c177d270b	Restore Android/ARMv7 build fix from #778 for #1811	7 years ago
Martin Kroeker	b025523197	Use installbsd on AIX (and fix misplaced parenthesis from previous commit). See #1803	7 years ago
Martin Kroeker	5b50bd36f7	Merge pull request #1845 from martin-frbg/aix_install Accomodate AIX install, which has different syntax	7 years ago
Arjan van de Ven	5b708e5eb1	sgemm/dgemm: add a way for an arch kernel to specify prefered sizes The current gemm threading code can make very unfortunate choices, for example on my 10 core system a 1024x1024x1024 matrix multiply ends up chunking into blocks of 102... which is not a vector friendly size and performance ends up horrible. this patch adds a helper define where an architecture can specify a preference for size multiples. This is different from existing defines that are minimum sizes and such. The performance increase with this patch for the 1024x1024x1024 sgemm is 2.3x (!!)	7 years ago
Arjan van de Ven	dcc5d6291e	skylakex: Make the sgemm/dgemm beta code robust for a N=0 or M=0 case in the threading code there are cases where N or M can become 0, and the optimized beta code did not handle this well, leading to a crash during the audit for the crash a few edge conditions on the if statements were found and fixed as well	7 years ago
Martin Kroeker	7b5aea52bb	Accomodate AIX install, which has different syntax for #1803	7 years ago
Martin Kroeker	f5595d0262	Merge pull request #1843 from martin-frbg/aix_numprocs Add get_num_procs implementation for AIX	7 years ago
Martin Kroeker	326d394a0f	Add get_num_procs implementation for AIX (and copy HAIKU implementation to the non-TLS version of the code as well)	7 years ago
Martin Kroeker	6af8e35a24	Merge pull request #1837 from embray/set-num-thread-after-fork Ensure that blas_thread_init has been called in openblas_set_num_threads	7 years ago
Erik M. Bray	38cf5d9364	ensure that threading has been initialized in the first place before calling openblas_set_num_threads	7 years ago
Martin Kroeker	8a43baacb2	Merge pull request #1836 from martin-frbg/zen2core Fix detection of Ryzen2 (missing CORE_ZEN)	7 years ago
Martin Kroeker	64ca44873b	Fix detection of Ryzen2 (missing CORE_ZEN)	7 years ago
Martin Kroeker	76a66eaac8	Merge pull request #1829 from ashwinyes/develop_aarch64_dynamic_arch_support Add DYNAMIC_ARCH support for ARM64	7 years ago
Andrew	2992e3886a	disable threading in C/ZSWAP copying from S/DSWAP	7 years ago
Ashwin Sekhar T K	d5aeff636f	ARM64: Enable DYNAMIC_ARCH Enable DYNAMIC_ARCH feature on ARM64. This patch uses the cpuid feature in linux kernel to detect the core type at runtime (https://www.kernel.org/doc/Documentation/arm64/cpu-feature-registers.txt). If this feature is missing in kernel, then the user should use the OPENBLAS_CORETYPE env variable to select the desired core type.	7 years ago
Ashwin Sekhar T K	af2837c392	ARM64: Remove #define ARMV8 for THUNDERX	7 years ago
Ashwin Sekhar T K	e7b66cd36e	ARM64: Fix DYNAMIC_ARCH compilation for cores which dont use GEMM3M	7 years ago
Ashwin Sekhar T K	d50abc8903	ARM64: Move parameters from parameter.c to param.h Remove the runtime setting of P, Q, R parameters for targets ARMV8, THUNDERX2T99. Instead set them as constants in param.h at compile time.	7 years ago
Ashwin Sekhar T K	351a0c777c	ARM64: Remove XGENE1 references Remove XGENE1 target as the implementation for the same is incomplete. Moreover whoever wishes to use on XGENE1 can use the generic ARMV8 target as there are no XGENE1 specific optimizations in OpenBLAS.	7 years ago
Martin Kroeker	e3c262e5cf	Merge pull request #1825 from brada4/hemv Delay _hemv threading in attempt to address #1820	7 years ago
Andrew	a293bdcd5e	re-arrange new code for readability	7 years ago
Andrew	c7bbf9c987	Attempt to tame _hemv threading #1820	7 years ago
Andrew	898a8dcaba	init	7 years ago
Martin Kroeker	71c6deed60	Merge pull request #1821 from ashwinyes/develop_aarch64_armv8neonkernels Use ThunderX2 Neon Kernels for ARMV8 Target	7 years ago
Ashwin Sekhar T K	21f46a1cf2	ARM64: Use THUNDERX2T99 Neon Kernels for ARMV8 Currently the generic ARMV8 target uses C implementations for many routines. Replace these with the neon implementations written for THUNDERX2T99 target which are upto 6x faster for certain routines.	7 years ago
Ashwin Sekhar T K	caf339412f	ARM64: Remove dependency of THUNDERX2T99 Makefile on CORTEXA57 Makefile	7 years ago
Ashwin Sekhar T K	8001fdcd2a	ARM64: Remove dependency of THUNDERX Makefile on ARMV8 Makefile	7 years ago
Ashwin Sekhar T K	162e312832	ARM64: Remove dependency of CORTEXA57 Makefile on ARMV8 Makefile	7 years ago
Ashwin Sekhar T K	c3d93caa8d	ARM64: Remove dependency of XGENE1 Makefile on ARMV8 Makefile	7 years ago
Martin Kroeker	a71923514f	Merge pull request #1815 from fenrus75/sgemm_beta_fix enable the SGEMM/SKX C based kernel	7 years ago
Arjan van de Ven	55b244ca0d	enable the SGEMM/SKX C based kernel In QA the final bug was found so now the sklyakex sgemm C based kernel can be activated....	7 years ago
Martin Kroeker	2263d3906c	Merge pull request #1812 from martin-frbg/issue1806-2 Use KERNEL_DEFINITIONS rather than COMMON_OPTS to pass -march=skylake…	7 years ago
Martin Kroeker	81c9985c3a	Use KERNEL_DEFINITIONS rather than COMMON_OPTS to pass -march=skylake-avx512	7 years ago
Martin Kroeker	56ebc7b53e	Merge pull request #1808 from martin-frbg/issue1806 Add -march=skylake-avx512 to CFLAGS when the target is Skylake	7 years ago
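
Commit 5b708e5eb1 above describes a 1024x1024x1024 matrix multiply on 10 cores being split into blocks of 102, which is not a vector-friendly size, and adds a way for an architecture to state a preferred size multiple. The following is a minimal sketch of that idea only, not the code from the patch: the function name `chunk_width` and the multiple of 16 used in the note below are illustrative assumptions and do not come from the OpenBLAS source.

```c
#include <assert.h>

/*
 * Hypothetical helper (not an OpenBLAS function): compute the width of the
 * per-thread chunk when `total` columns are divided among `nthreads` threads,
 * rounding the width up to a multiple of `preferred`.
 *
 * A `preferred` value of 1 or less disables the rounding, so the result is
 * the plain ceiling of total / nthreads.
 */
static long chunk_width(long total, int nthreads, long preferred)
{
    assert(total >= 0);
    assert(nthreads > 0);

    /* Ceiling division: every thread gets at most this many columns. */
    long width = (total + nthreads - 1) / nthreads;

    /* Round up to the next multiple of the preferred size, if one is given. */
    if (preferred > 1)
        width = ((width + preferred - 1) / preferred) * preferred;

    return width;
}
```

With these assumed values, `chunk_width(1024, 10, 16)` returns 112 instead of the 102 obtained from plain integer division, so nine threads each receive a chunk of 112 columns and the last one receives the remaining 16.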


3262 Commits (e3666931d8b54f0bf918e45bc3da6ce51ea2a52a)