Ali Saidi
19f3a4091c
Make rpcc() on arm64 get closer to what x86 returns
The Arm implementation of rpcc() uses the architected timer
which is defined by the SBSA to be between 10-400MHz. These numbers
are much smaller than the cycle counter frequency used by x86. Make
the numbers closer by shifting the cycle counter up by the number of
leading zeros in the cntfrq_el0 register which gets us closer to a
noraml cpu clock cycle range.
6 years ago
Martin Kroeker
430ee31e66
Merge pull request #2447 from martin-frbg/issue2446
Always select ARMV8 parameters for big servers when cpu is TSV110 or EMAG8180
6 years ago
Martin Kroeker
8164fd1328
Always assume server-class cpu count for TSV110 and EMAG8180
6 years ago
Martin Kroeker
531c6b96d6
Merge pull request #34 from xianyi/develop
rebase
6 years ago
Martin Kroeker
ddcbed6690
Merge pull request #2437 from martin-frbg/issue2434
[WIP] Add support for Ampere EMAG8180 ARMV8 cpu
6 years ago
Martin Kroeker
f8ec538c82
Add Ampere EMAG8180
6 years ago
Martin Kroeker
ca4f7dceff
Add parameters for EMAG8180 DYNAMIC_ARCH support with cmake
6 years ago
Martin Kroeker
1ddf9f1067
Add EMAG8180 to arm64 DYNAMIC_ARCH list for cmake
6 years ago
Martin Kroeker
4c5fac5a2b
Typo fix
6 years ago
Martin Kroeker
320e2648cd
Add EMAG8180 to DYNAMIC_CORE list for ARM64
6 years ago
Martin Kroeker
9b732696c6
Add DYNAMIC_ARCH support for ARMV8 EMAG8180
6 years ago
Martin Kroeker
c9dcb3d4a4
Merge pull request #2443 from aaawuanjun/develop
[OpenBlas]:benchmark/copy.c has time,x,y data loop problems
6 years ago
Martin Kroeker
3bb7f0138e
Merge pull request #2442 from martin-frbg/lapackpr390
Apply fix from Reference-LAPACK PR 390
6 years ago
wuanjun 00447568
c93ae92579
[OpenBlas]:benchmark/copy.c has time,x,y data loop problems
6 years ago
Martin Kroeker
87ac1ceb0b
Apply fix from Reference-LAPACK PR390, NaN not propagating
6 years ago
Martin Kroeker
9e40c080f2
Apply fix from Reference-LAPACK PR390, NaN not propagating
6 years ago
Martin Kroeker
1e8410f18c
Merge pull request #2441 from martin-frbg/ismin2
Add proper defaults for the IxMIN/IxMAX kernels on mips64 and power
6 years ago
Martin Kroeker
07454bf4d5
Add proper defaults for IxMIN/IxMAX kernels
the fallbacks from Makefile.L1 assume a combined source for absolute value and non-absolute (with ifdef USE_ABS) but here we have separate implementations
6 years ago
Martin Kroeker
4046985913
Add proper defaults for IxMIN/IxMAX kernels
the fallbacks from Makefile.L1 assume a combined source for absolute value and non-absolute (with ifdef USE_ABS) but here we have separate implementations
6 years ago
Martin Kroeker
75577f95a7
Merge pull request #33 from xianyi/develop
rebase
6 years ago
Martin Kroeker
33d92c7a37
Merge pull request #2435 from martin-frbg/issue2433
Fix handling of ppc endianness
6 years ago
Martin Kroeker
e57b11acca
Add preliminary support for EMAG8180
6 years ago
Martin Kroeker
71e5669c3e
Add preliminary support for EMAG8180 ARMV8 processor
6 years ago
Martin Kroeker
e8d82c01d4
Recognize Ampere EMAG8180
6 years ago
Martin Kroeker
0b39cf95b0
Fix endianness conditionals
6 years ago
Martin Kroeker
76b2cec6ce
Get endianness into Makefile variable
6 years ago
Martin Kroeker
276c1791ea
Merge pull request #32 from xianyi/develop
rebase
6 years ago
Martin Kroeker
c5bbfd8fee
Merge pull request #2432 from isuruf/install_name
Fix install name on osx again
6 years ago
Isuru Fernando
130c1741e5
Fix install name on osx again
6 years ago
Martin Kroeker
8f782f0673
Merge pull request #2426 from zbeekman/nightly-homebrew-check
Nightly homebrew check
6 years ago
Martin Kroeker
6a517dcb6a
Merge pull request #2427 from martin-frbg/powermin
Fix ISMIN and ISMAX kernel choices for POWER8
6 years ago
Martin Kroeker
9f39f0a2c3
Specify ismin/ismax assembly kernels for POWER8 directly
to fix utest failure in new ismin test - Makefile.L1 defaults look wrong
6 years ago
Izaak Beekman
1a88c4ab26
Fix bottle upload problem & typo
6 years ago
Izaak Beekman
0b44802164
Test push & PRs only when workflow file changes
Also, add comments to clarify what the test is testing
6 years ago
Izaak Beekman
2c242b4cef
Add Github Action to build development branch nightly with Homebrew
6 years ago
Martin Kroeker
0bfb7336d2
Merge pull request #2424 from isuruf/osx
Fix building on osx
6 years ago
Martin Kroeker
403cde104e
Merge pull request #30 from xianyi/develop
rebase
6 years ago
Martin Kroeker
634f2bddda
Merge pull request #2414 from marxin/fix-iamax_sse-implementation
Fix iamax sse implementation and add utests
6 years ago
Martin Liska
aeea14ee40
Come up with LOAD_AND_COMPARE_TO_MXX macro in iamax_sse.S.
6 years ago
Martin Liska
18bcc36a69
Fix implementation of iamax_sse.S as reported in #2116 .
The was a typo in iamax_sse.S where one of the comparison
was cmpeqps instead of cmpeqss. That misdetected index
for sequences where the minimum value was 0.
6 years ago
Martin Liska
0e7f43c898
Add missing USE_MIN in kernel/CMakeLists.txt.
6 years ago
Martin Kroeker
79e201fbba
Merge pull request #2423 from xianyi/issue2419
Restore -march flag for Android builds
6 years ago
Isuru Fernando
4326dcb460
Pass CFLAGS from env to Makefile.prebuild and remove iOS hack
6 years ago
Martin Kroeker
e32f3b1447
Restore -march flag for Android builds
fixes #2419 - renewed discussion in #2112 suggests removal of the option was primarily aimed at non-Android builds
6 years ago
Martin Kroeker
d483e9270a
Update KERNEL.POWER8
6 years ago
Martin Kroeker
01834aee33
Merge pull request #29 from xianyi/develop
rebase
6 years ago
Martin Kroeker
d92bd5be24
Update KERNEL.POWER8
6 years ago
Martin Kroeker
46e4b12946
Update KERNEL.POWER8
6 years ago
Martin Kroeker
5e94aa4877
Merge pull request #2417 from marxin/make-ctest-verbose-for-drone
Make ctest verbose for drone
6 years ago
Martin Kroeker
93f3e27574
Merge pull request #2415 from marxin/add-cmake-to-gitignore
Add CMake related files to .gitignore.
6 years ago