Rohit Goswami
|
efe87a3f5b
|
BUG,ENH: Fix .S handling
TMP: Fixup
|
1 year ago |
Rohit Goswami
|
5cadc67801
|
MAINT: Add more symbols for the test
|
1 year ago |
Rohit Goswami
|
86aa6b3a87
|
MAINT: Quick and dirty working set of symbols
Well working as in has enough symbols for the import, currently is
failing NumPy tests, including dot matrix multiplications...
|
1 year ago |
Rohit Goswami
|
891052a3ef
|
MAINT: Add the _beta variant of gemm
|
1 year ago |
Rohit Goswami
|
517eca96c6
|
MAINT: Add the gemm_small_kernel variants
w/o and with b0
|
1 year ago |
Rohit Goswami
|
cbafa8114f
|
ENH: Add more L3 symbols
|
1 year ago |
Rohit Goswami
|
571d2f3be3
|
ENH: Add TRMM_KERNEL bindings
|
1 year ago |
Rohit Goswami
|
ca8e18eda2
|
MAINT: Start adding L3
|
1 year ago |
Rohit Goswami
|
2fe1f31161
|
MAINT: Start working on kernels and driver L2
|
1 year ago |
Rohit Goswami
|
321ec276e0
|
BLD: Start working on L3
|
1 year ago |
Rohit Goswami
|
bb80fc754b
|
BLD: Finalize the generic and L1, L2
|
1 year ago |
Rohit Goswami
|
1c3caac427
|
BLD: Fixup more L1 from Kernel generic
|
1 year ago |
Rohit Goswami
|
7587dc9975
|
BLD: Re-work the L2 gemv
|
1 year ago |
Rohit Goswami
|
69edd1d5db
|
BLD,BUG: Fix an extent issue
|
1 year ago |
Rohit Goswami
|
b68012462d
|
BLD: Add more kernels
|
1 year ago |
Rohit Goswami
|
5599d73f4a
|
BLD: Generate L1 symbol flags correctly
|
1 year ago |
Rohit Goswami
|
76be8f851d
|
BLD: Add the ? variant for kernel
|
1 year ago |
Rohit Goswami
|
01717ce320
|
ENH: Use the kernel style
Necessary to extend this to L2/L3
|
1 year ago |
Rohit Goswami
|
cced76830e
|
MAINT: Cleanup kernel meson
|
1 year ago |
Rohit Goswami
|
91b355e953
|
MAINT: Fix filepaths for q variants [L1]
|
1 year ago |
Rohit Goswami
|
dcf05e00d4
|
MAINT: Cleanup a bit
|
1 year ago |
Rohit Goswami
|
85db158f02
|
MAINT: Minor refactors to have common precisions
|
1 year ago |
Rohit Goswami
|
28bfd1b3e5
|
MAINT: Simplify and generalize
|
1 year ago |
Rohit Goswami
|
5a7a5a4e55
|
MAINT: Move the precisions out to main meson.build
|
1 year ago |
Rohit Goswami
|
97861ab436
|
MAINT: Cleanup makefile to meson for parallel opt
Needs some work
|
1 year ago |
Rohit Goswami
|
ec9f6504d6
|
MAINT: Cleanup undefined symbols
|
1 year ago |
Rohit Goswami
|
33e66c5400
|
MAINT,BLD: Cleanup SIMD with meson arrays
|
1 year ago |
Rohit Goswami
|
61aab3ce11
|
MAINT: Move -m64 out to cpu_family()
|
1 year ago |
Rohit Goswami
|
9d9b4337ad
|
MAINT: Add simd flags
|
1 year ago |
Rohit Goswami
|
34cf7fd754
|
MAINT: Generalize and setup F_INTERFACE
|
1 year ago |
Rohit Goswami
|
10481ed4f4
|
MAINT: Rework make defines to meson arguments
For SMALL_MATRIX_OPT and MAX_STACK_ALLOC
|
1 year ago |
Rohit Goswami
|
5a1dba3346
|
TMP: Focus on getting a single test example up
Use:
nm -gC bbdir/libopenblas.a | grep drot
❯ gcc trial.c -o trail -I$(pwd)/tmpmake/include -L$(pwd)/bbdir -lopenblas -Wl,--verbose | grep openblas
❯ ./trail
Resulting vectors:
x: 3.000000 4.000000 5.000000 6.000000
y: 2.000000 2.000000 2.000000 2.000000
|
1 year ago |
Rohit Goswami
|
523a57f985
|
BLD: Add generic BLAS2 modes
|
1 year ago |
Rohit Goswami
|
e91b0216cd
|
ENH: Add more L2 flags
|
1 year ago |
Rohit Goswami
|
552f81045d
|
BLD: Add swap and refactor a bit
|
1 year ago |
Rohit Goswami
|
c76e7c6b95
|
TMP: Be more DRY
|
1 year ago |
Rohit Goswami
|
e9a3897174
|
ENH: Start abstracting rules for kernels
|
1 year ago |
Martin Kroeker
|
a875304eb0
|
fix inverted conditional for NAN handling
|
1 year ago |
Martin Kroeker
|
24acdd6bbb
|
correct offset
|
1 year ago |
Martin Kroeker
|
fb7c53c5e5
|
Merge pull request #4807 from martin-frbg/scalfixes
[WIP]Make NAN handling in the SCAL kernels depend on the dummy2 parameter
|
1 year ago |
Martin Kroeker
|
15c53dd2e0
|
Merge pull request #4794 from XiWeiGu/Fixed_Numpy_CI_Test
Try to fixed numpy ci test failures
|
1 year ago |
Martin Kroeker
|
a4e56e0452
|
Merge pull request #4806 from Mousius/small-gemm
Small GEMM for AArch64 with SVE
|
1 year ago |
yamazaki-mitsufumi
|
88caf02f62
|
Fix ambiguous error on Mac OS
|
1 year ago |
Martin Kroeker
|
b613754143
|
Update scal..c
|
1 year ago |
Martin Kroeker
|
f5d04318e3
|
Merge branch 'OpenMathLib:develop' into scalfixes
|
1 year ago |
Martin Kroeker
|
73f8866ffb
|
make NAN handling depend on DUMMY2 parameter
|
1 year ago |
Martin Kroeker
|
dfbc2348a8
|
fix NAN handling
|
1 year ago |
Martin Kroeker
|
c064319ecb
|
fix alpha=NAN case
|
1 year ago |
Martin Kroeker
|
c2ffd90e8c
|
make NAN handling depend on dummy2 parameter
|
1 year ago |
Chris Sidebottom
|
ea4ab3b310
|
Better header guard around bridge
|
1 year ago |