OpenBLAS

78 MB

Branch: develop

Author	SHA1	Message	Date
Chris Sidebottom	ea2faf0c9a	Add optimized BGEMM for NEOVERSEN2 target This re-uses the existing NEOVERSEN2 8x4 `sbgemm` kernel to implement `bgemm`.	11 months ago
Ye Tao	38ee7c9301	Add dispatch of SBGEMVNKERNEL for NEOVERSEN2 and NEOVERSEV2	1 year ago
Annop Wongwathanarat	edaf51dd99	Add sbgemv_t_bfdot kernel for ARM64 This improves performance for sbgemv_t by up to 100x on NEOVERSEV1. The geometric mean speedup is ~61x for M=N=[2,512].	1 year ago
Matthias Langer	0050a9660b	Correctly detect ARM Neoverse V2 CPUs.	2 years ago