| Author | SHA1 | Message | Date |
|---|---|---|---|
| | 7d46e31de1 | POWER10: Optimize dgemv_n<br>Handling as 4x8 with vector pairs gives better performance than existing code in POWER10. | 5 years ago |
| | f77b6a83f4 | dgemv optimization for POWER10<br>Making use of new vector pair POWER10 instructions in dgemv_n and dgemv_t. Also adding a new block 4x128 to make use of Matrix-Multiply Assist (MMA) feature introduced in POWER ISA v3.1. Tested on simulator and there are no new test failures. | 5 years ago |