Replies: 3 comments 2 replies
-
That is probably the crossover point where OpenBLAS "decides" the problem is large enough to switch to multithreading (and in case 1 it makes that decision too early). I need to look into this...
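To make the idea of a crossover point concrete, here is a minimal sketch of a size-based threading decision. It is purely illustrative and is not OpenBLAS's actual code: the function name `choose_num_threads`, the use of `m*n*k` as the work estimate, and the `threshold` parameter are all assumptions made up for this example.

```c
#include <stddef.h>

/* Hypothetical heuristic, NOT taken from OpenBLAS:
 * run single-threaded while the estimated work (m * n * k) stays below
 * `threshold`, otherwise hand the problem to all available threads. */
int choose_num_threads(size_t m, size_t n, size_t k,
                       double threshold, int max_threads)
{
    /* Use double so the product cannot overflow for large dimensions. */
    double work = (double)m * (double)n * (double)k;

    /* Guard against a nonsensical thread count. */
    if (max_threads < 1)
        max_threads = 1;

    return (work < threshold) ? 1 : max_threads;
}
```

With a made-up threshold of 1e6 and 8 threads, a 99x100x100 problem stays on one thread while 100x100x100 jumps to eight, so a single extra row can flip the decision. If the threshold is set too low, the threading overhead can outweigh the gain just past that point. One way to check whether this is what is happening here is to rerun the test with `OPENBLAS_NUM_THREADS=1` and see whether the jump in run time disappears.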
-
This is the complete, runnable test code.
-
@martin-frbg Maybe it's the matrix transpose that changes the return value of GEMM_SMALL_MATRIX_PERMIT.
Am I on the right track? My knowledge of matrices is weak, so I am trying to reason about it from the code.
-
I found something strange about the dgemm_ function.
This is pseudocode; the full code is given later.
Strange results:
Can anyone explain this phenomenon? Just adding one row or one column makes the run time several times longer.
env: