Skip to content

Switch to libxsmm JIT backend #26

@sebwolf-de

Description

@sebwolf-de

Problem: The current approach using libxsmm_gemm_generator can only generate GEMMs with alpha = +/-1. If YATeTo encounters a GEMM with |alpha| != 1, it falls back to default code (nested for loops), which is not performant.
Solution: Use the new libxsmm interface libxsmm_?gemm(transa, transb, m, n, k, alpha, a, lda, b, ldb, beta, c, ldc);

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions