Skip to content

Conversation

@binneswa
Copy link

@binneswa binneswa commented Nov 5, 2025

Added SME1 FP32 fp32_mopa_2VLx2VL GEMM kernel support

@binneswa
Copy link
Author

Any update on this Pull Request ?

@morgolock
Copy link
Contributor

morgolock commented Nov 11, 2025

Hi @binneswa

Thanks for your contribution.

Could you please tell us more about the use case for this kernel? which model will use it, details of the workload (types, shapes,ect)

I'll discuss your PR with the team.

@binneswa
Copy link
Author

binneswa commented Nov 12, 2025

Hi @morgolock , Thanks
This kernel is being used in GeekBenchAi workloads, Device: sme1 devices and Os is Android
models like Mobilenet uses it,

@binneswa
Copy link
Author

Any update on this Pull Request ?

@morgolock
Copy link
Contributor

Hi @binneswa

We're looking into this. The problem is that these kernels are generated with an internal tool, we are considering uploading a new PR with the SME1 kernels for you to test and confirm that it solves the problem.

Hope this helps

@morgolock
Copy link
Contributor

Hi @binneswa

We have just merged PR 1216 implementing the sme1 kernels.

I'm closing your PR in favor of PR 1216.

Thanks for your contribution. If you experience any problems, please raise an issue https://github.com/ARM-software/ComputeLibrary/issues

Hope this helps

@morgolock morgolock closed this Nov 18, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants