Hi mllm team,
I'm conducting NPU-aware algorithm optimization research on a Snapdragon 8 Gen 3 device in Qualcomm Device Cloud (QDC). Because I don't have Hexagon SDK access and the cloud platform's compilation environment is constrained, I'm unable to compile the QNN backend from source.
Could you please provide:
- Pre-compiled QNN backend binaries for Android ARM64
- Or guidance on using QNN AOT compilation to avoid device-side operator package compilation
My setup:
- Device: QDC Snapdragon 8 Gen 3
- QNN SDK: v2.14 (device firmware V73)
- Goal: Run NPU-aware optimization experiments (token pruning, quantization, KV cache)
Models are already downloaded. I can work with any recent mllm version that's compatible.
Thank you!