Skip to content
Discussion options

You must be logged in to vote

Hi @chenghuaWang ,

Based on QNN documentation, LPBQ quantization support both of Conv2D and Linear operation.
You can apply this patch to run unit test for Linear with LPBQ.

Reproduce Command:

python3 backends/qualcomm/tests/test_qnn_delegate.py TestQNNQuantizedOperator.test_qnn_backend_16a4w_per_block_linear -b build-android  -H ${HOST} -s ${DEVICE} -m SM8850 -r {EXECUTORCH_ROOT} -a unit_test

Patch:

diff --git a/backends/qualcomm/tests/test_qnn_delegate.py b/backends/qualcomm/tests/test_qnn_delegate.py
index c57dbbcc33..7267efbb36 100644
--- a/backends/qualcomm/tests/test_qnn_delegate.py
+++ b/backends/qualcomm/tests/test_qnn_delegate.py
@@ -2186,6 +2186,18 @@ class TestQNNQuantizedOper…

Replies: 3 comments 1 reply

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by chenghuaWang
Comment options

You must be logged in to vote
1 reply
@shewu-quic
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
partner: qualcomm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Qualcomm module: qnn Issues related to Qualcomm's QNN delegate and code under backends/qualcomm/
3 participants