Does Quantized ONNX Model Always Start with QuantizeLinear as the First Layer? #23064
Unanswered
nirdesh17 asked this question in Training Q&A
Replies: 0 comments
I have a question about the structure of quantized ONNX models.
When we quantize an ONNX model to `int8` or `uint8` using static quantization, is it guaranteed that the first layer of the quantized model will always be a `QuantizeLinear` node? Or does this depend on the specific quantization method or the tool used for the quantization process?
I'm trying to understand whether this is a standardized behavior for models quantized with static quantization or if it varies based on implementation details.
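For context, here is a minimal sketch of how this could be checked on a concrete model. It does not answer the general question; it only reports which operators consume the real graph inputs (those that are not initializers). The helper names `ops_consuming_graph_inputs`, `starts_with_quantize_linear`, and `check_model_file` are hypothetical and written for illustration, and the last one assumes the `onnx` Python package is installed.

```python
def ops_consuming_graph_inputs(graph):
    """Map each real graph input to the op types of the nodes that read it.

    `graph` is expected to look like an ONNX GraphProto: it has `.input`,
    `.initializer`, and `.node`, and each node has `.op_type` and `.input`.
    """
    # Older models may list weights among the graph inputs, so skip initializers.
    initializer_names = {init.name for init in graph.initializer}
    consumers = {
        value.name: []
        for value in graph.input
        if value.name not in initializer_names
    }
    for node in graph.node:
        for name in node.input:
            if name in consumers:
                consumers[name].append(node.op_type)
    return consumers


def starts_with_quantize_linear(graph):
    """True only if every real graph input feeds QuantizeLinear nodes and nothing else."""
    consumers = ops_consuming_graph_inputs(graph)
    return bool(consumers) and all(
        bool(ops) and all(op == "QuantizeLinear" for op in ops)
        for ops in consumers.values()
    )


def check_model_file(path):
    """Load an ONNX file and run the check (assumes `onnx` is installed)."""
    import onnx  # imported lazily so the helpers above work without it

    model = onnx.load(path)
    return starts_with_quantize_linear(model.graph)
```

Calling `check_model_file` on the output of different quantization tools or settings would show whether the result is the same across them for a given model.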
Any insights, explanations, or references to relevant documentation would be greatly appreciated.
Thank you!