I trained an x2 model and then fine-tuned it with QAT using the command below:
Then I converted it to an ONNX model using the command below: "python -m tf2onnx.convert --saved-model
Then I opened this ONNX model in Netron:
I want to quantize this ONNX model using OpenVINO/TNN/MNN. My question is: do I need to remove the
QuantizeLinear and DequantizeLinear nodes in the red box first and then do the quantization?
Or should I just run the quantization, and OpenVINO/TNN/MNN will remove them automatically?
I also checked the TFLite model (generated with generate_tflite.py) after converting it to ONNX. It seems that the quantized TFLite/ONNX model also contains QuantizeLinear and DequantizeLinear nodes. Is this normal?
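For reference, here is a minimal sketch of what a QuantizeLinear/DequantizeLinear pair computes, assuming per-tensor int8 quantization. The function names and the scale/zero-point values are illustrative only and are not taken from the model above.

```python
def quantize_linear(x, scale, zero_point, qmin=-128, qmax=127):
    """Float -> int8 code: saturate(round(x / scale) + zero_point)."""
    # Python's round() rounds halves to the nearest even integer,
    # which matches the rounding mode in the ONNX operator definition.
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))


def dequantize_linear(q, scale, zero_point):
    """int8 code -> float: (q - zero_point) * scale."""
    return (q - zero_point) * scale


def fake_quantize(x, scale, zero_point):
    """A Q/DQ pair back to back: the output is still a float, but it is
    snapped to the nearest value the int8 grid can represent."""
    q = quantize_linear(x, scale, zero_point)
    return dequantize_linear(q, scale, zero_point)


if __name__ == "__main__":
    # Assumed example values: scale = 0.5, zero_point = 0.
    print(fake_quantize(1.3, 0.5, 0))     # value snapped to the 0.5 grid
    print(fake_quantize(1000.0, 0.5, 0))  # value saturated at the int8 maximum
```

As far as I understand, the pair leaves the tensor in floating point and only records the scale and zero point that QAT learned, which may be why these nodes show up in the exported graph.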