Constant fold initializer for DQ node #23366

chilo-ms · 2025-01-14T23:25:13Z

Some NPUs require weights/initializers to be in FP32, FP16, INT8, UINT8 and INT4 if consumed by Q/DQ nodes.
In other words, ORT needs to dequantize "specific data type" initializers to FP32 for them.

This PR leverages ORT ConstantFolding optimizer to dequantize initializer for DQ node if the initializer has a specific data type.

github-actions

You can commit the suggested changes from lintrunner.

github-actions · 2025-01-14T23:30:55Z

include/onnxruntime/core/session/onnxruntime_session_options_config_keys.h

+// Dequantize initializer using ORT ConstantFolding optimizer for dq node if initializer has specific(? TBD) data type.
+// This feature is required by some NPU's. 
+// "0": disable. ORT doesn't constant fold the DQ node. [DEFAULT]


Suggested change

// Dequantize initializer using ORT ConstantFolding optimizer for dq node if initializer has specific(? TBD) data type.

// This feature is required by some NPU's.

// "0": disable. ORT doesn't constant fold the DQ node. [DEFAULT]

// Dequantize initializer using ORT ConstantFolding optimizer for dq node if initializer has specific(? TBD) data type.

// This feature is required by some NPU's.

// "0": disable. ORT doesn't constant fold the DQ node. [DEFAULT]

github-actions · 2025-01-14T23:30:55Z

onnxruntime/test/optimizer/graph_transform_test.cc

+                                        false /*skip_dequantize_linear*/,
+                                        false /*dequantize_initializer_for_dequantize_linear*/, 
+                                        empty_config_options),


Suggested change

false /*skip_dequantize_linear*/,

false /*dequantize_initializer_for_dequantize_linear*/,

empty_config_options),

false /*skip_dequantize_linear*/,

false /*dequantize_initializer_for_dequantize_linear*/,

empty_config_options),

chilo-ms · 2025-01-15T00:26:10Z

In the selection of DQ nodes to constant fold, how can user/EP specify which data type of initializers to be considered?

constant fold initializer for QD

bbb5862

github-actions bot reviewed Jan 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Constant fold initializer for DQ node #23366

Constant fold initializer for DQ node #23366

chilo-ms commented Jan 14, 2025

github-actions bot left a comment

github-actions bot Jan 14, 2025

github-actions bot Jan 14, 2025

chilo-ms commented Jan 15, 2025

Constant fold initializer for DQ node #23366

Are you sure you want to change the base?

Constant fold initializer for DQ node #23366

Conversation

chilo-ms commented Jan 14, 2025

github-actions bot left a comment

Choose a reason for hiding this comment

github-actions bot Jan 14, 2025

Choose a reason for hiding this comment

github-actions bot Jan 14, 2025

Choose a reason for hiding this comment

chilo-ms commented Jan 15, 2025