[Build] Issue with "include_ops_by_config" and DynamicQuantizeMatMul in WASM CPU execution provider #22761
Labels
build
build issues; typically submitted using template
platform:web
issues related to ONNX Runtime web; typically submitted using template
quantization
issues related to quantization
Describe the issue
Here's my required_operators_and_types.config file:
When I build onnxruntime for wasm without the config file, ORT inference works:
When I build onnxruntime for wasm with the config file, ORT inference gives the following error:
In the cmake output, we see the file that contains this operator being replaced:
In this replacement file, there are some lines related to DynamicQuantizeMatMul:
Urgency
No response
Target platform
WebAssembly
Build script
python ./vendor/onnxruntime/tools/ci_build/build.py
--build_dir="./build/build-ort-wasm-simd"
--config=MinSizeRel
--build_wasm_static_lib
--parallel
--minimal_build
--disable_ml_ops
--disable_rtti
--use_preinstalled_eigen
--eigen_path=$(realpath "./vendor/eigen")
--skip_tests
--enable_wasm_simd
#--include_ops_by_config="./onnx-models/required_operators_and_types.config"
#--enable_reduced_operator_type_support \
Error / output
Ort::Exception: Could not find an implementation for DynamicQuantizeMatMul(1) node with name '/crosstransformer/layers.0/self_attn/MatMul_quant'
Visual Studio Version
No response
GCC / Compiler Version
No response
The text was updated successfully, but these errors were encountered: