Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[CPU EP] Add blocked quantization to DequantizeLinear op kernel (#20901)
### Description Added blocked quantization to DequantizeLinear op kernel. All existing [input types and output types](https://github.com/microsoft/onnxruntime/blob/main/docs/ContribOperators.md#commicrosoftdequantizelinear) are supported. All axes are supported. The implementation in the PR is naive - single thread and scalar instructions. Multi-threading and vector instructions are planned in the future based on the needs. ### Motivation and Context onnx introduced blocked quantization in opset 21 for [DequantizeLinear](https://github.com/microsoft/onnxruntime/blob/main/docs/ContribOperators.md#commicrosoftdequantizelinear). This PR adds the spec support in onnx runtime.
- Loading branch information