[CPU EP] Add blocked quantization to DequantizeLinear op kernel (#20901) · microsoft/onnxruntime@3ecb012

Commit

[CPU EP] Add blocked quantization to DequantizeLinear op kernel (#20901)

### Description
Added blocked quantization to DequantizeLinear op kernel. All existing
[input types and output
types](https://github.com/microsoft/onnxruntime/blob/main/docs/ContribOperators.md#commicrosoftdequantizelinear)
are supported. All axes are supported.

The implementation in the PR is naive - single thread and scalar
instructions. Multi-threading and vector instructions are planned in the
future based on the needs.


### Motivation and Context
onnx introduced blocked quantization in opset 21 for
[DequantizeLinear](https://github.com/microsoft/onnxruntime/blob/main/docs/ContribOperators.md#commicrosoftdequantizelinear).
This PR adds the spec support in onnx runtime.

Loading branch information

fajin-corp authored Jun 4, 2024

1 parent 5faeaf6 commit 3ecb012

0 comments on commit `3ecb012`

Please sign in to comment.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Commit

There are no files selected for viewing

0 comments on commit `3ecb012`

Commit

There are no files selected for viewing

0 comments on commit 3ecb012

0 comments on commit `3ecb012`