
[CUDAProvider] Graph Optimization output an invalid model #23118

Open
Cookiee235 opened this issue Dec 16, 2024 · 2 comments
Labels
core runtime (issues related to core runtime)
model:transformer (issues related to a transformer model: BERT, GPT2, Hugging Face, Longformer, T5, etc.)

Comments

@Cookiee235

Describe the issue

Bug Report

The initial ONNX model was fed into optimizer.optimize_model for optimization on CUDA, which produced the optimized model.
However, loading the optimized model then failed with "This is an invalid model. Graph output (v4_0) does not exist in the graph."

Expected Behavior:

The optimized model should be a valid model.

The traceback:

Traceback (most recent call last):
  File "/share_container/optfuzz/ONNX/bugs/bug8.py", line 10, in <module>
    optimized_session = ort.InferenceSession(optimized_model_path)
                        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/software/onnxruntime/build/Linux/Release/onnxruntime/capi/onnxruntime_inference_collection.py", line 465, in __init__
    self._create_inference_session(providers, provider_options, disabled_optimizers)
  File "/software/onnxruntime/build/Linux/Release/onnxruntime/capi/onnxruntime_inference_collection.py", line 526, in _create_inference_session
    sess = C.InferenceSession(session_options, self._model_path, True, self._read_config_from_model)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
onnxruntime.capi.onnxruntime_pybind11_state.Fail: [ONNXRuntimeError] : 1 : FAIL : Load model from ./opt.onnx failed:/software/onnxruntime/onnxruntime/core/graph/graph.cc:1467 void onnxruntime::Graph::InitializeStateFromModelFileGraphProto() This is an invalid model. Graph output (v4_0) does not exist in the graph.

The original ONNX model (part): [image]

The optimized ONNX model (part): [image]

To reproduce

Step 1: Download the model via this link
Step 2: run the following script:

import onnx
import onnxruntime as ort
from onnxruntime.transformers import optimizer

model_path = "duplicate_output.onnx"
optimized_model_path = "./opt.onnx"
optimized_model = optimizer.optimize_model(model_path, opt_level=1, use_gpu=True)  # opt_level=1 removes the duplicate output
optimized_model.save_model_to_file(optimized_model_path)
print(onnx.load(optimized_model_path).graph.output)  # the dangling "v4_0" entry should have been removed from graph.output as well
optimized_session = ort.InferenceSession(optimized_model_path)
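
Until this is fixed, a possible workaround (a minimal sketch; the file names match the script above) is to prune any graph output that nothing in the optimized graph produces before creating the session:

import onnx

# Load the (invalid) optimized model and drop dangling graph outputs.
opt = onnx.load("./opt.onnx")
graph = opt.graph

# Collect every value name that actually exists in the optimized graph:
# node outputs, graph inputs, and initializers.
produced = {out for node in graph.node for out in node.output}
produced.update(i.name for i in graph.input)
produced.update(init.name for init in graph.initializer)

# Keep only the graph outputs (e.g. "v4_0") that still have a producer.
kept = [o for o in graph.output if o.name in produced]
del graph.output[:]
graph.output.extend(kept)

onnx.save(opt, "./opt_fixed.onnx")  # this copy should now load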

Urgency

No response

Platform

Linux

OS Version

Ubuntu 20.04

ONNX Runtime Installation

Built from Source

ONNX Runtime Version or Commit ID

5c1b7cc

ONNX Runtime API

Python

Architecture

X64

Execution Provider

CUDA

Execution Provider Library Version

No response

@Cookiee235 Cookiee235 changed the title Graph Optimization output an invalid model [CUDAProvider] Graph Optimization output an invalid model Dec 16, 2024
@github-actions github-actions bot added the model:transformer issues related to a transformer model: BERT, GPT2, Hugging Face, Longformer, T5, etc. label Dec 16, 2024
@yuslepukhin yuslepukhin added the core runtime issues related to core runtime label Dec 16, 2024
@xadupre
Member

xadupre commented Dec 17, 2024

In your model, the operator Div computes x/x. That should reduce to a constant 1 (for non-zero x). You could replace this operator with ConstantOfShape(Shape(x)) in your case.
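
For illustration, a minimal sketch of that substitution with the onnx.helper API (the names "x", "y", "x_shape" and the float dtype are assumptions about your model):

from onnx import helper, TensorProto

# Instead of: y = Div(x, x)
# emit a tensor of ones with the same shape as x.
shape_node = helper.make_node("Shape", inputs=["x"], outputs=["x_shape"])
ones_node = helper.make_node(
    "ConstantOfShape",
    inputs=["x_shape"],
    outputs=["y"],
    # ConstantOfShape fills with zeros by default, so the value attribute is needed here.
    value=helper.make_tensor("one", TensorProto.FLOAT, [1], [1.0]),
)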

@Cookiee235
Author

@xadupre Thank you for your suggestion. With your guidance, optimizing this model no longer crashes.

However, even if the given model is technically invalid, ONNX Runtime should ideally produce a valid optimized model instead of crashing unexpectedly. Fixing this issue in the ONNX Runtime source code would not only prevent such crashes but also ensure that a valid optimized model is generated. This would be an exciting improvement!

@xadupre Do you think we should fix this issue in ONNX Runtime? Thanks a lot!
