[CUDAProvider] Graph Optimization output an invalid model #23118
Labels
core runtime
issues related to core runtime
model:transformer
issues related to a transformer model: BERT, GPT2, Hugging Face, Longformer, T5, etc.
Describe the issue
Bug Report
The initial onnx model was feed into
optimizer.optimize_mode
for optimization on CUDA and output the optimizaed model.However, when further loading the optimized model, It failed and threw "This is an invalid model. Graph output (v4_0) does not exist in the graph."
Expected Behavior:
The optimized model should be an valid model.
The TraceBack:
The original onnx model (part):
The optimized onnx model (part):
To reproduce
Step 1: Download the model via this link
Step 2: run the following script:
Urgency
No response
Platform
Linux
OS Version
Ubuntu 20.04
ONNX Runtime Installation
Built from Source
ONNX Runtime Version or Commit ID
5c1b7cc
ONNX Runtime API
Python
Architecture
X64
Execution Provider
CUDA
Execution Provider Library Version
No response
The text was updated successfully, but these errors were encountered: