Enable pointer-generator T5 models in BeamSearch #23134

amancini-N · 2024-12-17T11:35:49Z

Description

Introduces a new optional input (encoder_ibnput_ids) in the decoder graph of the T5 implementation for BeamSearch. This allows usage of pointer generator networks in decoder graph.

Motivation and Context

Fixes Support pointer-generator in BeamSearch op #23123

onnxruntime/test/testdata/dummy_t5_model_generator.py

onnxruntime/contrib_ops/cpu/transformers/subgraph_t5_decoder.cc

tianleiwu · 2024-12-17T18:40:42Z

onnxruntime/contrib_ops/cpu/transformers/subgraph_t5_decoder.cc


-  ORT_RETURN_IF(first_past_input_index_ != 2 && first_past_input_index_ != 3,
-                "kFirstPastInputIndex currently only supports 2 or 3");
+  ORT_RETURN_IF(first_past_input_index_ != 2 && first_past_input_index_ != 3 && first_past_input_index_ != 4,


From SetPastInputIndex implementation, this assertion of first_past_input_index_ seems always True so we can remove it.

sure, will do

tianleiwu · 2024-12-17T19:19:07Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline

tianleiwu · 2024-12-17T19:19:08Z

/azp run Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-linux-gpu-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Linux Android Emulator QNN CI Pipeline,Android CI Pipeline

tianleiwu · 2024-12-17T19:19:10Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline,CoreML CI Pipeline,Linux DNNL CI Pipeline,Linux MIGraphX CI Pipeline,Linux ROCm CI Pipeline

azure-pipelines · 2024-12-17T19:19:36Z

Azure Pipelines successfully started running 6 pipeline(s).

azure-pipelines · 2024-12-17T19:19:45Z

Azure Pipelines successfully started running 9 pipeline(s).

azure-pipelines · 2024-12-17T19:19:47Z

Azure Pipelines successfully started running 10 pipeline(s).

amancini-N · 2024-12-18T15:35:11Z

@tianleiwu I don't think I got which is the problem on the iOS failure. All the involved tests seems passing there. Do you have some insights?

tianleiwu · 2024-12-18T18:08:59Z

onnxruntime/contrib_ops/cpu/transformers/subgraph_t5_decoder.cc

-                  subgraph_inputs[2]->Name());
+  const int enc_attn_mask_index = 1 + has_encoder_input_ids_;
+  const int enc_hidden_state_index = enc_attn_mask_index + 1;
+  if (has_encoder_input_ids_) {


remove this check since the definition has_encoder_input_ids = subgraph_inputs[1]->Name() == "encoder_input_ids" so this is not necessary.

tianleiwu · 2024-12-18T18:12:41Z

onnxruntime/contrib_ops/cpu/transformers/subgraph_t5_decoder.cc

@@ -49,11 +49,12 @@ namespace transformers {

 Status T5DecoderSubgraph::Validate(const std::vector<const NodeArg*>& subgraph_inputs,
                                   const std::vector<const NodeArg*>& subgraph_outputs) {
-  bool has_hidden_state = subgraph_inputs[2]->Name() == "encoder_hidden_states" ? true : false;
-  SetPastInputIndex(has_hidden_state);
+  bool has_encoder_input_ids = subgraph_inputs[1]->Name() == "encoder_input_ids";


Recommend to add a comment about example inputs:
input_ids, encoder_input_ids (optional), encoder_attention_mask, encoder_hidden_states (optional),
past_self_key_0, past_self_value_0, past_cross_key_0, past_cross_value_0,
...

tianleiwu · 2024-12-18T18:30:01Z

onnxruntime/contrib_ops/cpu/transformers/subgraph_t5_decoder.cc

@@ -238,7 +268,7 @@ Status T5DecoderSubgraph::CreateInitialFeeds(
  // When first_past_input_index_ == 3, the encoder_hidden_states and past states are copied from the second output
  // of encoder.
  // When first_past_input_index_ == 2, the past states are copied from the second output of encoder.
-  for (size_t j = static_cast<size_t>(4) - first_past_input_index_; j < encoder_fetches.size(); j++) {
+  for (size_t j = static_cast<size_t>(2) - has_hidden_state_; j < encoder_fetches.size(); j++) {


The decoder inputs input_ids, encoder_input_ids (optional), encoder_attention_mask, encoder_hidden_states (optional), past_self_key_0, past_self_value_0, past_cross_key_0, past_cross_value_0, ....

This loop is used to add feeds for encoder_hidden_states (optional), past_self_key_0, past_self_value_0, past_cross_key_0, past_cross_value_0, ... from encoder output.

The encoder output is like logits, encoder_hidden_states (optional), past_self_value_0, past_cross_key_0, past_cross_value_0, ... so j shall start from 1 (the second output). Here we assume that, if encoder hidden state is not used in decoder, we shall not output it in encoder for best performance.

I understand that we might also need change some code in encoder output validation to make sure all outputs are used by decoder. That means, if encoder_hidden_states is not used by decoder, it shall not exist in encoder output.

Another possible implementation is to use name to match then construct a mapping from encoder input/output index to decoder input index. That could be more flexible.

Suggest to update the comment before this loop.

Enable pointer-generator T5 models in BeamSearch

ca1c474

github-advanced-security bot found potential problems Dec 17, 2024

View reviewed changes

onnxruntime/test/testdata/dummy_t5_model_generator.py Fixed Show fixed Hide fixed

onnxruntime/test/testdata/dummy_t5_model_generator.py Fixed Show fixed Hide fixed

onnxruntime/test/testdata/dummy_t5_model_generator.py Fixed Show fixed Hide fixed

Linting changes

c350042

github-advanced-security bot found potential problems Dec 17, 2024

View reviewed changes

onnxruntime/contrib_ops/cpu/transformers/subgraph_t5_decoder.cc Dismissed Show dismissed Hide dismissed

tianleiwu reviewed Dec 17, 2024

View reviewed changes

tianleiwu reviewed Dec 18, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable pointer-generator T5 models in BeamSearch #23134

Enable pointer-generator T5 models in BeamSearch #23134

amancini-N commented Dec 17, 2024

tianleiwu Dec 17, 2024

amancini-N Dec 18, 2024

tianleiwu commented Dec 17, 2024

tianleiwu commented Dec 17, 2024

tianleiwu commented Dec 17, 2024

azure-pipelines bot commented Dec 17, 2024

azure-pipelines bot commented Dec 17, 2024

azure-pipelines bot commented Dec 17, 2024

amancini-N commented Dec 18, 2024

tianleiwu Dec 18, 2024 •

edited

Loading

tianleiwu Dec 18, 2024 •

edited

Loading

tianleiwu Dec 18, 2024 •

edited

Loading

Enable pointer-generator T5 models in BeamSearch #23134

Are you sure you want to change the base?

Enable pointer-generator T5 models in BeamSearch #23134

Conversation

amancini-N commented Dec 17, 2024

Description

Motivation and Context

tianleiwu Dec 17, 2024

Choose a reason for hiding this comment

amancini-N Dec 18, 2024

Choose a reason for hiding this comment

tianleiwu commented Dec 17, 2024

tianleiwu commented Dec 17, 2024

tianleiwu commented Dec 17, 2024

azure-pipelines bot commented Dec 17, 2024

azure-pipelines bot commented Dec 17, 2024

azure-pipelines bot commented Dec 17, 2024

amancini-N commented Dec 18, 2024

tianleiwu Dec 18, 2024 • edited Loading

Choose a reason for hiding this comment

tianleiwu Dec 18, 2024 • edited Loading

Choose a reason for hiding this comment

tianleiwu Dec 18, 2024 • edited Loading

Choose a reason for hiding this comment

tianleiwu Dec 18, 2024 •

edited

Loading

tianleiwu Dec 18, 2024 •

edited

Loading

tianleiwu Dec 18, 2024 •

edited

Loading