Add scalar support in ORT backend #213
Conversation
src/onnxruntime.cc
Outdated
        model_state_->MaxBatchSize(), true /* compare_exact */));
    // if max_batch_size == 0 and is a scalar tensor all the
    // dimensions specified must be equal to 1
    if (model_state_->MaxBatchSize() > 0 || iit->second.dims_.size() > 0) {
Summarizing the offline discussion: I'd rather see MaxBatchSize() removed from this "is_scalar" condition if it's not necessary. Maybe something like this:
const bool is_scalar = (iit->second.dims_.size() == 0);
if (is_scalar) {
  // Dimensional "volume" of Triton dims must be 1 for scalars.
  if (std::any_of(
          dims.begin(), dims.end(), [](int64_t dim) { return dim != 1; })) {
    // ERROR
  }
  scalar_outputs_[io_name] = dims;
} else {
  RETURN_IF_ERROR(CompareDimsSupported(...));
}
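For completeness, here is a minimal standalone sketch of that volume check. The function name and the printed-bool style are illustrative only, not the backend's actual code:

#include <algorithm>
#include <cstdint>
#include <iostream>
#include <vector>

// Returns true when the configured dims are invalid for a scalar tensor,
// i.e. when any dim differs from 1 (so the element-count "volume" != 1).
bool ScalarDimsInvalid(const std::vector<int64_t>& dims)
{
  return std::any_of(
      dims.begin(), dims.end(), [](int64_t dim) { return dim != 1; });
}

int main()
{
  std::cout << ScalarDimsInvalid({1, 1}) << "\n";  // 0: volume is 1, valid
  std::cout << ScalarDimsInvalid({1, 3}) << "\n";  // 1: volume is 3, reject
}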
fixed.
@@ -885,7 +885,9 @@ ModelState::AutoCompleteIO(const char* key, const OnnxTensorInfoMap& io_infos)
    triton::common::TritonJson::Value reshape_dims(
        ModelConfig(), triton::common::TritonJson::ValueType::ARRAY);
    RETURN_IF_ERROR(reshape.Add("shape", std::move(reshape_dims)));
    RETURN_IF_ERROR(io.Add("reshape", std::move(reshape)));
    if (MaxBatchSize() > 0) {
Can you add a comment about why reshape causes issues in the non-batching case, for future reference?
Actually, does this break any functionality for non-batching models that specify a "reshape" in their model config? Such as our densenet example: https://github.com/triton-inference-server/server/blob/main/docs/examples/model_repository/densenet_onnx/config.pbtxt
It looks like this is restricted to (1) autocomplete and (2) empty dims, but just double-checking.
Added a comment.
I don't think it would break them, since those models all have at least one dimension.
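A sketch of what the guarded block with such a comment could look like, based on the hunk above (the comment wording and the `reshape` OBJECT declaration are reconstructed here, not necessarily the committed code):

// Illustrative comment, not the committed wording: only add the empty
// "reshape" when batching is enabled; for non-batching models
// (max_batch_size == 0) the empty reshape caused issues with scalar
// handling, per the discussion above.
if (MaxBatchSize() > 0) {
  triton::common::TritonJson::Value reshape(
      ModelConfig(), triton::common::TritonJson::ValueType::OBJECT);
  triton::common::TritonJson::Value reshape_dims(
      ModelConfig(), triton::common::TritonJson::ValueType::ARRAY);
  RETURN_IF_ERROR(reshape.Add("shape", std::move(reshape_dims)));
  RETURN_IF_ERROR(io.Add("reshape", std::move(reshape)));
}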
src/onnxruntime.cc
Outdated
@@ -2283,6 +2352,22 @@ ModelInstanceState::ReadOutputTensors(
        batchn_shape, dtype, output_tensor, &output_buffer, string_buffers,
        offsets));

    // If the number of dimensions is equal to zero, it means that it is a
    // scalar and it would use the dimensions specified in the mdel
Suggested change:
- // scalar and it would use the dimensions specified in the mdel
+ // scalar and it would use the dimensions specified in the model
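For context, the scalar path in ReadOutputTensors presumably looks something like this sketch. `batchn_shape` comes from the hunk above and `scalar_outputs_` is the map proposed earlier in the review; the `io_name` lookup is assumed, and the exact code is in the PR diff:

// Sketch only: if ORT reports a rank-0 (scalar) output, fall back to the
// dims recorded from the model config so Triton reports the configured
// shape instead of an empty one.
if (batchn_shape.size() == 0) {
  const auto& config_dims = scalar_outputs_[io_name];  // assumed lookup
  batchn_shape.assign(config_dims.begin(), config_dims.end());
}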
testing: triton-inference-server/server#6343