New pipeline save/load features #14

shijianjian · 2023-04-12T16:16:59Z

This would be an ideal feature, though I am not sure whether it belongs at the app level or the pipeline level. A proper save/load could let limbus play a role similar to ONNX.

1. Save and Load

Requesting a feature to export/save and import/load a built Pipeline.

Format Candidates:

joblib, recommended by scikit-learn. https://scikit-learn.org/stable/model_persistence.html

MyPipeline().save("myapp.someformat")

Then one can load and use it with:

Load("myapp.someformat").run(1)
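A minimal sketch of the save/load round trip described above. `ToyPipeline`, its `save`/`load` methods, and the `offset` state are hypothetical names for illustration only, not the limbus API. The standard-library `pickle` is used here; `joblib.dump` and `joblib.load` could be swapped in at the same two places.

```python
import pickle
import tempfile
from pathlib import Path


class ToyPipeline:
    """Hypothetical stand-in for a built pipeline; not the limbus API."""

    def __init__(self, offset: int = 1) -> None:
        self.offset = offset

    def run(self, x: int) -> int:
        return x + self.offset

    def save(self, path: str) -> None:
        # Persist only plain state, so the file holds no class references.
        with open(path, "wb") as f:
            pickle.dump({"offset": self.offset}, f)

    @classmethod
    def load(cls, path: str) -> "ToyPipeline":
        # Only unpickle files you trust: pickle can execute arbitrary code.
        with open(path, "rb") as f:
            state = pickle.load(f)
        return cls(**state)


def roundtrip(offset: int, x: int) -> int:
    """Save a pipeline to a temporary file, load it back, and run it."""
    with tempfile.TemporaryDirectory() as tmp:
        path = str(Path(tmp) / "myapp.someformat")
        ToyPipeline(offset).save(path)
        return ToyPipeline.load(path).run(x)
```

Storing a plain state dictionary rather than the object itself is one possible design choice: the file then does not depend on the import path of the pipeline class, at the cost of writing the state mapping by hand.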

2. Input/Output Information

To integrate the exported app/pipeline into other applications, it would help to include input and output information, e.g. pipeline.graph.input and pipeline.graph.output.
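One way such input/output information could be represented is sketched below. `IOSpec`, `GraphInfo`, `describe`, and `matches` are hypothetical names, and the convention that `None` stands for a free dimension (such as the batch size) is an assumption, loosely modelled on how ONNX exposes graph inputs and outputs.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

Shape = Optional[Tuple[Optional[int], ...]]


@dataclass(frozen=True)
class IOSpec:
    """Hypothetical description of one pipeline input or output."""
    name: str
    shape: Shape  # None = unknown shape; a None entry = free dimension


@dataclass
class GraphInfo:
    input: List[IOSpec]
    output: List[IOSpec]


def format_spec(spec: IOSpec) -> str:
    if spec.shape is None:
        return f"{spec.name}[?]"
    dims = "x".join("B" if d is None else str(d) for d in spec.shape)
    return f"{spec.name}[{dims}]"


def describe(graph: GraphInfo) -> str:
    """Render the I/O signature so a host application can inspect it."""
    ins = ", ".join(format_spec(s) for s in graph.input)
    outs = ", ".join(format_spec(s) for s in graph.output)
    return f"({ins}) -> ({outs})"


def matches(spec: IOSpec, shape: Tuple[int, ...]) -> bool:
    """Check a concrete shape against a spec; free dimensions match anything."""
    if spec.shape is None:
        return True
    if len(shape) != len(spec.shape):
        return False
    return all(e is None or e == a for e, a in zip(spec.shape, shape))


graph = GraphInfo(
    input=[IOSpec("c1", (None,)), IOSpec("t1", (None, 3)), IOSpec("t2", (None, 3))],
    output=[IOSpec("o", None)],
)
```

A host application could call `describe(graph)` to display the signature and `matches` to reject inputs with the wrong shape before running the pipeline.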

3. Accepts I/O

The built app/pipeline should accept inputs and return outputs directly.

4. A flag for whether to package models into the output

Either we support a single, very large xxx.pipeline file,

or a file structure like:

XXX.pipeline
- model_a.onnx
- model_b.tensorrt
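The two layouts could be selected with a single flag, as in the sketch below. Everything here is an assumption for illustration: the `save_pipeline` function, the `package_models` flag name, the `graph.json` entry, and the choice of a zip archive for the single-file case and a directory for the split case.

```python
import json
import tempfile
import zipfile
from pathlib import Path
from typing import Dict, List, Tuple


def save_pipeline(path: Path, graph: dict, models: Dict[str, bytes],
                  package_models: bool) -> List[str]:
    """Write a pipeline and return the sorted names of the entries created."""
    if package_models:
        # One self-contained archive: graph description plus every model.
        with zipfile.ZipFile(path, "w") as zf:
            zf.writestr("graph.json", json.dumps(graph))
            for name, blob in models.items():
                zf.writestr(name, blob)
        return [path.name]
    # Directory layout: XXX.pipeline/ holding the graph and separate model files.
    path.mkdir()
    (path / "graph.json").write_text(json.dumps(graph))
    for name, blob in models.items():
        (path / name).write_bytes(blob)
    return sorted(p.name for p in path.iterdir())


def demo() -> Tuple[List[str], List[str], List[str]]:
    graph = {"nodes": ["model_a", "model_b"]}
    # Placeholder bytes; real model files would be read from disk.
    models = {"model_a.onnx": b"a", "model_b.tensorrt": b"b"}
    with tempfile.TemporaryDirectory() as tmp:
        single = save_pipeline(Path(tmp) / "big.pipeline", graph, models, True)
        with zipfile.ZipFile(Path(tmp) / "big.pipeline") as zf:
            packed = sorted(zf.namelist())
        split = save_pipeline(Path(tmp) / "XXX.pipeline", graph, models, False)
    return single, packed, split
```

The single archive is easier to ship, while the split layout lets large model files be replaced or shared without rewriting the whole pipeline file.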

A Proposal

Maybe we could have Input and Output components.

For example:

self.c1 = Input("c1", shape=(B,))
self.t1 = Input("t1", shape=(B, 3))
self.t2 = Input("t2", shape=(B, 3))
self.stack = stack("stack")
self.out = Output("o", shape=None)

self.c1 >> self.stack.inputs.dim
self.t1 >> self.stack.inputs.tensors.select(0)
self.t2 >> self.stack.inputs.tensors.select(1)
self.stack.outputs.out >> self.out

pipeline = Pipeline()
pipeline.add_nodes([self.c1, self.t1, self.t2, self.stack, self.out])

pipeline.save("mypipeline.pipeline")

# Adding some ONNX-ish APIs
pipeline.graph.input  # returns [c1, t1, t2] and their shapes
pipeline.graph.output  # returns [o] and its shape

out = pipeline.exec([torch.tensor([0.]), np.array([[1, 2, 3.]]), ...])


tp-nan · 2024-01-15T02:57:13Z

> Format Candidates:
>
> joblib, recommended by scikit-learn. https://scikit-learn.org/stable/model_persistence.html
> MyPipeline().save("myapp.someformat")

That's an interesting feature. Is there any detail for how to implement this feature?

shijianjian · 2024-01-15T06:42:55Z

> > Format Candidates:
> >
> > joblib, recommended by scikit-learn. https://scikit-learn.org/stable/model_persistence.html
> > MyPipeline().save("myapp.someformat")
>
> That's an interesting feature. Is there any detail for how to implement this feature?

It should not be too complex if all the classes are serializable; the result is more or less just a pickle file. Since some model files may be involved, we could also embed those ONNX files in the pickled object.
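A rough sketch of that idea, assuming the pipeline state is a plain picklable dictionary. The function names and the payload keys `state` and `models` are hypothetical, and the model file in the demo holds placeholder bytes rather than a real ONNX model.

```python
import pickle
import tempfile
from pathlib import Path
from typing import List, Tuple


def dump_with_models(out_path: Path, state: dict, model_paths: List[Path]) -> None:
    """Pickle the pipeline state together with the raw bytes of each model file."""
    payload = {
        "state": state,
        "models": {p.name: p.read_bytes() for p in model_paths},
    }
    with open(out_path, "wb") as f:
        pickle.dump(payload, f)


def load_with_models(in_path: Path, model_dir: Path) -> dict:
    """Unpickle the payload and restore the model files into model_dir."""
    # Only unpickle files you trust: pickle can execute arbitrary code.
    with open(in_path, "rb") as f:
        payload = pickle.load(f)
    for name, blob in payload["models"].items():
        (model_dir / name).write_bytes(blob)
    return payload["state"]


def demo() -> Tuple[dict, bytes]:
    with tempfile.TemporaryDirectory() as tmp:
        src, dst = Path(tmp) / "src", Path(tmp) / "dst"
        src.mkdir()
        dst.mkdir()
        model = src / "model_a.onnx"
        model.write_bytes(b"placeholder-model-bytes")
        out = Path(tmp) / "mypipeline.pipeline"
        dump_with_models(out, {"nodes": ["stack"]}, [model])
        state = load_with_models(out, dst)
        return state, (dst / "model_a.onnx").read_bytes()
```

Embedding the bytes keeps everything in one file, but the whole payload is held in memory while pickling, which may matter for very large models.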

edgarriba · 2024-01-15T08:55:51Z

I think @lferraz was already serializing pipelines somehow.

shijianjian added the enhancement (New feature or request) and help wanted (Extra attention is needed) labels on Apr 12, 2023
