Use this tvm2onnx integration to convert TVM models that have already been optimized and compiled into .onnx format for execution in ONNX Runtime. At a high level, the integration represents the TVM optimizations as a custom operator that ONNX Runtime can execute.
This project is designed to run in a Docker container and integrates with Visual Studio Code. In Visual Studio Code, install the Remote Development (ms-vscode-remote.vscode-remote-extensionpack) extension. With it installed, click the green status display in the lower left of VS Code and select "Reopen in Container". This builds the Docker container and launches an interactive session. Testing and the developer tools for linting and formatting are designed to work inside this container.
There is a tutorial that demonstrates a simple end-to-end example: tuning a model, converting the tuned model to ONNX using tvm2onnx, and then running inference on the TVM-in-ONNX model using onnxruntime.
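For orientation, the sketch below shows what the first step of that flow can look like using TVM's Relay ONNX frontend. This is an illustration only, not the tutorial's exact code: the model path, input name, input shape, and target are placeholders, and tuning is omitted for brevity (a real run would typically apply tuning records before building).

```python
import onnx
import tvm
from tvm import relay

# Load the source ONNX model (placeholder path).
onnx_model = onnx.load("model.onnx")

# Import into Relay; the input name and shape are placeholders and
# must match the actual model.
mod, params = relay.frontend.from_onnx(
    onnx_model, shape={"input": (1, 3, 224, 224)}
)

# Compile for the local CPU at full optimization. Tuning records,
# if any, would normally be applied around this build step.
with tvm.transform.PassContext(opt_level=3):
    lib = relay.build(mod, target="llvm", params=params)

# Export the compiled artifact so it can be packaged by tvm2onnx.
lib.export_library("model.so")
```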
Code development should be done in the Docker container, as it provides all project dependencies. While it may be possible to develop without the Docker container, that is not supported.
From the root development directory in the Docker container you can run:

- `make format` to format the code. Proper formatting is required for merge requests.
- `make lint` to lint the code. Passing lint is required for merge requests.
- `make test` to run the test suite.
Scripts are located in the `tvm2onnx/scripts` directory.
`onnx_package.py` packages a TVM model into a `.tvm.onnx` file, which is simply an ONNX model in tar format. The packaged model contains the code that implements the custom op, which is also stored inside the `.tvm.onnx` archive.
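Because the package is a plain tar archive, you can inspect its contents with the Python standard library. A minimal sketch, where the file name is a placeholder and the exact member names inside the archive are not specified here:

```python
import tarfile

# A .tvm.onnx package is a tar archive; listing its members shows the
# embedded ONNX model and the custom-op code shipped alongside it.
with tarfile.open("model.tvm.onnx") as tar:  # placeholder file name
    for member in tar.getmembers():
        print(member.name, member.size)
```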
This custom op library is registered with ONNX Runtime. The Python code below demonstrates using the custom op library:
```python
import onnxruntime

# Register the custom op library shipped inside the .tvm.onnx package
# before creating the inference session.
sess_options = onnxruntime.SessionOptions()
sess_options.register_custom_ops_library(<path_to_custom_op_library>)

engine = onnxruntime.InferenceSession(
    onnx_model_path,
    sess_options=sess_options,
)

# Run inference; output_names=None returns all model outputs.
output_data = engine.run(output_names=None, input_feed=input_data)
```
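Note that `register_custom_ops_library` must be called on the `SessionOptions` before the `InferenceSession` is created, since ONNX Runtime resolves the TVM custom op when it loads the model.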
`onnx_benchmark.py` runs an ONNX model in the `.tvm.onnx` file format.
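For reference, a hand-rolled latency measurement over an already-created session might look like the sketch below. This illustrates what such a benchmark does rather than the script's actual implementation; `engine` and `input_data` are assumed from the earlier example.

```python
import time

# Warm up so one-time costs (lazy initialization, caching) are
# excluded from the measurement.
for _ in range(10):
    engine.run(output_names=None, input_feed=input_data)

# Time a batch of runs and report the mean latency in milliseconds.
runs = 100
start = time.perf_counter()
for _ in range(runs):
    engine.run(output_names=None, input_feed=input_data)
elapsed = time.perf_counter() - start
print(f"mean latency: {elapsed / runs * 1000:.2f} ms")
```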