Skip to content

0.2.0: Gökkal

Latest
Compare
Choose a tag to compare
@vertexclique vertexclique released this 17 Nov 23:04
· 29 commits to master since this release

This release comes with:

  • ONNX interface
  • New asynchronous servicing methods
  • Shareable server runtime
  • Nuclei asynchronous runtime
  • Inferring input facts for frozen model
  • Improves throughput:
    • ~4.8361 GiB/s prediction throughput
    • 3_000 concurrent requests take ~4ms on average