
[Feature] [OIM] Make inference backend configurable #1525

Open
Yu-amd opened this issue Feb 11, 2025 · 0 comments
Labels: feature (New feature or request)

Yu-amd commented Feb 11, 2025

Priority: P2-High
OS type: Ubuntu
Hardware type: GPU-AMD
Running nodes: Single Node

Description

Based on a discussion with Zhiwei at Intel, I understand there are plans to create an OIM that selects the inference backend based on the model.

I'm opening this feature request to add the requirement that OIM must also allow users to configure and override the backend it selects. Not every inference backend is available on every hardware and software platform, so users need a way to manually set the inference backend that an OPEA workload actually uses.
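To make the request concrete, here is a minimal sketch of how automatic per-model selection could be combined with a user override. This is not the actual OIM API; the names (`BACKEND_DEFAULTS`, `select_backend`, the `OIM_INFERENCE_BACKEND` environment variable, and the listed backends) are hypothetical and only illustrate the precedence we are asking for (explicit override first, automatic choice second):

```python
import os

# Hypothetical mapping from model family to the backend OIM would pick by default.
BACKEND_DEFAULTS = {
    "llama": "vllm",
    "mistral": "tgi",
}

# Hypothetical set of backends supported on this platform.
SUPPORTED_BACKENDS = {"vllm", "tgi", "ollama"}


def select_backend(model_name: str) -> str:
    """Return the inference backend for a model, honoring a user override.

    The override (here the OIM_INFERENCE_BACKEND environment variable) takes
    precedence over the automatic per-model selection, so deployments on
    platforms where the default backend is unavailable can force a supported one.
    """
    override = os.getenv("OIM_INFERENCE_BACKEND")
    if override:
        if override not in SUPPORTED_BACKENDS:
            raise ValueError(f"Unsupported inference backend override: {override}")
        return override

    # Fall back to the automatic selection based on the model name.
    for family, backend in BACKEND_DEFAULTS.items():
        if family in model_name.lower():
            return backend
    return "vllm"  # generic default


if __name__ == "__main__":
    # With OIM_INFERENCE_BACKEND unset this prints "vllm"; setting it to "ollama"
    # would override the automatic choice.
    print(select_backend("meta-llama/Llama-3-8B"))
```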

Yu-amd added the feature (New feature or request) label on Feb 11, 2025
Yu-amd changed the title from [Feature] Make inference backend configurable to [Feature] [OIM] Make inference backend configurable on Feb 11, 2025