How to use dynamic batch features #199

hudengjunai · 2023-03-01T06:34:35Z

Hello, I have launched opt-125M inference and am sending requests to the server with locust. But whatever value I set for max_batch_size, the InferenceEngine always runs with batch_size = 1. How can I use the dynamic batching feature in Batch_server_manager?

ver217 · 2023-03-20T06:26:58Z

Requests are batched only when a sequence of inputs can be batched together, e.g. when they have the same number of generation steps.
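
To illustrate the idea, here is a minimal sketch of grouping by generation steps. It is not the actual Batch_server_manager code: the `Request` class, its `max_tokens` field, and the `make_batches` helper are hypothetical names used only for this example, and it assumes the generation-step count is the only batching criterion.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Request:
    # Hypothetical request record, not a type from the library.
    prompt: str
    max_tokens: int  # number of generation steps requested


def make_batches(pending, max_batch_size):
    """Group pending requests by generation steps, then cap each batch."""
    groups = defaultdict(list)
    for req in pending:
        # Only requests with the same step count can share a batch.
        groups[req.max_tokens].append(req)

    batches = []
    for same_steps in groups.values():
        # Split each group into chunks of at most max_batch_size.
        for i in range(0, len(same_steps), max_batch_size):
            batches.append(same_steps[i:i + max_batch_size])
    return batches


pending = [
    Request("a", 32),
    Request("b", 64),
    Request("c", 32),
    Request("d", 32),
]
batches = make_batches(pending, max_batch_size=2)
batch_sizes = [len(b) for b in batches]
```

Under this assumption, a load test in which every request asks for a different number of generation steps would always produce batches of size 1, whatever max_batch_size is set to. Sending concurrent requests with the same generation length should let them be grouped.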
