I'm interested in using FasterTransformer to accelerate LLM deployment on CoreWeave, and by following this guide I've successfully deployed an inference service on a single GPU.
After looking further into FasterTransformer, I would like to run my inference service on multiple GPUs. Could another guide be provided that covers multi-GPU deployment? A sketch of what I imagine the multi-GPU configuration might look like follows.
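For context, here is a minimal sketch of what I assume a multi-GPU variant of the guide's InferenceService would involve. This is not from the guide itself: the `InferenceService` layout, the `nvidia.com/gpu` resource key, and the container structure are standard KServe/Kubernetes conventions, the image name is a placeholder, and `tensor_para_size` is the FasterTransformer Triton backend parameter that (as I understand it) would need to be set in the model's `config.pbtxt` to match the GPU count requested here.

```yaml
# Hypothetical multi-GPU variant of the single-GPU InferenceService from the guide.
# Assumes the predictor runs a Triton image with the FasterTransformer backend,
# and that tensor_para_size in the model's config.pbtxt is set to the same GPU
# count requested below (e.g. tensor_para_size = "2").
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: fastertransformer-multi-gpu   # placeholder name
spec:
  predictor:
    containers:
      - name: triton
        image: <triton-with-fastertransformer-backend>   # placeholder image
        resources:
          limits:
            nvidia.com/gpu: 2   # request 2 GPUs for tensor parallelism
```

Is something along these lines the right direction, or does multi-GPU serving on CoreWeave require additional changes (e.g. pipeline parallelism via `pipeline_para_size`, or node affinity for GPUs on the same host)?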