Add LoraX #2

ghost · 2023-12-10T21:25:13Z

It is worth mentioning https://github.com/predibase/lorax on this list. It was built based on tgi v0.9.4 (when they still had the apache license) to enable dynamic lora adaptor loading on the inference time.

Related - maybe it's also worth mentioning dynamic lora adaptor loading as a feature itself. Two ideas I've seen so far are LoRaX (loading lora adaptor from disk at request time) and S-LoRA (pre-load adaptors in the GPU memory and route the compute at request time).

I can start a PR to add LoRaX to the table.

lapp0 · 2023-12-10T22:47:24Z

Sounds good to me, thanks for contributing!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add LoraX #2

Add LoraX #2

ghost commented Dec 10, 2023 •

edited by ghost

Loading

lapp0 commented Dec 10, 2023

Add LoraX #2

Add LoraX #2

Comments

ghost commented Dec 10, 2023 • edited by ghost Loading

lapp0 commented Dec 10, 2023

ghost commented Dec 10, 2023 •

edited by ghost

Loading