
Transition from Ollama to Hugging Face 🤗 #50

Closed
latekvo wants to merge 6 commits

Conversation

@latekvo latekvo commented Jun 23, 2024

🤗

latekvo commented Jun 26, 2024

Seems to be working, but it requires further testing, and we have to decide what to do with the ollama setup.
There must be a better way than keeping a separate compose file and containers altogether.
We could remove ollama support completely IF - and this is a blocking requirement - we recreate its auto-allocation algorithm.
Currently ollama takes a fixed memory budget from the config and uses it to split the provided model and launch it accordingly.
We'd have to calculate how many layers + how much context we can load at once ourselves, and apply that to the llama.cpp loader (see the sketch below).
Then we can confidently remove the inferior bridged solution ;)
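For reference, a rough sketch of what that calculation could look like with the llama-cpp-python bindings. The helper, memory figures, and model path are illustrative placeholders, not anything already in this repo:

```python
# Rough sketch of the auto-allocation idea, assuming the llama-cpp-python bindings.
# All sizes and the estimator below are illustrative, not ollama's actual heuristic.
from llama_cpp import Llama


def estimate_gpu_layers(model_size_bytes: int, n_layers: int,
                        vram_budget_bytes: int, ctx_reserve_bytes: int) -> int:
    """Guess how many transformer layers fit in the VRAM budget after
    reserving space for the context / KV cache."""
    per_layer = model_size_bytes / n_layers
    usable = max(vram_budget_bytes - ctx_reserve_bytes, 0)
    return min(n_layers, int(usable // per_layer))


# Hypothetical values for a ~4 GiB quantized model on an 8 GiB card.
n_gpu_layers = estimate_gpu_layers(
    model_size_bytes=4 * 1024**3,
    n_layers=32,
    vram_budget_bytes=8 * 1024**3,
    ctx_reserve_bytes=1 * 1024**3,  # keep ~1 GiB free for the KV cache
)

llm = Llama(
    model_path="models/model.gguf",  # placeholder path
    n_gpu_layers=n_gpu_layers,
    n_ctx=4096,
)
```

A static estimate like this would still need to account for the KV cache growing with `n_ctx`, which is where ollama's allocation logic does most of the work.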

latekvo marked this pull request as ready for review June 26, 2024 11:45
latekvo marked this pull request as draft June 30, 2024 18:04
latekvo commented Jun 30, 2024

This modification turned out to require way too much additional work on the user's side, and maintaining two separate systems on our side, since relying solely on bare-bones llama.cpp is not as consistent as ollama.

I'm closing this for now, but it's still up for consideration in future releases.

latekvo closed this Jun 30, 2024