
Switch Ollama distro to use ollama embeddings #904

Open
ashwinb opened this issue Jan 30, 2025 · 3 comments
Labels
good first issue (Good for newcomers)

Comments

ashwinb (Contributor) commented Jan 30, 2025

🚀 Describe the new functionality needed

We should not need sentence-transformers for calculating embeddings if we are using Ollama; there is no reason to have a torch dependency. Specifically, we need to update templates/ollama/run.yaml to point the embedding model to the ollama inference provider.
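
To make the intent concrete, here is a rough sketch of the templates/ollama/run.yaml change, assuming the model-entry schema currently used in the template (the exact field names, and Ollama's all-minilm model tag, are assumptions to verify against the repo):

```yaml
# Sketch of the embedding model entry in templates/ollama/run.yaml.
# Field names are assumed from the existing template schema; verify before use.
models:
  - metadata:
      embedding_dimension: 384            # all-MiniLM-L6-v2 produces 384-dim vectors
    model_id: all-MiniLM-L6-v2            # keep the standard alias used by tests
    provider_id: ollama                   # was: sentence-transformers
    provider_model_id: all-minilm:latest  # Ollama's packaging of the same model
    model_type: embedding
```

With a change along these lines, embedding requests for the alias are routed to the ollama inference provider, so the sentence-transformers provider (and its torch dependency) can be dropped from the distro.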

💡 Why is this needed? What if we don't build it?

Depending on the transformers package is a double-edged sword because of its complexity.

Other thoughts

We need to ensure that the client-sdk tests pass, i.e., that the standard all-MiniLM alias we use everywhere "just works" (see the sketch below). If there is a more standard HuggingFace ID for that model, maybe we should use that instead.
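
For reference, a minimal sketch of the kind of client-sdk check that should keep working after the switch. The call signature and default port are assumptions about the llama-stack-client API; adjust to match the actual test suite:

```python
# Hypothetical smoke test: the all-MiniLM alias should "just work" regardless
# of whether embeddings come from sentence-transformers or ollama.
from llama_stack_client import LlamaStackClient

# base_url/port is an assumption; use whatever the test harness configures.
client = LlamaStackClient(base_url="http://localhost:8321")

response = client.inference.embeddings(
    model_id="all-MiniLM-L6-v2",   # the standard alias used across the tests
    contents=["Hello, world!"],
)
assert len(response.embeddings) == 1
assert len(response.embeddings[0]) == 384  # all-MiniLM-L6-v2 dimension
```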

@ashwinb ashwinb added the good first issue Good for newcomers label Jan 30, 2025
manojks1999 commented

/assign

manojks1999 commented

@ashwinb, can I work on this?

ashwinb (Contributor, Author) commented Jan 30, 2025

Thanks @manojks1999 -- please hop onto the Discord if you need any help.
