Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Decouple inference and indexing #30

Open
dconathan opened this issue Sep 29, 2024 · 2 comments
Open

Decouple inference and indexing #30

dconathan opened this issue Sep 29, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@dconathan
Copy link

We have infra to host the model and I’m looking to just host an “indexing service” that calls the hosted model instead of running inference “locally”. This decouples the “stateless” services (the model) with the stateful services (storing/loading indexes) and has other benefits (ie easier to scale the model to more replicas).

I’d love to reuse byaldi for the indexing logic (have to write my own from scratch otherwise), but the current implementation is pretty tightly coupled.

Do you see this as something byaldi would support, or is this out of scope?

@bclavie
Copy link
Contributor

bclavie commented Oct 3, 2024

I have a bit of cop-out answer: this is something byaldi will eventually support (I'm hoping to do it whenever I have time), but I'm not sure it'll support it very soon. I'll keep this issue open for now as we'll eventually get around to it!

@bclavie bclavie added the enhancement New feature or request label Oct 3, 2024
@jdchawla29
Copy link

Hi, let me know if #33 helps.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants