[Feature]: Multimodal embeddings #338

Open
dberardo-com opened this issue Jan 3, 2025 · 1 comment
@dberardo-com

What problem does the new feature solve?

I would like to use images and PDFs, as well as text, both as input for user queries and as the knowledge base for the embeddings store.

What does the feature do?

Images, PDFs, and/or text documents should be treated as optional columns in source tables.

Users can query with images and/or text, and those queries can return text and/or images as results.

Example:

  • The user creates a table from stock exchange PDF market reports --> images are extracted into an "image" column
  • The user creates a second table from the same stock exchange PDF market reports --> text is extracted into a "text" column
  • The user creates two vectorizers that use different embedding models, which can process text and/or image chunks
  • The user then enters a semantic search query, "I would like to read about Tesla stock data for the year 2015" --> the result comes from the images and/or texts that best match the prompt
  • The user then enters a text generation query, "Which stock performed best in 2024?" --> a result is returned
  • The user then enters a text generation query, "Which stock performed best in 2000?" --> no result is returned, because the reports only cover years after 2010
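
To make the example above more concrete, here is a rough in-memory sketch in Python. It is not this project's API: `Row`, `vectorize`, `search`, the `min_score` cutoff, and the toy vocabulary are all hypothetical, and `toy_embed` only stands in for real text and image embedding models (the "image" is represented by its file name). The similarity cutoff is just one possible way to get the "no result" behaviour from the last bullet.

```python
import math
import re
from dataclasses import dataclass
from typing import Optional

# Toy vocabulary (assumption): a real setup would use learned embedding models.
VOCAB = ["tesla", "apple", "2015", "2024"]


def toy_embed(content: str) -> list[float]:
    """Stand-in for an embedding model: counts vocabulary hits in the input."""
    tokens = re.findall(r"[a-z0-9]+", content.lower())
    return [float(tokens.count(word)) for word in VOCAB]


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


@dataclass
class Row:
    """Source row where both content columns are optional."""
    id: int
    text: Optional[str] = None
    image: Optional[str] = None  # placeholder: file name instead of pixel data


def vectorize(rows, column, embed):
    """Hypothetical vectorizer: embeds one column, skipping rows where it is empty."""
    return [
        (row, embed(getattr(row, column)))
        for row in rows
        if getattr(row, column) is not None
    ]


def search(query, stores, embed_query, min_score=0.5, top_k=3):
    """Rank rows from all stores by similarity; drop anything below min_score."""
    q = embed_query(query)
    hits = [(cosine(q, vec), row) for store in stores for row, vec in store]
    hits = [hit for hit in hits if hit[0] >= min_score]
    hits.sort(key=lambda hit: hit[0], reverse=True)
    return hits[:top_k]


rows = [
    Row(1, text="Tesla stock data for 2015"),
    Row(2, image="tesla_2015_chart.png"),
    Row(3, text="Apple stock report 2024"),
]
text_store = vectorize(rows, "text", toy_embed)
image_store = vectorize(rows, "image", toy_embed)

for score, row in search(
    "I would like to read about Tesla stock data for the year 2015",
    [text_store, image_store],
    toy_embed,
):
    print(row.id, row.text or row.image, round(score, 2))
```

With this toy data, the Tesla query matches both the text row and the image row, while a query about the year 2000 shares nothing with the stored content and comes back empty. A real implementation would need text and image embeddings that live in a shared space (or separate queries per store) for cross-modal matching to be meaningful.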

Implementation challenges

No response

Are you going to work on this feature?

None

@dberardo-com
Author

A quick proof-of-concept analysis suggests that this might already be possible with the current technology stack: https://blog.geomusings.com/2024/07/19/image-similarity-with-pgvector/
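
As a rough idea of what the storage side could look like with pgvector, here is a small Python sketch that only builds the SQL and parameters. It does not reproduce the linked post's code. The table name `report_images`, its columns, and the 512-dimension size are assumptions for illustration; the `%s` placeholders follow the psycopg parameter style, and `<=>` is pgvector's cosine distance operator.

```python
# Hypothetical schema: table and column names are illustrative only.
EMBEDDING_DIM = 512  # assumed size, e.g. a CLIP ViT-B/32 image embedding

CREATE_SQL = f"""
CREATE EXTENSION IF NOT EXISTS vector;
CREATE TABLE IF NOT EXISTS report_images (
    id bigserial PRIMARY KEY,
    image_path text NOT NULL,
    embedding vector({EMBEDDING_DIM})
);
"""


def to_vector_literal(embedding):
    """Format a sequence of numbers as a pgvector text literal, e.g. '[0.1,0.2]'."""
    return "[" + ",".join(str(float(x)) for x in embedding) + "]"


def nearest_images_query(embedding, limit=5):
    """Return (sql, params) for a cosine-distance nearest-neighbour lookup."""
    if len(embedding) != EMBEDDING_DIM:
        raise ValueError(f"expected {EMBEDDING_DIM} dimensions, got {len(embedding)}")
    sql = (
        "SELECT id, image_path, embedding <=> %s::vector AS cosine_distance "
        "FROM report_images ORDER BY cosine_distance LIMIT %s"
    )
    return sql, (to_vector_literal(embedding), limit)


# Example: build the lookup for a placeholder all-zero query embedding.
sql, params = nearest_images_query([0.0] * EMBEDDING_DIM, limit=3)
print(sql)
```

The query embedding would come from whatever image or text model is used; the same lookup works for a text query only if that model embeds text and images into the same space.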
